CLI.DataDownload¶
About:¶
The sina_download_files tool is a command-line interface for downloading various NLP resources from pre-specified URLs. It is a part of the nlptools package and provides options to choose which files to download and to specify a download directory. The tool automatically handles file extraction for zip and tar.gz files.
Usage:¶
Below is the usage information that can be generated by running sina_download_files --help.
sina_download_files [OPTIONS]
Options:¶
-f, --files FILES
Names of the files to download. Available files are: ner, morph, salma_model, salma_tokenizer, glosses_dic, lemma_dic, five_grams, four_grams, three_grams, two_grams.
If no file is specified, all files will be downloaded.
Examples:¶
sina_download_files -f morph ner
This command will download only the `morph` and `ner` files to the default directory.
Note:¶
- Ensure that the text input is appropriately encoded in UTF-8 or compatible formats.
- The script automatically handles the extraction of zip and tar.gz files after downloading.
- Ensure you have the necessary permissions to write to the specified directory.
- The default download directory is based on the operating system and can be obtained using the `get_appdatadir` function.