CLI.DataDownload

About:

The sina_download_files tool is a command-line interface for downloading various NLP resources from pre-specified URLs. It is a part of the nlptools package and provides options to choose which files to download and to specify a download directory. The tool automatically handles file extraction for zip and tar.gz files.

Usage:

Below is the usage information that can be generated by running sina_download_files --help.

sina_download_files [OPTIONS]

Options:

-f, --files FILES
  Names of the files to download. Available files are: ner, morph, salma_model, salma_tokenizer, glosses_dic, lemma_dic, five_grams, four_grams, three_grams, two_grams. 
  If no file is specified, all files will be downloaded.

Examples:

sina_download_files -f morph ner
This command will download only the `morph` and `ner` files to the default directory.

Note:

- Ensure that the text input is appropriately encoded in UTF-8 or compatible formats.
- The script automatically handles the extraction of zip and tar.gz files after downloading.
- Ensure you have the necessary permissions to write to the specified directory.
- The default download directory is based on the operating system and can be obtained using the `get_appdatadir` function.