CLI.utils.arStrip¶
About:¶
The arStrip command offers functionality to strip various elements from Arabic text using the SinaTools arStrip API. It provides flexibility to selectively strip diacritics, small diacritics, shaddah, digits, alif, and special characters.
Usage:¶
Below is the usage information that can be generated by running arStrip -–help.
Usage:
arStrip --text=TEXT [OPTIONS]
arStrip --file "path/to/your/file.txt" [OPTIONS]
Options:
--text TEXT
The Arabic text that needs to be stripped.
--file FILE
File containing text to be stripped.
--diacs BOOL [default=True]
Indicates whether to strip diacritics.
--small_diacs BOOL [default=True]
Indicates whether to strip small diacritics.
--shaddah BOOL [default=True]
Indicates whether to strip shaddah.
--digit BOOL [default=True]
Indicates whether to strip digits.
--alif BOOL [default=True]
Indicates whether to strip alif.
--special_chars BOOL [default=True]
Indicates whether to strip special characters.
Examples:¶
arStrip --text "مُختَبَر سينا لحوسبة اللغة!" --diacs=True --small_diacs=False --shaddah=True --digit=False --alif=False --special_chars=False
arStrip --file "path/to/your/file.txt" --diacs=True --small_diacs=False --shaddah=True --digit=False --alif=False --special_chars=False
CLI.utils.remove_latin¶
About:¶
The remove_latin command removes all Latin characters from the input text.
Usage:¶
Below is the usage information that can be generated by running arStrip -–help.
Usage:
remove_latin --text=TEXT
remove_latin --file "path/to/your/file.txt"
Examples:¶
remove_latin --text --text "123test"
remove_latin --file "path/to/your/file.txt"
CLI.utils.remove_punctuation¶
About:¶
The remove_punctuation command performs delete punctuation marks from the input text.
Usage:¶
Below is the usage information that can be generated by running remove_punctuation --help.
remove_punctuation --text=TEXT
remove_punctuation --file "path/to/your/file.txt"
Examples:¶
remove_punctuation --text "te%s@t...!!?"
remove_punctuation --file "path/to/your/file.txt"