CLI.utils.arStrip

About:

The arStrip command offers functionality to strip various elements from Arabic text using the SinaTools arStrip API. It provides flexibility to selectively strip diacritics, small diacritics, shaddah, digits, alif, and special characters.

Usage:

Below is the usage information that can be generated by running arStrip -–help.

Usage:
    arStrip --text=TEXT [OPTIONS]
    arStrip --file "path/to/your/file.txt" [OPTIONS]
Options:
  --text TEXT
        The Arabic text that needs to be stripped.

  --file FILE
        File containing text to be stripped.

  --diacs BOOL [default=True]
        Indicates whether to strip diacritics.

  --small_diacs BOOL [default=True]
        Indicates whether to strip small diacritics.

  --shaddah BOOL [default=True]
        Indicates whether to strip shaddah.

  --digit BOOL [default=True]
        Indicates whether to strip digits.

  --alif BOOL [default=True]
        Indicates whether to strip alif.

  --special_chars BOOL [default=True]
        Indicates whether to strip special characters.

Examples:

arStrip --text "مُختَبَر سينا لحوسبة اللغة!" --diacs=True --small_diacs=False --shaddah=True --digit=False --alif=False --special_chars=False
arStrip --file "path/to/your/file.txt" --diacs=True --small_diacs=False --shaddah=True --digit=False --alif=False --special_chars=False

CLI.utils.remove_latin

About:

The remove_latin command removes all Latin characters from the input text.

Usage:

Below is the usage information that can be generated by running arStrip -–help.

Usage:
  remove_latin --text=TEXT
  remove_latin --file "path/to/your/file.txt"

Examples:

remove_latin --text --text "123test"  
remove_latin --file "path/to/your/file.txt"

CLI.utils.remove_punctuation

About:

The remove_punctuation command performs delete punctuation marks from the input text.

Usage:

Below is the usage information that can be generated by running remove_punctuation --help.

remove_punctuation --text=TEXT
remove_punctuation --file "path/to/your/file.txt"

Examples:

remove_punctuation --text "te%s@t...!!?"  
remove_punctuation --file "path/to/your/file.txt"