CLI.utils.arStrip

About:

The sina_arStrip command strips selected elements from Arabic text using the SinaTools arStrip utility. Each category (diacritics, small diacritics, shaddah, digits, alif, and special characters) can be switched on or off independently.

Usage:

The usage information below can be displayed by running sina_arStrip --help.

Usage:
    sina_arStrip --text=TEXT [OPTIONS]
    sina_arStrip --file "path/to/your/file.txt" [OPTIONS]
Options:
  --text TEXT
        The Arabic text that needs to be stripped.

  --file FILE
        File containing text to be stripped.

  --diacs BOOL [default=True]
        Indicates whether to strip diacritics.

  --smallDiacs BOOL [default=True]
        Indicates whether to strip small diacritics.

  --shaddah BOOL [default=True]
        Indicates whether to strip shaddah.

  --digit BOOL [default=True]
        Indicates whether to strip digits.

  --alif BOOL [default=True]
        Indicates whether to strip alif.

  --specialChars BOOL [default=True]
        Indicates whether to strip special characters.

Examples:

sina_arStrip --text "مُختَبَر سينا لحوسبة اللغة!" --diacs=True --smallDiacs=False --shaddah=True --digit=False --alif=False --specialChars=False

sina_arStrip --file "path/to/your/file.txt" --diacs=True --smallDiacs=False --shaddah=True --digit=False --alif=False --specialChars=False
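To make the effect of the boolean flags more concrete, here is a minimal Python sketch of selective stripping. It is not the SinaTools implementation: the function name strip_sketch and the character ranges are assumptions chosen for illustration, and only three of the six categories (diacritics, shaddah, digits) are modelled.

```python
import re

# Assumption: simplified character sets for illustration only;
# the sets used by SinaTools arStrip may differ.
DIACRITICS = re.compile("[\u064B-\u0650\u0652]")  # tanween, fatha, damma, kasra, sukun
SHADDAH = re.compile("\u0651")                    # shaddah only
DIGITS = re.compile("[0-9\u0660-\u0669]")         # ASCII and Arabic-Indic digits


def strip_sketch(text, diacs=True, shaddah=True, digit=True):
    """Hypothetical helper: remove each enabled category from text."""
    if diacs:
        text = DIACRITICS.sub("", text)
    if shaddah:
        text = SHADDAH.sub("", text)
    if digit:
        text = DIGITS.sub("", text)
    return text


if __name__ == "__main__":
    # Strip diacritics and shaddah but keep digits.
    print(strip_sketch("مُختَبَر سينا 2023", digit=False))
```

Because each category is controlled by its own flag, disabling one (for example shaddah=False) leaves those characters untouched while the others are still removed.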

Note:

- This tool is designed for Arabic text; the elements it strips are specific to Arabic script.
- Ensure that the input text or file is encoded in UTF-8 or a compatible encoding.
- Stripping certain elements, such as diacritics, can change the meaning or readability of the text, so enable only the options you need.
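As a precaution related to the encoding note above, a file can be checked for valid UTF-8 before it is passed to --file. The following is a small sketch using only the Python standard library; the helper name is_utf8 is hypothetical and is not part of SinaTools.

```python
from pathlib import Path


def is_utf8(path):
    """Hypothetical helper: return True if the file decodes cleanly as UTF-8."""
    try:
        Path(path).read_bytes().decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True
```

A file that fails this check may have been saved in a legacy Arabic encoding such as Windows-1256 and would need to be converted first.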