问题
I'm in the process of assessing the capabilities of the NLTK in processing Arabic text in a research to analyze and extract sentiments.
Question is as follows:
- Is the NTLK capable of handling and allows the analysis of Arabic text?
- Is python capable of manipulating\tokenizing Arabic text?
- Will I be able to parse and store Arabic text using Python?
If python and NTLK aren't the tools for this job, what tools would you recommend (if existent)?
Thank you.
EDIT
Based on research:
- NTLK is only capable of stemming Arabic text: Link
- Python is capable of handling Arabic text since it supports UTF-8 unicode: Link
- Parsing and Lemmatization of Arabic text can be done using: SNLPG (The Stanford Natural Language Processing Group) Statistical Parser: Link
回答1:
A simple google search lead to these links:
Arabic Natural Language Processing
Using Python with the Quranic Arabic Corpus
HOWTO: Working with Python, Unicode, and Arabic
Are any of these useful?
来源:https://stackoverflow.com/questions/7386856/python-arabic-nlp