An open API service providing repository metadata for many open source software ecosystems.

GitHub / YassirMatrane / arabicTextClassification

After collecting 40 thousand tweets and preprocessing it, I used word embeddings with arabert and tf-idf along with two neural network architectures and 5 machine learning algorithms. Due to the huge size of the dataset, I chose Amazon SageMaker to train the models

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/YassirMatrane%2FarabicTextClassification
PURL: pkg:github/YassirMatrane/arabicTextClassification

Stars: 1
Forks: 2
Open issues: 0

License: None
Language: Jupyter Notebook
Size: 3.09 MB
Dependencies parsed at: Pending

Created at: over 4 years ago
Updated at: over 2 years ago
Pushed at: over 4 years ago
Last synced at: about 2 years ago

Topics: arabert, flask, machine-learning-algorithms, n-gram, nlp, python, rnn, tf-idf

    Loading...