Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / facebookresearch / stopes
A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/facebookresearch%2Fstopes
Stars: 237
Forks: 37
Open Issues: 30
License: MIT
Language: Python
Repo Size: 4.64 MB
Dependencies: 3,506
Created: about 2 years ago
Updated: about 1 month ago
Last pushed: 6 months ago
Last synced: about 1 month ago
Topics: dataset, dataset-generation, machine-learning, machine-translation, machine-translation-data-processing, nmt, translation
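The JSON API URL listed above percent-encodes the slash in the repository's full name (`facebookresearch%2Fstopes`). The following is a minimal sketch of how such a URL could be built and fetched from Python using only the standard library. The helper names `repository_url` and `fetch_repository` are hypothetical, the `API_ROOT` prefix is simply taken from the URL above, and the response is assumed to be a JSON object; no particular response fields are assumed.

```python
import json
import urllib.parse
import urllib.request

# Prefix copied from the JSON API URL shown on this page.
API_ROOT = "https://repos.ecosyste.ms/api/v1"


def repository_url(host: str, full_name: str) -> str:
    """Build the repository endpoint URL (hypothetical helper).

    The owner/name slash is percent-encoded (safe="") so the full name
    stays a single path segment, matching the URL shown on this page.
    """
    encoded = urllib.parse.quote(full_name, safe="")
    return f"{API_ROOT}/hosts/{host}/repositories/{encoded}"


def fetch_repository(host: str, full_name: str, timeout: float = 10.0) -> dict:
    """Fetch and decode the repository metadata (hypothetical helper).

    Assumes the endpoint returns a JSON object; requires network access.
    """
    url = repository_url(host, full_name)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_repository("GitHub", "facebookresearch/stopes")` would request the URL listed above; inspecting the keys of the returned dictionary is a reasonable first step, since the exact fields are not documented on this page.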
Dependencies
- docusaurus ^1.14.7
- @fortawesome/fontawesome-svg-core ^6.2.1
- @testing-library/jest-dom ^5.16.5
- @testing-library/react ^13.4.0
- @testing-library/user-event ^13.5.0
- @types/jest ^29.2.3
- @types/node ^18.11.9
- @types/react ^18.0.25
- @types/react-dom ^18.0.9
- bootstrap ^5.2.2
- crypto-hash ^2.0.1
- react ^18.2.0
- react-bootstrap ^2.5.0
- react-dom ^18.2.0
- react-hash-string ^1.0.0
- react-icons ^4.6.0
- react-promise-tracker ^2.1.0
- react-router-dom ^6.4.3
- react-scripts 5.0.1
- react-spinners ^0.13.6
- styled-components ^5.3.6
- typescript ^4.9.3
- use-file-picker ^1.5.1
- wavesurfer-react ^2.2.2
- wavesurfer.js ^6.3.0
- web-vitals ^2.1.4
- POT ==0.8.2
- fairseq2 ==0.1.1
- laser-encoders ==0.0.1
- numpy *
- pandas *
- sacrebleu *
- scikit-learn *
- sentence_transformers ==2.2.2
- sonar-space ==0.1.0
- transformers ==4.31.0
- unbabel-comet ==2.2.0
- numpy *
- pandas *
- sacrebleu *
- scikit-learn *
- fastapi ==0.85.1
- fire ==0.4.0
- mosestokenizer *
- omegaconf ==2.1.0
- onnxruntime *
- python-dotenv *
- python-multipart ==0.0.5
- uvicorn ==0.19.0
- @babel/eslint-parser ^7.18.2 development
- eslint ^8.16.0 development
- eslint-config-airbnb ^19.0.4 development
- eslint-config-prettier ^8.5.0 development
- eslint-plugin-header ^3.1.1 development
- eslint-plugin-import ^2.26.0 development
- eslint-plugin-jsx-a11y ^6.5.1 development
- eslint-plugin-react ^7.30.0 development
- eslint-plugin-react-hooks ^4.5.0 development
- prettier ^2.6.2 development
- stylelint ^14.8.5 development
- @docusaurus/core ^2.0.0-beta.22
- @docusaurus/plugin-google-gtag ^2.0.0-beta.22
- @docusaurus/preset-classic ^2.0.0-beta.22
- @mdx-js/react ^1.6.22
- clsx ^1.1.1
- react ^17.0.2
- react-dom ^17.0.2
- faiss-gpu *
- hydra-core >=1.2.0
- numpy *
- sentence-transformers *
- submitit *
- torch *
- tqdm *
- botok *
- emoji *
- fasttext *
- hydra-core >=1.2.0
- indic-nlp-library *
- khmer-nltk *
- laonlp *
- omegaconf ==2.1.1
- pythainlp *
- python-crfsuite *
- sentence_splitter *
- wandb *
- xxhash *
- actions/checkout v2 composite
- actions/setup-node v3 composite
- actions/checkout v2 composite
- actions/setup-python v2 composite
- actions/checkout v2 composite
- actions/setup-node v3 composite
- joblib *
- matplotlib ==3.5.3
- numpy *
- pandas *
- scipy ==1.9.1
- sentencepiece *
- statsmodels ==0.13.2
- tqdm *
- matplotlib ==3.5.3
- numpy ==1.23.2
- pandas ==1.5.1
- sacrebleu ==2.3.1
- scikit-learn ==1.1.2
- scipy ==1.9.1
- seaborn ==0.12.0
- sentence-transformers ==2.2.2
- statsmodels ==0.13.5
- torch ==1.12.1
- transformers ==4.22.0
- unbabel-comet ==1.1.3
- hydra-core >=1.2.0
- joblib *
- posix_ipc *
- submitit >=1.4.5
- tqdm *
- prettier 2.8.0 development
- prettier 2.8.0 development