Topic: "huggingface-datasets"
grok-ai/nn-template
Generic template to bootstrap your PyTorch project.
Language: Python - Size: 2.68 MB - Last synced at: 12 days ago - Pushed at: about 2 years ago - Stars: 647 - Forks: 69
lukehinds/deepfabric
Train Model Behavior in Agentic Systems
Language: Python - Size: 23.4 MB - Last synced at: about 16 hours ago - Pushed at: about 18 hours ago - Stars: 646 - Forks: 45
xlang-ai/UnifiedSKG
[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models
Language: Python - Size: 21 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 557 - Forks: 60
AI-Northstar-Tech/vector-io
Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, backup, re-embed (using any model) or access your vector data from any vector databases or repository.
Language: Jupyter Notebook - Size: 4.39 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 263 - Forks: 29
defeat-beta/defeatbeta-api
An open-source alternative to Yahoo Finance's market data APIs with higher reliability.
Language: Python - Size: 3.91 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 259 - Forks: 17
autogluon/fev
Forecast evaluation library
Language: Python - Size: 1.71 MB - Last synced at: 4 days ago - Pushed at: 11 days ago - Stars: 127 - Forks: 10
BirkhoffG/jax-dataloader
Pytorch-like dataloaders for JAX.
Language: Jupyter Notebook - Size: 1.02 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 94 - Forks: 3
BUAADreamer/Chinese-LLaVA-Med
中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine
Language: Python - Size: 2.26 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 93 - Forks: 6
vTuanpham/Large_dataset_translator
Translate large dataset to any language with google translation api and multithreads processing, no key required!
Language: Python - Size: 146 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 72 - Forks: 23
onesuper/HuggingFace-Datasets-Text-Quality-Analysis
Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in dataset using pandas
Language: Python - Size: 415 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 53 - Forks: 3
git-lfs-fuse/git-lfs-fuse
Mount remote repositories, models and datasets managed by Git LFS locally.
Language: Go - Size: 253 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 50 - Forks: 3
BUAADreamer/MLLM-Finetuning-Demo
使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory
Language: Python - Size: 61.5 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 49 - Forks: 2
xieincz/huggingface-go
huggingface-go : 高速下载 huggingface 的模型和数据集
Language: Go - Size: 28.3 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 45 - Forks: 6
raidionics/AeroPath
:hugs: AeroPath: An airway segmentation benchmark dataset with challenging pathology
Language: Jupyter Notebook - Size: 324 KB - Last synced at: 24 days ago - Pushed at: 3 months ago - Stars: 40 - Forks: 6
TirendazAcademy/Hugging-Face-Tutorials
Getting started with Hugging Face
Language: Jupyter Notebook - Size: 12.3 MB - Last synced at: 7 months ago - Pushed at: 8 months ago - Stars: 37 - Forks: 10
daspartho/predict-subreddit
NLP model that predicts subreddit based on the title of a post
Language: Jupyter Notebook - Size: 816 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 32 - Forks: 6
SapienzaNLP/ita-bench
A collection of Italian benchmarks for LLM evaluation
Language: Python - Size: 731 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 30 - Forks: 1
huggingface/pyspark_huggingface
PySpark custom data source for Hugging Face Datasets
Language: Python - Size: 219 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 17 - Forks: 5
mrcabbage972/simple-toolformer
A Python implementation of Toolformer using Huggingface Transformers
Language: Python - Size: 72.3 KB - Last synced at: 22 days ago - Pushed at: over 2 years ago - Stars: 14 - Forks: 2
DSYZayn/gopeed-extension-huggingface
A gopeed-extension for downloading models and datasets from huggingface, hf-mirror and modelscope. Huggingface download
Language: JavaScript - Size: 1.57 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 13 - Forks: 1
abhi9ab/DeepSeek-R1-Distill-Llama-8B-finance-v1
Finetuned Deepseek 8b model for finance reasoning
Language: Jupyter Notebook - Size: 29.3 KB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 13 - Forks: 1
songys/Korean-HF-datasets-catalog
Detection and automatic updating of Korean datasets uploaded to Hugging Face
Language: Python - Size: 8.42 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 12 - Forks: 1
shunk031/huggingface-datasets_JGLUE
JGLUE: Japanese General Language Understanding Evaluation for huggingface datasets
Language: Python - Size: 464 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 12 - Forks: 3
SmithaUpadhyaya/fashion_image_caption
Automate Fashion Image Captioning using BLIP-2. Automatic generating descriptions of clothes on shopping websites, which can help customers without fashion knowledge to better understand the features (attributes, style, functionality etc.) of the items and increase online sales by enticing more customers.
Language: Jupyter Notebook - Size: 26.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 12 - Forks: 1
npuichigo/tarzan
High-level API for tar-based dataset
Language: Python - Size: 27.3 KB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 1
acrion/ditana-assistant
Ditana Assistant: AI-powered CLI/GUI tool for intelligent assistance, leveraging LLMs with OS interaction capabilities and context augmentation, optionally via Wolfram|Alpha
Language: Python - Size: 834 KB - Last synced at: 7 months ago - Pushed at: 8 months ago - Stars: 10 - Forks: 0
antoinejeannot/jurisprudence
French Jurisprudences at your fingertips @ every 72h
Language: Python - Size: 146 KB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 9 - Forks: 2
balnarendrasapa/road-detection
This is a course project for DSCI-6011 - Deep Learning. deals with Drivable Area and lane segmentation for self driving cars
Language: Jupyter Notebook - Size: 181 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 1
nirmal2i43a5/AI-Powered-Biomedical-NER-with-BioBERT
This project applies Fine-tuning BERT & BioBERT on BC5CDR for biomedical named entity recognition (diseases + chemicals).
Language: Jupyter Notebook - Size: 1.93 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 8 - Forks: 0
vincentkoc/tiny_qa_benchmark_pp
Tiny QA Benchmark++ a micro-benchmark suite (52-item gold + on-demand multilingual synthetic packs), generator CLI, and CI-ready eval harness for ultra-fast LLM smoke-testing & regression-catching.
Language: Python - Size: 310 KB - Last synced at: 18 days ago - Pushed at: 2 months ago - Stars: 8 - Forks: 0
daspartho/bored-ape-diffusion
diffusion model for unconditional image generation of Bored Apes
Language: Jupyter Notebook - Size: 892 KB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 8 - Forks: 0
The-Data-Dilemma/ParquetToHuggingFace
ParquetToHuggingFace processes raw audio data, converts it into Parquet files, and uploads them to Hugging Face. The README explains how to set up the environment, configure paths, and run the scripts to generate and upload the data.
Language: Python - Size: 2.85 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 2
hearmeneigh/e621-rising-configs
Configuration files for building E621-Rising v3 SDXL model and dataset
Language: Shell - Size: 130 KB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 0
The-Data-Dilemma/Medibeng-Orpheus-3b-0.1-ft-Fine-Tuning
Medibeng-Orpheus-3b-0.1-ft- A TTS model for bilingual Bengali-English code-switching in healthcare, fine-tuned for seamless patient-doctor interactions.
Language: Python - Size: 939 KB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 5 - Forks: 1
shunk031/cookiecutter-huggingface-datasets
cookiecutter for huggingface datasets
Language: Python - Size: 38.1 KB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0
neverbiasu/hf-mirror-hub
一个从 Hugging Face 镜像站点快速下载模型和数据集的命令行工具。
Language: Python - Size: 20.5 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0
shunk031/huggingface-datasets_MSCOCO
Microsoft COCO: Common Objects in Context for huggingface datasets
Language: Python - Size: 176 KB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0
ItzCrazyKns/Dataset-Converter
A Python script for converting URL-based datasets into image datasets.
Language: Python - Size: 1000 Bytes - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 1
shunk031/huggingface-datasets_COCOA
COCOA: Semantic Amodal Segmentation for huggingface datasets
Language: Python - Size: 75.2 KB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0
shunk031/huggingface-datasets_wrime
WRIME for huggingface datasets
Language: Python - Size: 51.8 KB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0
abhi9ab/DeepSeek-R1-Distill-Qwen-1.5B-finance-v1
Finetuned Deepseek 1.5b model for finance reasoning
Language: Jupyter Notebook - Size: 33.2 KB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0
Jpzinn654/qa-portuguese-v1
This is a split 500 thousands rows of a dataset from hugging face in portuguese to train NLP's for Question-and-Answering
Language: Python - Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0
louisbrulenaudet/legalkit-pipeline
Publication pipeline for French legal codes on 🤗 Datasets from LegiFrance with concurrent upload and dynamic REAMDE.md.
Language: Python - Size: 51.8 KB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 3 - Forks: 1
adamelkholyy/whisper-yt
Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles
Language: Python - Size: 79.1 KB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0
sileod/metaeval
Collection of tasks for meta-learning and extreme multitask learning
Language: Python - Size: 33.2 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0
Suyashkb/Customer-Support-Chatbot
This repo contains a fine-tuned version of DialoGPT (Conversational model of GPT-2) explicitly for customer support chatbots.
Language: Python - Size: 7.81 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0
e-hossam96/arabic-nano-gpt
Arabic Nano GPT Trained on Arabic Wikipedia Dataset from Wikimedia
Language: Jupyter Notebook - Size: 1.65 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0
mahikshith/Transformer-Text-Summarizer-Fine-tuning-with-ETL-pipeline-and-Deployment
Fine tuning pre-trained transformer model for custom text summarization with ETL pipeline and end to end deployment
Language: Jupyter Notebook - Size: 116 KB - Last synced at: 7 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0
creative-graphic-design/huggingface-datasets_Rico
Rico: A Mobile App Dataset for Building Data-Driven Design Applications for huggingface datasets
Language: Python - Size: 88.9 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0
michelecafagna26/HL-dataset
[INLG2023] The High-Level (HL) dataset is a Vision and Language (V&L) resource aligning object-centric descriptions from COCO with high-level descriptions crowdsourced along 3 axes: scene, action, rationale.
Size: 5.67 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0
creative-graphic-design/huggingface-datasets_Magazine
Magazine dataset from Content-aware Generative Modeling of Graphic Design Layouts for huggingface datasets
Language: Python - Size: 46.9 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0
shunk031/huggingface-datasets_livedoor-news-corpus
Japanese Livedoor news corpus for huggingface datasets
Language: Python - Size: 82 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0
wsobanski/scraper-tvp
Scraping large amount of articles for transformer training.
Language: Python - Size: 57.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0
creative-graphic-design/huggingface-datasets_CAMERA
CAMERA (CyberAgent Multimodal Evaluation for Ad Text GeneRAtion) for huggingface datasets
Language: Python - Size: 50.8 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0
davanstrien/hugit-cli
push ImageFolder style image datasets to the 🤗 Hub from the command line
Language: Python - Size: 946 KB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0
alcazar90/rock-glacier-detection
Proyecto curso MDS7201-1, en conjunto con el Centro de Modelamiento Matemático (CMM)
Language: Jupyter Notebook - Size: 462 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0
shunk031/huggingface-datasets_jsnli
JSNLI (Japanese SNLI) dataset for huggingface datasets
Language: Python - Size: 58.6 KB - Last synced at: 8 months ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0
joe0731/hf_vram_calc
A CLI tool for estimating GPU VRAM requirements for Hugging Face models, supporting various data types, parallelization strategies, and fine-tuning scenarios like LoRA.
Language: Python - Size: 232 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 2
rolim520/Nine-Tiles-Panic-Solver
Exhaustive solver for the board game Nine Tiles Panic. This project generates and analyzes all 2.9 billion valid layouts using Python & DuckDB to find the single optimal solution for every scoring combination. Features an interactive web visualizer.
Language: Python - Size: 39.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0
SeekAI-786/Medi_Bot_CustomGPT
I and my team member has created this MediBot Fine Tune on Medical Question/Answer Dataset From Hugging Face
Language: Jupyter Notebook - Size: 7.78 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0
redis-performance/vector-embeddings
Complete pipeline for generating DBpedia text embeddings using OpenAI's embedding models and publishing them as Hugging Face datasets.
Language: Python - Size: 22.5 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0
Md-Emon-Hasan/Fine-Tuning
End-to-end fine-tuning of Hugging Face models using LoRA, QLoRA, quantization, and PEFT techniques. Optimized for low-memory with efficient model deployment
Language: Jupyter Notebook - Size: 5.53 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0
vishvaRam/Fine-Tune-Qwen2.5
This repository provides resources and instructions for fine-tuning the Qwen2.5-0.5B model. It includes scripts, tips, and best practices to adapt the model for specific tasks or domains. Designed for researchers and developers, it simplifies the fine-tuning process to achieve optimal performance and accuracy.
Language: Python - Size: 44.9 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0
Arya920/Natural_Language_To_SQL_Queries
The task of this project is to Convert Natural Language to SQL Queries
Language: Jupyter Notebook - Size: 9.33 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0
Dhanush-R-git/MH-Analysis
The MHRoberta is Mental Health Roberta model. The pretrained Roberta transformer based model fine-tunned on Mental Health dataset by adopting PEFT method.
Language: Jupyter Notebook - Size: 3.67 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0
yliuhz/hf-download
Language: Python - Size: 119 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0
creative-graphic-design/huggingface-datasets_PKU-PosterLayout
PKU-PosterLayout for huggingface datasets
Language: Python - Size: 300 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0
ShawonAshraf/bangla-math-chat
A math dataset for fine-tuning LLMs to chat on math problems in Bangla
Language: Python - Size: 50.8 KB - Last synced at: 22 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0
canstralian/Transformers-Fine-Tuner
Language: Python - Size: 61.5 KB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0
developer0hye/hugging-face-image-ocr-dataset-upload-example
One of the Hugging Face Image Dataset Upload Guides
Language: Python - Size: 635 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0
dhanushpittala11/SummarizerText_Hf_End2End_1
This is a Text Summarization web application using Huggingface models finetuned on a custom dataset. This project focuses on building an end-to-end pipeline for data ingestion, data transformation, model training ,model evaluation, prediction and API integration, hosting it on the web.
Language: Jupyter Notebook - Size: 11.2 MB - Last synced at: 7 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0
EkBass/fin-eng-translations-set
Massive translation set between Finnish and English languages.
Size: 4.19 MB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0
creative-graphic-design/huggingface-datasets_CGL-Dataset-v2
CGL-Dataset v2 for huggingface datasets
Language: Python - Size: 105 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0
creative-graphic-design/huggingface-datasets_PubLayNet
PubLayNet for huggingface datasets
Language: Python - Size: 113 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0
eljandoubi/Copilot
Homemade Copilot: Fine-tune LLM through LoftQ initialization and QLoRA-style training for code generation.
Language: Jupyter Notebook - Size: 241 KB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0
eljandoubi/huggingface_image_classifier
Fine-tune the Vision Transformer (ViT) using LoRA and Optuna for hyperparameter search.
Language: Python - Size: 47.9 KB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0
morikaglobal/finetune_bert_model
Fine-tuning pretrained BERT model for sentiment analysis (text classification)
Language: Jupyter Notebook - Size: 47.9 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0
amk9978/HuggingAnalyser
HuggingFace text generation and text classification models and related spaces analyser written in Python
Language: Python - Size: 283 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0
semaj87/summarise-dialogue-flan-t5
Performing the task of dialogue summarisation using Generative AI, whilst comparing the effects of zero shot, one shot and few shot prompt engineering. These steps are used to enhance the completion of Large Language Models (LLMs))
Language: Jupyter Notebook - Size: 278 KB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0
creative-graphic-design/huggingface-datasets_PosterErase
PosterErase for huggingface datasets
Language: Python - Size: 47.9 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0
cx0/neurips-llm-efficiency
My submission for NeurIPS 2023 LLM Efficiency Challenge.
Size: 17.6 KB - Last synced at: 20 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0
shunk031/huggingface-datasets_DrawBench
DrawBench for huggingface datasets
Language: Python - Size: 43 KB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0
sankalpie/PiNacle-Coder-v2
The PiNacle Coder site with updated API
Language: Jupyter Notebook - Size: 495 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0
ksgr5566/AutoTuneNLP
A comprehensive toolkit for seamless data generation and fine-tuning of NLP models, all conveniently packed into a single block.
Language: Python - Size: 231 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0
anujsahani01/English-Marathi-Translation
Fine-tuned and compared 3 🤗 pre-trained Multilingual LLMs
Language: Jupyter Notebook - Size: 333 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0
Netruk44/repo-search
Search for code by what it does in natural language, using machine learning embeddings.
Language: Python - Size: 82 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1
NusretOzates/hf-datasets-server-py
Python SDK to use HuggingFace's datasets server API
Language: Python - Size: 28.3 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0
shunk031/tango-jglue-benchmarks
Reproducible implementation using ai2-tango for JGLUE, Japanese benchmark
Language: Jsonnet - Size: 149 KB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0
alcazar90/cell-segmentation
A fine-tuning script for training an image segmentation model (SegFormer) on a cell dataset. The workflow involves creating a hugging face dataset repository and tracking your experiments with W&B.
Language: Python - Size: 751 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0
Living-with-machines/ai4lam-huggingface-datasets-demo
datasets demo for ai4lam
Language: Jupyter Notebook - Size: 423 KB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0
daspartho/depression-detector
text classifier to detect depression
Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0
Hugging-Face-Supporter/datacards
Find Hugging face datasets that are missing tags. Then Help to fill then in; one-by-one
Language: Python - Size: 104 KB - Last synced at: 10 days ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1
LLFELIPEVV/sistema-fake-news-ia
Sistema autónomo para la detección de noticias falsas en español usando inteligencia artificial, NLP y machine learning. Incluye análisis de datasets, entrenamiento de modelos tradicionales, profundos y Transformers, con API en FastAPI e interfaz web en React.
Size: 310 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0
Ashu708907/Music-Genre-Classification-using-Spectrogram-images
🎵 Classify music genres by analyzing spectrogram images with machine learning and deep learning methods for robust and interpretable predictions.
Language: Jupyter Notebook - Size: 6.94 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0
Gift-Ojeabulu/fasttext-language-detection
Simple multilingual text detection using FastText and Hugging Face datasets. Production-ready Python library with real-world examples.
Language: Python - Size: 8.79 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0
paulohl/hugging_face_diffusers_manuscript
Hugging Face Diffusers 🤗 Library book repo *BP Publishers
Language: TeX - Size: 109 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
shashwatpasari/Music-Genre-Classification-using-Spectrogram-images
This repository provides a comprehensive suite of machine learning and deep learning approaches for hierarchical music genre classification using spectrogram images. It includes models built with EfficientNet, Audio Spectrogram Transformer (AST), Custom CNN architectures, and traditional machine learning pipelines.
Language: Jupyter Notebook - Size: 14.7 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
ianjure/philnet-scraper
Scraper for collecting legitimate and phishing records used in PhiLNet's periodic retraining.
Language: Python - Size: 2.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
approximated-intelligence/embedding-distillation
Train a pooling Head for Dense Embeddings
Language: Python - Size: 105 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0
nikitabugrovsky/hf-vector-pipeline
Build datasets for Hugging Face automatically
Language: Python - Size: 17.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0