An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: nvidia-nemo

GoogleCloudPlatform/nvidia-nemo-on-gke

Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine

Language: HCL - Size: 964 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 12 - Forks: 7

bunyaminergen/WavLMMSDD

This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from NVIDIA.

Language: Jupyter Notebook - Size: 1.8 MB - Last synced at: 13 days ago - Pushed at: 2 months ago - Stars: 6 - Forks: 3

cr4yfish/nouv

Free AI & Community powered Learning Experience

Language: TypeScript - Size: 1.41 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 41 - Forks: 0

j3soon/LLM-Tutorial

LLM tutorial materials, including but not limited to NVIDIA NeMo, TensorRT-LLM, Triton Inference Server, and NeMo Guardrails.

Language: Jupyter Notebook - Size: 3.07 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 2 - Forks: 1

HROlive/Poland-End-To-End-LLM-Bootcamp

This bootcamp is designed to give NLP researchers an end-to-end overview of the fundamentals of the NVIDIA NeMo framework, a complete solution for building large language models. It also includes hands-on exercises complemented by tutorials, code snippets, and presentations to help researchers get started with the NeMo LLM Service and Guardrails.

Language: Jupyter Notebook - Size: 20.6 MB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

GameOfPods/PAT

PodcastProject Analytics Toolkit - Project that creates analytics from various input data. Exported data is intended to be used in a PodcastProject website.

Language: Python - Size: 150 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

transiteration/stt_kz_quartznet15x5

Implementation of a Kazakh speech-to-text model using the NVIDIA NeMo toolkit for efficient transcription of spoken Kazakh into text.

Language: Python - Size: 55.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Rumeysakeskin/Turkish-Text-to-Speech

Speech synthesis (TTS) in low-resource languages by training from scratch with FastPitch and fine-tuning with HiFi-GAN

Language: Python - Size: 8.84 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 21 - Forks: 3

Rumeysakeskin/ASR-Quantization

Post-training quantization of an NVIDIA NeMo ASR model

Language: Jupyter Notebook - Size: 32.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

InfiniteHelios/nemo-audio-profanity-detector-app

Audio profanity detector desktop app developed with PyQt5 using NVIDIA NeMo.

Language: Python - Size: 238 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

aaaastark/NeMo-WeightsBiases-TTS

Training and tuning a text-to-speech model with NVIDIA NeMo and Weights & Biases

Language: Jupyter Notebook - Size: 5.04 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Rumeysakeskin/NLP-Onnx-TensorRT

Joint Intent/Slot Classification for Jetson Nano, TX1/TX2, Xavier NX, and AGX Xavier

Language: Jupyter Notebook - Size: 146 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

ssharkov03/ru-speech-recognition

Module for Russian speech recognition using NVIDIA NeMo.

Language: Python - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Rumeysakeskin/ASR-fine-tuning-for-low-resource-languages

Transfer learning for ASR with subword encoding CTC model (NVIDIA NeMo Citrinet) on low-resource languages

Language: Jupyter Notebook - Size: 455 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

denizariyan/Real-Time-Auto-Transcriber

Automatic transcriber made with the NVIDIA NeMo AI toolkit. Transcribes speech to text in real time from any source. Requires a CUDA-capable GPU on the local machine. If set up with virtual audio cables, it can transcribe audio that is being played in real time without any other requirements.

Language: Python - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

JINHXu/tutorial-speaker-identification-with-nemo

The simplest & most comprehensible tutorial on speaker identification with NVIDIA's `Nemo`.

Language: Python - Size: 176 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 2