An open API service providing repository metadata for many open source software ecosystems.

Topic: "flan-t5"

xorbitsai/inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Language: Python - Size: 47.1 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 8,482 - Forks: 735

georgian-io/LLM-Finetuning-Toolkit

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

Language: Python - Size: 32.7 MB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 854 - Forks: 101

declare-lab/flan-alpaca

This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as Flan-T5.

Language: Python - Size: 33.2 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 352 - Forks: 38

zorazrw/filco

[Preprint] Learning to Filter Context for Retrieval-Augmented Generaton

Language: Python - Size: 44.9 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 194 - Forks: 20

HamedBabaei/LLMs4OL

LLMs4OL:‌ Large Language Models for Ontology Learning

Language: Python - Size: 38.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 66 - Forks: 7

yyDing1/GNER

[ACL-24 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"

Language: Python - Size: 4.69 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 55 - Forks: 2

rpidanny/llm-prompt-templates

Empower your LLM to do more than you ever thought possible with these state-of-the-art prompt templates.

Language: TypeScript - Size: 641 KB - Last synced at: 6 days ago - Pushed at: 12 months ago - Stars: 42 - Forks: 2

zeyadusf/LLMs-from-Scratch

Build a Large Language Model (From Scratch) book and Finetuned Models

Language: Jupyter Notebook - Size: 8.22 MB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 41 - Forks: 20

terjanq/hack-a-prompt

Tools and our test data developed for the HackAPrompt 2023 competition

Language: HTML - Size: 1.89 MB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 32 - Forks: 5

replicate/getting-started-nextjs-language

A template Next.js app for running language models like FLAN-T5 with Replicate's API

Language: JavaScript - Size: 355 KB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 27 - Forks: 3

Reason-Wang/flan-alpaca-lora

This repository contains the code to train flan t5 with alpaca instructions and low rank adaptation.

Language: Python - Size: 6.71 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 23 - Forks: 3

M-Taghizadeh/flan-t5-base-imdb-text-classification

In this implementation, using the Flan T5 large language model, we performed the Text Classification task on the IMDB dataset and obtained a very good accuracy of 93%.

Language: Jupyter Notebook - Size: 42 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 2

CodeName-Detective/Prompt-to-Song-Generation-using-Large-Language-Models

This project uses LLMs to generate music from text by understanding prompts, creating lyrics, determining genre, and composing melodies. It harnesses LLM capabilities to create songs based on text inputs through a multi-step approach.

Language: Jupyter Notebook - Size: 57.6 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 1

mltrev23/google-flan-t5-small

Training and fine-tuning flan-t5-small model based on provided text

Language: Python - Size: 14.6 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 15 - Forks: 0

dylan-slack/Tablet

The TABLET benchmark for evaluating instruction learning with LLMs for tabular prediction.

Language: Python - Size: 41.3 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 1

nicolay-r/THOR-ECAC

The official fork of THoR Chain-of-Thought framework, enhanced and adapted for Emotion Cause Analysis (ECAC-2024)

Language: Python - Size: 5.15 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 12 - Forks: 0

VanekPetr/flan-t5-text-classifier

Fine-tuning of Flan-5T LLM for text classification

Language: Python - Size: 9.69 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 3

mltrev23/flan-t5-fine-tune

Flan-t5 model fine tune LoRA and Langchain

Language: Python - Size: 34.2 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 0

arasgungore/Generative-AI-with-LLMs

My solutions to the lab assignments in the Generative AI with Large Language Models course offered by Amazon Web Services.

Language: Jupyter Notebook - Size: 44.9 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 1

Himidiri/MergeX

Revolutionizing open-world gaming, MergeX harnesses NLP advances to empower players with limitless dialogue interactions with NPCs. By imbuing each character with a unique biography, conversations authentically align with NPC personalities, transcending traditional limitations.

Language: Jupyter Notebook - Size: 243 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 3

ramendrasingla/RL-LLM-Based-Title-Generator

Struggled with creating catchy, human-like titles using ChatGPT or other LLMs? If you've ever been frustrated by AI-generated titles that lack a human touch, I’ve just published something you’ll find useful. My latest blog explores how Reinforcement Learning (RL) can push LLMs to craft more compelling, attention-grabbing headlines

Language: Python - Size: 374 KB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 3 - Forks: 1

HoangLayor/Finetune-LLMs-for-Sequence-Classification

Finetune Flan-T5 for email classification

Language: Jupyter Notebook - Size: 135 KB - Last synced at: 5 months ago - Pushed at: 12 months ago - Stars: 3 - Forks: 0

sergio11/llm_finetuning_and_evaluation

The LLM FineTuning and Evaluation project 🚀 enhances FLAN-T5 models for tasks like summarizing Spanish news articles 🇪🇸📰. It features detailed notebooks 📚 on fine-tuning and evaluating models to optimize performance for specific applications. 🔍✨

Language: Jupyter Notebook - Size: 499 KB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 1

Kuberwastaken/TREAT-CS50

An Early (Flan T-5) version of An Open-Source AI Web App to Detect Triggering Content in Movies and Shows - Kuber Mehta's CS50x Capstone Project

Language: CSS - Size: 1.58 MB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

himanshuvnm/Foundation-Model-Large-Language-Model-FM-LLM

This repository was commited under the action of executing important tasks on which modern Generative AI concepts are laid on. In particular, we focussed on three coding actions of Large Language Models. Extra and necessary details are given in the README.md file.

Language: Jupyter Notebook - Size: 431 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 2

purang2/Report-Summarization

Summarize Long Document with Pretrained sequence-to-sequence LM with long-range attention!

Size: 145 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

KRR-Oxford/LLMap-Prelim

A preliminary investigation for ontology alignment (OM) with large language models (LLMs).

Language: Python - Size: 20.4 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 1

the-good-fellas/flan-t5-ptbr

Tutorial para treino de um modelo baseado Flan-T5 usando Flax no GCP-TPU

Language: Python - Size: 183 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

utsab1009/AI-HealthDoc-Chatbot-PDF-FAISS-FLAN

A privacy-focused local LLM chatbot for chatting with healthcare PDFs using FAISS + FLAN-T5.

Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

hrao-dev/SumUp

AI-powered text and news summarizer that delivers clear, concise insights from long-form content.

Language: Jupyter Notebook - Size: 172 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

ShubhRanpara/Auto-Filler

This repository contains my team's internship project work at Flexbox Technologies. We have developed a system that fills the patient details form automatically with the patient data extracted from pdf file.

Language: Python - Size: 6.82 MB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

ShubhRanpara/Auto-Filler-Web

This repository contains my internship project work at Flexbox Technologies. I have developed a system that fills the patient details form automatically with the patient data extracted from pdf file.

Language: HTML - Size: 7.26 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

shoryasethia/ConversationSummarizerLLM

Fine Tuning pegasus and flan-t5 pre-trained language model on dialogsum datasets for conversation summarization to to optimize context window in RAG-LLMs

Language: Jupyter Notebook - Size: 38.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ShubhRanpara/ChatPDF

This repository contains one of my cool project which I have created during my college's MINeD hack-a-thon.

Language: Python - Size: 195 KB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

pankajrawat9075/Dialog-Summarization-with-Generative-AI

Using Open-Source LLMs like FLAN-T5, built a Dialog Summarization model and did fine-tuning with DialogSum HF Dataset

Language: Jupyter Notebook - Size: 78.1 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

owaisahmadlone/SQL-LLM

Fintetuned Flan-T5 to provide SQL queries given Natural Language Text. Two steps of finetuning: First on general Natural Language to SQL Queries and second on queries related to a single database

Size: 2.86 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

shishir-dwi/HealPal-Chatmate

The LLM-based medical chatbot, powered by the Llama-2-7b-chat-hf model from Meta and implemented within the Langchain framework, offers personalized healthcare support.

Language: Jupyter Notebook - Size: 277 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

HamedBabaei/author-profiling-pan2023

Symbol Team model for PAN@AP 2023 shared task on Profiling Cryptocurrency Influencers with Few-shot Learning

Language: Python - Size: 1.48 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

rochitasundar/Generative-AI-with-Large-Language-Models

This repository contains the lab work for Coursera course on "Generative AI with Large Language Models".

Language: Jupyter Notebook - Size: 218 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

semaj87/summarise-dialogue-flan-t5

Performing the task of dialogue summarisation using Generative AI, whilst comparing the effects of zero shot, one shot and few shot prompt engineering. These steps are used to enhance the completion of Large Language Models (LLMs))

Language: Jupyter Notebook - Size: 278 KB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

UKPLab/starsem2023-arithmetic-based-pretraining

Code and data for the StarSem 2023 paper "Arithmetic-Based Pretraining -- Improvin Numeracy of Pretrained Language Models"

Language: Julia - Size: 108 MB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

purang2/LLM-NLP-Seminar-PPT-Slides

2023 My NLP PPT Slides (LLM, NLP)

Size: 22.4 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

synkathairo/flan-t5-large-gradio

A gradio frontend for Google's Flan-T5 Large language model, can also be adjusted for other sizes.

Language: Python - Size: 8.79 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 2

Amama-Fatima/pin-persona

AI powered Pinterest persona generator for historical figures

Language: Jupyter Notebook - Size: 1.09 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

shreyas27092004/flan-t5-detoxification-ppo

Fine-Tuning FLAN-T5 for Detoxification with PPO and PEFT

Language: Jupyter Notebook - Size: 34.2 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

shreyas27092004/Generative-AI-Model-Fine-Tuning-Hugging-Face-Transformers-

Fine-tuning a Generative AI model using Hugging Face Transformers. Includes dataset preprocessing, model training, evaluation, and text generation with a custom fine-tuned model.

Language: Jupyter Notebook - Size: 34.2 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

shreyas27092004/Generative-AI-Summarization-with-FLAN-T5

The primary focus of this lab is to explore dialogue summarization using the **FLAN-T5** model from Hugging Face.

Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

nabeelshan78/flanT5-ICL-SFT-PEFT-RLHF

An end-to-end pipeline for adapting FLAN-T5 for dialogue summarization, exploring the full spectrum of modern LLM tuning. Implements and compares Full Fine-Tuning, PEFT (LoRA), and Reinforcement Learning (RLHF) for performance and alignment. Features a PPO-tuned model to reduce toxicity, in-depth analysis notebooks, and interactive Streamlit demo.

Language: Jupyter Notebook - Size: 1.01 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Saeedanbarimoghadam/Cancer-Information-Q-A-with-LLM-and-RAG

Language: Jupyter Notebook - Size: 182 KB - Last synced at: 11 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

huzaifa113/MediRAG

Clinical RAG Assistant using MIMIC-IV-Ext-DiReCT dataset for medical diagnosis. Used FAISS for document retrieval and FLAN T5 for generating answers.

Language: Jupyter Notebook - Size: 1.12 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Daddy-Myth/Flan-T5-rlhf-align

Aligning FLAN-T5 with Reinforcement Learning from Human Feedback (RLHF) for Neutral, Grammatically Correct News Summaries

Language: Jupyter Notebook - Size: 161 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

aunraza19/fine-tuned-flan-t5-legal-summarizer-app

A fine-tuned FLAN-T5 model deployed in a Streamlit app to summarize and analyze legal documents with integrated NER.

Language: Jupyter Notebook - Size: 301 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

SuneshSundarasami/Multilabel-Toxicity-Detection-Using-Classical-RNN-and-Transformer-Architectures

End-to-end ML workflow for multi-label toxic comment detection using NLP. Implements advanced text preprocessing, multi-label vectorization, and models (Logistic Regression, RNNs, Transformers). Includes scripts for data cleaning, training, and per-label metrics.

Language: Jupyter Notebook - Size: 17.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

abdulvahapmutlu/research-agent

A hybrid Research Assistant that combines an exact Knowledge Graph (Neo4j) with a Retrieval‑Augmented Generation pipeline (FAISS + Cross‑Encoder + FLAN‑T5) behind a sleek Streamlit interface.

Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

hisanusman/Violent-activities-detection-and-scene-understanding

This project uses a vision transformer to detect violent activities in CCTV footages, along with a custom YOLOv8 model for weapon detection and pin-pointing of exact person. Scene understanding has been implemented through CLIP, the results of which are used in automatic AI report generation via LangChain (GPT & FLAN-T5).

Language: Jupyter Notebook - Size: 1.04 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

satyampurwar/large-language-models

Unlocking the Power of Generative AI: In-Context Learning, Instruction Fine-Tuning, Reinforcement Learning Fine-Tuning and Retrieval Augmented Generation.

Language: Jupyter Notebook - Size: 180 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Ajairajv/Detoxified-Summaries-with-FLAN-T5-PPO

Fine-tunes FLAN-T5 using Reinforcement Learning (PPO) and PEFT to generate less toxic summaries, leveraging Meta AI's hate speech reward model for detoxification.

Language: HTML - Size: 101 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Ajairajv/Fine-Tuning-a-Generative-AI-Model

Fine-tunes FLAN-T5 for dialogue summarization using full fine-tuning and LoRA-based PEFT, evaluated with ROUGE metrics and human assessment.

Language: Jupyter Notebook - Size: 28.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

fahimabrar/Large-Language-Model

This repository contains Zero, Few shot Inference, Fine Tuning a Large Language Model For Document Summarization, and lot more is coming...

Language: Jupyter Notebook - Size: 52.4 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Nsikan56/Chartbot_chatbot

AI-powered chatbot analyzing 63+ years of Billboard Hot 100 data using FLAN-T5 transformers for NLP + regex fallback. Parses queries like "Top 10 songs of 2010" or "How long was Thriller on the chart?" Built with Streamlit, featuring modern UI with CSS animations

Language: Python - Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

hrao-dev/QuickPulse

AI-Powered News Summarization with Real-Time Sentiment and Topic Insights

Language: Python - Size: 30.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

radhika262/VOXA-A-Sleek-and-Modern-Chatbot

Voxa – A Sleek and Modern Chatbot, is to develop an AI-powered assistant that can handle a wide range of customer queries—whether typed or spoken—in a natural, context-aware manner

Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

shantanumohod23/pdf-chatbot-genai

🤖 A Generative AI chatbot that answers questions from PDFs using FLAN-T5 and sentence-transformers, built with Streamlit.

Language: Python - Size: 5.86 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

darshan-vijay/Plant-Doc

Detects plant diseases from leaf images using a fine-tuned ResNet model, then uses a domain-adapted Flan-T5 language model to answer user questions about the disease, including symptoms and treatments, based on curated knowledge.

Language: Jupyter Notebook - Size: 94.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

MyriamBA/Dialogue-Summarizer

An End-to-End Dialogue Summarization Project using LLMs.

Language: Jupyter Notebook - Size: 74.2 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

eeshan15/FLAN-T5-Model-Implementation

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

RuvenGuna94/Dialogue-Summary-Use-Case

GEN AI use case: dialogue summary. This notebook is extracted from the course Generative AI with Large Language Models. It is used to understand how input text can affect model performance.

Language: Jupyter Notebook - Size: 84 KB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

RuvenGuna94/Notes--Generative-AI-with-Large-Language-Models

Projects and exercises from the 'LangChain for LLM Application Development' course by DeepLearning.AI. Learn to build real-world applications with LangChain.

Size: 2.34 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

RuvenGuna94/Dialogue-Summary-PEFT-Fine-Tuning

This notebook fine-tunes the FLAN-T5 model for dialogue summarization, comparing full fine-tuning with Parameter-Efficient Fine-Tuning (PEFT). It evaluates performance using ROUGE metrics, demonstrating PEFT's efficiency while achieving competitive results.

Language: Jupyter Notebook - Size: 289 KB - Last synced at: about 2 hours ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

davide-abbattista/SciQA

The Scientific Question Answering (SciQA) System is an end-to-end solution designed to provide accurate, contextually relevant, and citation-supported answers to user queries.

Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

RajVenkat20/LLM-Optimizations-QLoRA-AWQ

This project takes the Flan-T5 LLM and applies QLoRA and AWQ quantization techniques

Language: Python - Size: 488 KB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

pintamonas4575/NLP-text-detox-MAADM-UPM

NLP for detoxing language phrases in several languages.

Language: Jupyter Notebook - Size: 8.82 MB - Last synced at: 14 days ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ayax537/CodSoft-TASK101

Naive Bayes Model: Cleaned data, engineered features, and achieved impressive results. FLAN-T5 Large Language Model: Explored zero-shot, one-shot, and few-shot inference techniques.

Language: Python - Size: 3.91 KB - Last synced at: 13 days ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ayax537/codsoft-task4

📣 Forth task on CodSoft Internship During my internship at CodSoft, I had the opportunity to develop a Spam SMS Detection model. The goal was to create a machine-learning solution to classify SMS messages as "ham" or "spam" automatically.

Language: Python - Size: 3.91 KB - Last synced at: 6 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

AishwaryaHastak/RAG-using-T5

An End-to-End RAG Pipeline for handling Question and Answer in the Data Science domain

Language: Jupyter Notebook - Size: 2.27 MB - Last synced at: 6 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 1

PrathamLearnsToCode/Fine-tuning-FLAN-T5-with-LoRA-WandB

Fine tune an LLM for summarization task using Low rank adaptation

Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

qiqinyi/GenAI-with-LLMs

My lab work of “Generative AI with Large Language Models” course offered by DeepLearning.AI and Amazon Web Services on coursera.

Language: Jupyter Notebook - Size: 28.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

r-gheda/rag-complex-qa

Prompt-engineered RAGs for Open Domain Complex QA

Language: Jupyter Notebook - Size: 34.8 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Abhi-vish/Chat-With-PDFs

Chat with PDFs: A Streamlit-based web app for uploading multiple PDFs, asking natural language questions, and receiving answers using advanced language models.

Language: Python - Size: 7.81 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

AJlearner46/Deduplicate-flanv2-finetune-LLaMa3-

perform deduplication on FLAN v2 dataset & Finetune LLaMa3 using this dataset

Language: Jupyter Notebook - Size: 27.3 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

kconstable/LLM-fine-tuning

For this project, I fine-tuned two separate models for three tasks: document summarization, dialogue summarization and text classification

Language: Jupyter Notebook - Size: 195 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

WaveGenAI/lyric-tagger

The codebase for finetuning a flant5 model to include tag in a lyric.

Language: Python - Size: 47.9 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kHarshit/llm-projects

LLM projects

Language: Jupyter Notebook - Size: 4.69 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

purrvaja/Text-Summarization-with-FLAN-T5

Demonstration of LLM techniques such as prompt engineering, full finetuning, PEFT (LoRA) etc.

Language: Jupyter Notebook - Size: 36.1 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ambideXtrous9/MTP-News-Article-based-Question-Answering-System

MTP-FlanT5-SBERT-Model-for-NewsQA-and-Teacher-Student-Model

Language: Jupyter Notebook - Size: 3.77 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

AkashKV-1998/Dialog-Summarization-using-Large-Language-Model

This repository contains notebook files that discuss Large Language Models (LLMs), covering topics like fine-tuning, prompt engineering, and techniques such as PEFT (Parameter Efficient Fine-Tuning) and PPO (Proximal Policy Optimization) etc.

Language: Jupyter Notebook - Size: 80.1 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

highplainscomputing/HPC_T5

This repository is made for T5 model where user can train their model on any T5 model version.

Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

trjo1/genaiwithllms

Fine-tuned FLAN T-5 using Instruction Fine-Tuning (Full), LoRA-based PEFT, and RLHF with PPO

Language: Jupyter Notebook - Size: 56.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jruranski/TherapyAI

Web app for a therapist chatbot. Using a custom fine-tuned local flan-t5 model for summarisation and ChatGPT3.5 for chat.

Language: Jupyter Notebook - Size: 3.4 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

laceymalarky/nlp_question_answer

Developed a generative large language model fine-tuned on Stack Overflow data for question answering and deployed web UI.

Language: Jupyter Notebook - Size: 40 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

claudiatang-git/english-hawaiian-pidgin-translator

Finetuned FLAN-T5 to translate English to Hawaiian Pidgin

Language: Jupyter Notebook - Size: 34.2 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

bsantraigi/Frugal-Prompting-Analysis

Official Code for Analysis Done in the Paper "Frugal Prompting for Dialog Models"

Language: Jupyter Notebook - Size: 3.68 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

sahilichake/Document-Summarization-App-using-LLM

Document Summarization App using large language model (LLM) and Langchain framework. Used a pre-trained T5 model and its tokenizer from Hugging Face Transformers library. Created a summarization pipeline to generate summary using model.

Language: Python - Size: 6.84 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

aolivtous/LLMs-for-DocVQA

Master's thesis on Large Language Models for Document Visual Question Answering

Language: Python - Size: 12.7 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

iansnyder333/FakeNews-GPT_Project

Harness the power of GPT-2 and Flan-T5-Large using a single GPU with LoRA.

Language: Python - Size: 8.86 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

kkkravets/Seq2Lightning

Project based on PyTorch-lightning and Transformers for training Seq2SeqLM models, with a primary focus on MT5 and FLAN-T5, yet not limited to them

Language: Python - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

LaniW/CensoredCrawledConversation

Text-To-Text Textbots to Demonstrate Output Differences Between Models Trained on Filtered/Unfiltered Datasets for HSS4 - The Modern Context: Select Figures and Topics

Language: Jupyter Notebook - Size: 11.7 KB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

navneet1083/textsum-tune

This project is based on fine-tuning LLM models (FLAN-T5) for text summarisation task using PEFT approach. All evaluation metrics being computed on ROUGE scoring and LoRA optimisation techniques being used for fine-tuning.

Language: Jupyter Notebook - Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

kavlata/LLM-experiments

Multiple LLM based models for NLP tasks. Starting with Question answering on custom data

Language: Jupyter Notebook - Size: 739 KB - Last synced at: 10 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 1

abhinav-neil/socratic-models Fork of milenakapralova/socraticmodels

Socratic models for multimodal reasoning & image captioning

Language: Jupyter Notebook - Size: 48.8 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Related Topics
llm 32 huggingface 23 fine-tuning 22 generative-ai 20 nlp 20 lora 18 large-language-models 17 python 15 transformers 15 prompt-engineering 12 langchain 11 peft-fine-tuning-llm 11 huggingface-transformers 11 summarization 10 machine-learning 9 deep-learning 9 peft 9 natural-language-processing 9 rlhf 8 llms 8 streamlit 8 pytorch 8 question-answering 7 transformer 7 ai 6 chatgpt 6 text-summarization 6 bert 6 t5 6 rag 6 qlora 5 retrieval-augmented-generation 5 python3 4 faiss 4 faiss-vector-database 4 llama3 4 ppo 4 low-rank-adaptation 4 reinforcement-learning 4 language-model 4 dialogue-summarization 4 pdf-converter 3 classification 3 alpaca 3 proximal-policy-optimization 3 parameter-efficient-fine-tuning 3 kl-divergence 3 artificial-intelligence 3 gpt-3 3 t5-model 3 streamlit-webapp 3 llama 3 qa 3 text-generation 3 gpt 3 chatbot 3 rouge 3 openai 3 llm-training 2 zero-shot-learning 2 one-shot-learning 2 huggingface-datasets 2 computer-vision 2 text-classification 2 transformer-models 2 kaggle 2 hate-speech-detection 2 framework 2 fine-tune 2 modeling 2 tk-instruct 2 pdf-chatbot 2 llm-inference 2 sentence-transformers 2 few-shot-learning 2 llama2 2 mistral-7b 2 docx-files 2 aws 2 pdf-document 2 pptx-files 2 chain-of-thought 2 clip 2 gpt-4 2 contrastive-learning 2 transformers-models 2 google 2 visual-question-answering 2 prompt 2 mistral 2 llamacpp 2 news 2 extract-data 2 gradio 2 whisper 2 gemma 2 amazon-web-services 1 notebook-jupyter 1 treat 1 emotion-cause-pair-extraction 1