GitHub / OSU-NLP-Group 1 Repository
OSU-NLP-Group/Online-Mind2Web
An Illusion of Progress? Assessing the Current State of Web Agents
Language: Python - Size: 6.88 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 37 - Forks: 0

OSU-NLP-Group/saev
Sparse autoencoders for vision
Language: Elm - Size: 32.5 MB - Last synced at: 5 days ago - Pushed at: 15 days ago - Stars: 26 - Forks: 3

OSU-NLP-Group/hal-harness Fork of princeton-pli/hal-harness
Language: Python - Size: 1.97 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

OSU-NLP-Group/Mind2Web
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"
Language: Jupyter Notebook - Size: 100 MB - Last synced at: 4 days ago - Pushed at: 21 days ago - Stars: 816 - Forks: 108

OSU-NLP-Group/EIA_against_webagent
Language: Python - Size: 4.01 MB - Last synced at: 9 days ago - Pushed at: 7 months ago - Stars: 22 - Forks: 1

OSU-NLP-Group/HippoRAG
[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.
Language: Python - Size: 79.7 MB - Last synced at: 13 days ago - Pushed at: 16 days ago - Stars: 2,200 - Forks: 177

OSU-NLP-Group/GUI-Agents-Paper-List
Building a comprehensive and handy list of papers for GUI agents
Language: Python - Size: 921 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 288 - Forks: 16

OSU-NLP-Group/WebDreamer
"Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"
Language: Python - Size: 1.15 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 63 - Forks: 5

OSU-NLP-Group/SkillWeaver
SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.
Language: Python - Size: 520 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

OSU-NLP-Group/MagicBrush
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
Language: Python - Size: 112 MB - Last synced at: 16 days ago - Pushed at: 2 months ago - Stars: 345 - Forks: 15

OSU-NLP-Group/UGround
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
Language: Python - Size: 78.6 MB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 200 - Forks: 11

OSU-NLP-Group/TravelPlanner
[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
Language: Python - Size: 81.5 MB - Last synced at: 17 days ago - Pushed at: 4 months ago - Stars: 339 - Forks: 48

OSU-NLP-Group/LLM4Chem
Official code repo for the paper "LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset"
Language: Python - Size: 130 MB - Last synced at: 16 days ago - Pushed at: 5 months ago - Stars: 82 - Forks: 11

OSU-NLP-Group/reversal-curse-binding
Language: Python - Size: 27.7 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

OSU-NLP-Group/SeeAct
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
Language: Python - Size: 375 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 737 - Forks: 96

OSU-NLP-Group/COSMO
[CIKM'24] Reviving the Context: Camera Trap Species Classification as Link Prediction on Multimodal Knowledge Graphs
Language: Python - Size: 4.85 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 7 - Forks: 0

OSU-NLP-Group/GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
Language: Python - Size: 32.6 MB - Last synced at: 21 days ago - Pushed at: 5 months ago - Stars: 186 - Forks: 19

OSU-NLP-Group/In-Context-Reranking
Code for "Attention in Large Language Models Yeilds Efficient Zero-Shot Re-Rankers"
Language: Python - Size: 425 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 16 - Forks: 1

OSU-NLP-Group/LLM-Planner
[ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
Language: C - Size: 23.2 MB - Last synced at: 20 days ago - Pushed at: 30 days ago - Stars: 173 - Forks: 18

OSU-NLP-Group/AmpleGCG
AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM
Language: Python - Size: 680 KB - Last synced at: 21 days ago - Pushed at: 6 months ago - Stars: 59 - Forks: 6

OSU-NLP-Group/KG-R3
Code for the CIKM'23 paper "A Retrieve-and-Read Framework for Knowledge Graph Link Prediction"
Language: Python - Size: 6.36 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 11 - Forks: 0

OSU-NLP-Group/AttrScore
Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"
Language: Python - Size: 485 KB - Last synced at: 21 days ago - Pushed at: almost 2 years ago - Stars: 56 - Forks: 2

OSU-NLP-Group/Middleware
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)
Language: Python - Size: 1.96 MB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 36 - Forks: 2

OSU-NLP-Group/LLM-Knowledge-Conflict
[ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"
Language: Python - Size: 45 MB - Last synced at: 21 days ago - Pushed at: about 1 year ago - Stars: 67 - Forks: 3

OSU-NLP-Group/TableLlama
[NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".
Language: Python - Size: 21.4 MB - Last synced at: 21 days ago - Pushed at: 12 months ago - Stars: 127 - Forks: 13

OSU-NLP-Group/Deductive-Beam-Search
[COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning"
Language: Python - Size: 34.2 KB - Last synced at: 21 days ago - Pushed at: 10 months ago - Stars: 20 - Forks: 2

OSU-NLP-Group/ChemToolAgent
Official code repo for the paper "ChemToolAgent: The Impact of Tools on Language Agents for Chemistry Problem Solving" (previously "Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving")
Language: Python - Size: 2.39 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 8 - Forks: 4

OSU-NLP-Group/MQA
Multimodal Question Answering for Unified Information Extraction
Language: Python - Size: 2.23 MB - Last synced at: 21 days ago - Pushed at: 6 months ago - Stars: 9 - Forks: 1

OSU-NLP-Group/Explorer
Size: 1.79 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

OSU-NLP-Group/QA4RE
[ACL'23 Findings] "Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors"
Language: Python - Size: 50.8 KB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 39 - Forks: 5

OSU-NLP-Group/llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
Language: Python - Size: 148 KB - Last synced at: 21 days ago - Pushed at: about 1 year ago - Stars: 54 - Forks: 4

OSU-NLP-Group/ScienceAgentInterface
Language: JavaScript - Size: 223 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

OSU-NLP-Group/SELM
Symmetric Encryption with Language Models
Language: Python - Size: 737 KB - Last synced at: 21 days ago - Pushed at: almost 2 years ago - Stars: 12 - Forks: 2

OSU-NLP-Group/AgentSafety
Size: 24.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 41 - Forks: 0

OSU-NLP-Group/Text2SQL-Error-Detection
Code for paper "Error Detection for Text-to-SQL Semantic Parsing"
Language: Python - Size: 105 KB - Last synced at: 21 days ago - Pushed at: 8 months ago - Stars: 5 - Forks: 2

OSU-NLP-Group/SeeActChromeExtension
Language: TypeScript - Size: 22.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 10 - Forks: 6

OSU-NLP-Group/awesome-agents4science
A curated list of papers on LLMs and agents for scientific research and development
Size: 47.9 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 5 - Forks: 0

OSU-NLP-Group/UMLS-Vocabulary-Insertion
Language: Python - Size: 252 KB - Last synced at: 21 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

OSU-NLP-Group/TacoBot
Language: Python - Size: 30 MB - Last synced at: 9 days ago - Pushed at: 5 months ago - Stars: 3 - Forks: 1

OSU-NLP-Group/Auto-SQL-Correction
Code, data, and model of paper "Text-to-SQL Error Correction with Language Models of Code" (ACL'23)
Language: Python - Size: 49.8 KB - Last synced at: 21 days ago - Pushed at: 8 months ago - Stars: 30 - Forks: 3

OSU-NLP-Group/ScienceAgentBench
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Language: Python - Size: 172 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 17 - Forks: 2

OSU-NLP-Group/AttributionBench
Language: Python - Size: 43.8 MB - Last synced at: 21 days ago - Pushed at: 11 months ago - Stars: 9 - Forks: 1

OSU-NLP-Group/Eval-LLM-Trust
Language: Python - Size: 19.4 MB - Last synced at: 21 days ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

OSU-NLP-Group/LLM-CN-Eval
[NAACL'24] A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models
Language: Python - Size: 3.36 MB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

OSU-NLP-Group/GroundCocoa
Language: Python - Size: 17.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

OSU-NLP-Group/FL4SemanticParsing
Language: Python - Size: 12.8 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

OSU-NLP-Group/AgentAttack
Size: 6.76 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 0

OSU-NLP-Group/Auto-Dialectical-Evaluation
Language: Jupyter Notebook - Size: 9.05 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

OSU-NLP-Group/Pangu
Size: 1.95 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 12 - Forks: 0

OSU-NLP-Group/Bio-Tokenization
Biomedical LMs are Robust to Sub-optimal Tokenization
Language: Python - Size: 6.11 MB - Last synced at: 9 days ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 1
