30 papers found
What Are Tools Anyway? A Survey from the Language Model Perspective
arXiv (Cornell University)20244 citations
Fine-grained Hallucination Detection and Editing for Language Models
arXiv (Cornell University)20248 citations
In-Context Learning with Long-Context Models: An In-Depth Exploration
arXiv (Cornell University)20247 citations
Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes
arXiv (Cornell University)20246 citations
NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples
arXiv (Cornell University)20243 citations
Large Language Models Enable Few-Shot Clustering
Transactions of the Association for Computational Linguistics202444 citations
An Incomplete Loop: Instruction Inference, Instruction Following, and In-context Learning in Language Models
arXiv (Cornell University)20243 citations
Do LLMs Exhibit Human-like Response Biases? A Case Study in Survey Design
Transactions of the Association for Computational Linguistics202447 citations
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
arXiv (Cornell University)20246 citations
Evaluating Text-to-Visual Generation with Image-to-Text Generation
Lecture notes in computer science202435 citations
Learning to Filter Context for Retrieval-Augmented Generation
arXiv (Cornell University)20238 citations
Do LLMs exhibit human-like response biases? A case study in survey design
arXiv (Cornell University)20235 citations
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios
arXiv (Cornell University)202325 citations
An In-depth Look at Gemini's Language Abilities
arXiv (Cornell University)202311 citations
Unlimiformer: Long-Range Transformers with Unlimited Length Input
arXiv (Cornell University)202323 citations
ChatGPT MT: Competitive for High- (but not Low-) Resource Languages
arXiv (Cornell University)20235 citations
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
arXiv (Cornell University)20233 citations
Computational Language Acquisition with Theory of Mind
arXiv (Cornell University)20235 citations
Large Language Models Enable Few-Shot Clustering
arXiv (Cornell University)20234 citations
Multi-Dimensional Evaluation of Text Summarization with In-Context Learning
arXiv (Cornell University)20235 citations
