Arkil Patel

arkilpatel

https://arkilpatel.github.io/

AI & ML interests

NLP

upvoted 2 papers 9 months ago

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

Paper • 2504.08942 • Published Apr 11, 2025 • 28

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2, 2025 • 87

upvoted 2 papers 10 months ago

Exploiting Instruction-Following Retrievers for Malicious Information Retrieval

Paper • 2503.08644 • Published Mar 11, 2025 • 16

SafeArena: Evaluating the Safety of Autonomous Web Agents

Paper • 2503.04957 • Published Mar 6, 2025 • 21

upvoted a collection 10 months ago

CHASE

Generate challenging synthetic data to evaluate LLMs • 5 items • Updated Feb 21, 2025 • 4

upvoted 2 papers 10 months ago

Societal Alignment Frameworks Can Improve LLM Alignment

Paper • 2503.00069 • Published Feb 27, 2025 • 17

How to Get Your LLM to Generate Challenging Problems for Evaluation

Paper • 2502.14678 • Published Feb 20, 2025 • 18

upvoted a paper over 1 year ago

Learning Action and Reasoning-Centric Image Editing from Videos and Simulations

Paper • 2407.03471 • Published Jul 3, 2024 • 30