University College London

university

Verified

https://www.ucl.ac.uk

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

kenza-ily authored a paper about 1 month ago

DISCO: Document Intelligence Suite for COmparative Evaluation

Zhouhc submitted a paper about 2 months ago

Memento-Skills: Let Agents Design Agents

kenza-ily authored a paper about 2 months ago

Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models

View all activity

Papers

AgentSearchBench: A Benchmark for AI Agent Search in the Wild

Memento-Skills: Let Agents Design Agents

View all Papers

submitted a paper to Daily Papers 19 days ago

Target Policy Optimization

Paper • 2604.06159 • Published 27 days ago • 23

authored a paper about 1 month ago

Interventional Time Series Priors for Causal Foundation Models

Paper • 2603.11090 • Published Mar 11

submitted a paper to Daily Papers about 2 months ago

Memento-Skills: Let Agents Design Agents

Paper • 2603.18743 • Published Mar 19 • 58

submitted a paper to Daily Papers 3 months ago

InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem

Paper • 2602.14367 • Published Feb 16 • 17

authored a paper 3 months ago

AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents

Paper • 2602.06855 • Published Feb 6 • 83

authored 3 papers 6 months ago

Sequential Causal Normal Form Games: Theory, Computation, and Strategic Signaling

Paper • 2511.06934 • Published Nov 10, 2025

Towards Causal Market Simulators

Paper • 2511.04469 • Published Nov 6, 2025 • 1

Causal Regime Detection in Energy Markets With Augmented Time Series Structural Causal Models

Paper • 2511.04361 • Published Nov 6, 2025

authored 3 papers 7 months ago

How the Misuse of a Dataset Harmed Semantic Clone Detection

Paper • 2505.04311 • Published May 7, 2025

ReAssert: Deep Learning for Assert Generation

Paper • 2011.09784 • Published Nov 19, 2020

Sentinel: A Hyper-Heuristic for the Generation of Mutant Reduction Strategies

Paper • 2103.07241 • Published Mar 12, 2021

authored a paper 11 months ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30, 2025 • 74

authored 2 papers about 1 year ago

Judging the Judges: A Collection of LLM-Generated Relevance Judgements

Paper • 2502.13908 • Published Feb 19, 2025 • 5

LLMJudge: LLMs for Relevance Judgments

Paper • 2408.08896 • Published Aug 9, 2024

authored 2 papers almost 2 years ago

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22, 2024 • 50

Are We Done with MMLU?

Paper • 2406.04127 • Published Jun 6, 2024 • 39

authored 4 papers about 2 years ago

WARP: Word-level Adversarial ReProgramming

Paper • 2101.00121 • Published Jan 1, 2021

Towards JointUD: Part-of-speech Tagging and Lemmatization using Recurrent Neural Networks

Paper • 1809.03211 • Published Sep 10, 2018

Natural Language Inference over Interaction Space: ICLR 2018 Reproducibility Report

Paper • 1802.03198 • Published Feb 9, 2018 • 1

Technical Report on the CleverHans v2.1.0 Adversarial Examples Library

Paper • 1610.00768 • Published Oct 3, 2016