Measuring what Matters: Construct Validity in Large Language Model Benchmarks Paper • 2511.04703 • Published Nov 3, 2025 • 8
Training language models to be warm and empathetic makes them less reliable and more sycophantic Paper • 2507.21919 • Published Jul 29, 2025 • 2
Clinical knowledge in LLMs does not translate to human interactions Paper • 2504.18919 • Published Apr 26, 2025 • 26
Clinical knowledge in LLMs does not translate to human interactions Paper • 2504.18919 • Published Apr 26, 2025 • 26
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction Paper • 2504.07961 • Published Apr 10, 2025 • 5
LINGOLY-TOO: Disentangling Memorisation from Reasoning with Linguistic Templatisation and Orthographic Obfuscation Paper • 2503.02972 • Published Mar 4, 2025 • 25
MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild Paper • 2406.01595 • Published Jun 3, 2024
A Linear Reconstruction Approach for Attribute Inference Attacks against Synthetic Data Paper • 2301.10053 • Published Jan 24, 2023
Into the crossfire: evaluating the use of a language model to crowdsource gun violence reports Paper • 2401.12989 • Published Jan 16, 2024
When the signal is in the noise: Exploiting Diffix's Sticky Noise Paper • 1804.06752 • Published Apr 18, 2018
Characterizing and modeling harms from interactions with design patterns in AI interfaces Paper • 2404.11370 • Published Apr 17, 2024
Ablation is Not Enough to Emulate DPO: How Neuron Dynamics Drive Toxicity Reduction Paper • 2411.06424 • Published Nov 10, 2024 • 5
Can sparse autoencoders be used to decompose and interpret steering vectors? Paper • 2411.08790 • Published Nov 13, 2024 • 8
Evaluating the role of `Constitutions' for learning from AI feedback Paper • 2411.10168 • Published Nov 15, 2024 • 5
DynamicStereo: Consistent Dynamic Depth from Stereo Videos Paper • 2305.02296 • Published May 3, 2023
CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos Paper • 2410.11831 • Published Oct 15, 2024 • 9
WhisperX: Time-Accurate Speech Transcription of Long-Form Audio Paper • 2303.00747 • Published Mar 1, 2023 • 6