·
AI & ML interests
None yet
Organizations
Activation Steering With Mean Response Probes : A Case Study In Suppressing Sycophancy In Language Models During TTC
Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level Model Enhancement