Exploring Task Performance with Interpretable Models via Sparse Auto-Encoders arxiv.org 1 points by PaulHoule 6 hours ago