PITTI - Article - Decoding intermediate activations in llama-2-7b

Decoding intermediate activations in llama-2-7b

Artificial Intelligence,Information Processing | Computing

Date : 2023-07-21

Introduction

In line with previous research, Nina Rimsky found that the decoded block outputs at most layers, except a few early ones, were interpretable. She also found that the other intermediate outputs were interpretable and provided some intuition on what different layers were responsible for. Some very interesting insights in the post.

Read article here

Link

How hard does Art need to be ?