Hi HN! I've found this visualization tool immensely helpful over the years for getting an intuition for how an LLM “sees” some piece of text, and with a bit of elbow grease decided to move all compute to client side so I could make it publicly available.
I've found it particularly useful for:
– Understanding exactly how repetition and patterns affect a small LM's ability to predict correctly
– Understanding different tokenization patterns and how they affect model output
– Getting a general sense of how “hard” different prediction tasks are for GPT-style models
Known problems (that I probably won't fix, since this was a kind of one-off project):
– Doesn't work well with Unicode grapheme clusters that are multiple GPT-2 tokens (e.g. emoji, smart quotes)
– No support for models other than GPT-2 (maybe later?)
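For anyone curious why emoji and smart quotes trip things up: GPT-2's tokenizer is a byte-level BPE, so text is first encoded to UTF-8 bytes, and a single on-screen character can span several bytes (and often several tokens). A rough stdlib-only sketch of the byte counts involved:

```python
# GPT-2's byte-level BPE operates on UTF-8 bytes, not characters.
# A single visible character can be several bytes, which is why
# emoji and smart quotes can end up as multiple GPT-2 tokens.
for ch in ["a", "\u201c", "\U0001f600"]:  # ASCII letter, left smart quote, emoji
    print(repr(ch), "->", len(ch.encode("utf-8")), "UTF-8 bytes")
```

(The exact token count depends on the BPE merges, but a multi-byte character can never collapse below whatever merges exist for its byte sequence.)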