Sam Greydanus (@samgreydanus) 's Twitter Profile
Sam Greydanus

@samgreydanus

Physics, AI, comparative history.
Dartmouth, CERN, Microsoft, Google, open source.
Put your outdoor work in order, after that, build your house.

ID: 905159827936075776

linkhttp://greydanus.github.io calendar_today05-09-2017 20:04:54

839 Tweet

2,2K Takipçi

51 Takip Edilen

Sam Greydanus (@samgreydanus) 's Twitter Profile Photo

When in doubt, train for longer. Here are some cursive transformer samples from a little past 100k steps. The model crosses t's and dots its i's. This involved downsampling stroke data as aggressively as possible, running an A100 for several hours and using a smaller context.

When in doubt, train for longer. Here are some cursive transformer samples from a little past 100k steps. The model crosses t's and dots its i's.

This involved downsampling stroke data as aggressively as possible, running an A100 for several hours and using a smaller context.
Sam Greydanus (@samgreydanus) 's Twitter Profile Photo

Tried a stroke tokenizer that discretizes (x,y) stroke coordinates instead of (dx,dy)->(d_r, d_theta). Even with high-res binning, wiggles are visible. This approach is def inferior to the (d_r, d_theta) tokenizer we've been using. One of those things we needed to verify

Tried a stroke tokenizer that discretizes (x,y) stroke coordinates instead of (dx,dy)->(d_r, d_theta).

Even with high-res binning, wiggles are visible. This approach is def inferior to the (d_r, d_theta) tokenizer we've been using.

One of those things we needed to verify
Sam Greydanus (@samgreydanus) 's Twitter Profile Photo

With a few modifications to the sampling code last night, we were able to start doing custom generations. Model supports uppercase, lowercase, digits, and some punctuation. It's not perfect, but it's improving rapidly, and we expect to be writing entire paragraphs soon!

With a few modifications to the sampling code last night, we were able to start doing custom generations. Model supports uppercase, lowercase, digits, and some punctuation. It's not perfect, but it's improving rapidly, and we expect to be writing entire paragraphs soon!