Microsoft Announces: LongNet - Scaling LLM Transformers to 1,000,000,000 Tokens & Context Length

Blaed@lemmy.world · 1 year ago

Blaed@lemmy.world · edit-2 1 year ago

You are correct in thinking this will demand a lot of compute. Hardware will need to scale to match these context lengths, but that is becoming increasingly feasible with things like NVIDIA’s Grace Hopper architecture and AMD’s recent commitment to expanding its hardware lineup for emerging AI markets and demand.

There are also some really interesting frameworks and hardware developments coming out of TinyCorp & TinyGrad that aim to run these emerging technologies efficiently and accessibly. George Hotz, TinyCorp’s founder, talks about this in detail in his podcast episode with Lex Fridman, a great watch if you’re interested in this sort of stuff.

It is an exciting time for technology and innovation. We have already started to hit exaflops of compute…