• entropicdrift@lemmy.sdf.org · 2 upvotes · 17 hours ago

      The only way to make AI less power efficient: require an instance of Electron for each GPU thread in the matrix multiplication. Suddenly there is not enough RAM on the planet and optimizing Chrome becomes the all-consuming desire of the world’s wealthiest nations.

    • CapriciousDay@lemmy.ml · English · 6 upvotes · 2 days ago

      The advanced version also has a locally hosted chatbot per primitive.

      We’ve solved annoying casts with next-gen type coercion: type hallucination. The LLM tries to figure out any necessary conversions at runtime and just kind of guesses any missing details.

      The best thing is it doesn’t ever do the same thing twice. So if it causes a bug the first time, it might not the second time.