• @[email protected]
      link
      fedilink
      English
      06 months ago

      Okay so assuming the camera’s output can be represented as a series of bits, and that the arms’ input can be represented as another series of bits, you have successfully identified a text processing task.

      Your assertion then, is that this is a task outside the ability of an LLM to succeed at?

      How do you know that an LLM-steered robot cannot perform that task? Has that been tried?