cross-posted from: https://programming.dev/post/177822

It’s coming along nicely, and I hope to release it in the next few days.

Screenshot:

How It Works:

I am a bot that generates summaries of Lemmy comments and posts.

  • Just mention me in a comment or post, and I will generate a summary for you.
  • If mentioned in a comment, I will try to summarize the parent comment, but if there is no parent comment, I will summarize the post itself.
  • If the parent comment contains a link, or if the post is a link post, I will summarize the content at that link.
  • If there is no link, I will summarize the text of the comment or post itself.
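The selection rules above can be sketched roughly as follows. This is only an illustration of the described behavior, not the bot's actual code: the `Item` class and the `pick_target` function are hypothetical names, and real Lemmy posts and comments carry many more fields.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Item:
    """A Lemmy post or comment, reduced to the two fields the rules need (hypothetical type)."""
    text: str
    link: Optional[str] = None  # URL of a link post, or a link found in a comment


def pick_target(post: Item, parent_comment: Optional[Item] = None) -> tuple[str, str]:
    """Return ("link", url) or ("text", body) for the item that should be summarized."""
    # Prefer the parent comment; fall back to the post when there is none.
    item = parent_comment if parent_comment is not None else post
    # A link takes priority over the item's own text.
    if item.link:
        return ("link", item.link)
    return ("text", item.text)
```

For example, a mention under a link post with no parent comment would resolve to the post's link, while a mention replying to a plain-text comment would resolve to that comment's text.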

Extra Info in Comments:

Prompt Injection:

Of course, it’s really easy (but mostly harmless) to break it using prompt injection:

It will only be available in communities that explicitly allow it. I hope it will be useful; I’m generally very satisfied with the quality of the summaries.

  • Anamana@feddit.de · 6 upvotes · 1 year ago

    Just curious because I was discussing this with someone else on here. Do you think it’s possible to create a tldw bot with chatgpt for YouTube videos as well?

    • 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 (OP) · 4 upvotes · 1 year ago (edited)

      It is definitely possible, at least for videos that have a transcript. There are tools to download the transcript, which can then be fed into an LLM to be summarized.

      I tried it here with excellent results: https://programming.dev/post/158037 - see the post description!

      See also the conversation: https://chat.openai.com/share/b7d6ac4f-0756-4944-802e-7c63fbd7493f

      I used GPT-4 for this post, which is miles ahead of GPT-3.5, but it would be prohibitively expensive (for me) to use it for a publicly available bot. I also asked it to generate a longer summary with subheadings instead of a TLDR.

      The real question is whether it is legal to programmatically download video transcripts this way. But technically it is entirely possible, even easy.

      • Anamana@feddit.de · 2 upvotes · 1 year ago

        Ah yeah, looks good. I mean, GPT-4 is definitely better, but if someone could write a bot using GPT-3.5, it would still be better than nothing, I guess.

        For videos without transcripts, you could also use Whisper to transcribe the videos first. But that would probably be way too expensive.

        Still nice to see it would be possible.

      • 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 (OP) · 1 upvote · 1 year ago

        Oh, I’ve just realized that it’s also possible if the video doesn’t have a transcript. You can download the audio and feed it into OpenAI Whisper (which is currently the best available audio transcription model), and pass the transcript to the LLM. And Whisper isn’t even too expensive.

        Not sure about the legality of it though.

        • 𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 (OP) · 1 upvote · 1 year ago (edited)

          If you want it to be really good, you can transcribe the audio even if the YouTube video already has a transcript. Whisper is much better than whatever YouTube uses for the subtitles. Of course it will be more expensive this way.
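The flow discussed in this sub-thread (use the existing transcript, fall back to Whisper on the audio, or always use Whisper for quality) could look something like the sketch below. All names here are hypothetical, and the three callables are placeholders for whatever transcript downloader, Whisper client, and LLM call one would actually plug in; nothing in it reflects a specific library's API.

```python
from typing import Callable, Optional


def summarize_video(
    video_url: str,
    fetch_transcript: Callable[[str], Optional[str]],  # hypothetical: returns None if no transcript
    transcribe_audio: Callable[[str], str],            # hypothetical: e.g. download audio, run Whisper
    summarize: Callable[[str], str],                   # hypothetical: e.g. send the text to an LLM
    prefer_whisper: bool = False,
) -> str:
    """Summarize a video from its transcript, transcribing the audio when needed."""
    # Skip the existing transcript entirely when higher quality is wanted.
    transcript = None if prefer_whisper else fetch_transcript(video_url)
    if not transcript:
        # No usable transcript (or Whisper was requested): transcribe the audio.
        transcript = transcribe_audio(video_url)
    return summarize(transcript)
```

Passing the steps in as functions keeps the sketch independent of any particular service, and setting `prefer_whisper=True` corresponds to the more expensive "transcribe even if a transcript exists" variant.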

    • ShrimpsIsBugs@feddit.de · 0 upvotes · 1 year ago

      Will it work on all instances?

      And how do you manage costs? Querying GPT isn’t that expensive, but when many people use the bot, the costs might add up substantially.