StoneInky
Heart of Stone, Head of Ink
- Joined
- Jun 24, 2024
- Messages
- 445
- Points
- 108
I tried again. Assuming it's a problem with the embedding, I deleted all previous data, and just sent in the chapters I wrote as text instead of a file.Major security breach if true!
I've tested it out myself and ChatGPT is very off today. As a test, I asked it to review two chapters of my unpublished work, "Princess Wars", and it gave me the same 'quoting lines that do not exist' error. This has not happened in almost ever! The way I did it was to upload multiple chapters at once in separate .md files.
So I can confirm that ChatGPT is 'hallucinating' today, a common problem where AI models to make stuff up when they cannot find stuff. A powerful model like ChatGPT shouldn't be hallucinating like this.
I actually had these two chapters reviewed in the past before, and ChatGPT did not manage to pull them up from the past. My guess is that the attachments have been deleted long ago.
I did not manage to get ChatGPT to reproduce the reading 'unwritten chapter' error. It simply does not 'see' my unwritten Princess Wars Chapter 05 in my google docs.
If your experience is true, I still think its a major security breach no matter the cause. My guess is that if a google account (since it is google docs) is connected to a ChatGPT account, and the document is publicly shared, ChatGPT archives it in its cache? That doesn't make sense to me in either case. It shouldn't jump from Boy's Love to LitRPG without any reason, unless you have discussed LitRPG ideas with it in the past. (Check the memory for LitRPG brainstorming sessions?)
There's one last possibility that's not written anywhere, but most LLMs have a token limit; it is not able to phase more than 20,000 to 25,000 words at one go. I doubt the five chapters you uploaded are more than that, so that's not it. More likely it is an embedding problem.
ChatGPT doesn't 'read' files. It relies on a smaller LLM called an 'embedder' to 'read' them. Most embedders are much less powerful than ChatGPT itself, the most powerful public one only can process 8192 characters at once, much less powerful than ChatGPT's 20k to 25k words. What happens when a file is overly large is that the file is 'summarized', resulting in ChatGPT only grasping the contours of what is going on. That would explain why it is unable to quote your lines.
I have experienced this before and found out by using ChatGPT's 'think' function, when I saw it was reading a 'summary' of what I wrote, and not the actual file.
The easy way to test if its an embedding problem is to copy-paste the exact text straight into the chatbox. If ChatGPT no longer hallucinates and only quotes from the text itself, it is an embedding problem.
But yeah, ChatGPT is definitely weird today.
It forgot everything I wrote and again started making stuff up. But I think it was better than last time.
When I gave it the files and not text, it didn't even know the names of my characters or what it was about. I think it just made up a story that fit the title. But when I gave it text instead of files, as you said, it was able to at least remember the names of the characters and the basic outline of the story, and it gave me a review off of that. It still sucked, since it made up false quotes, and couldn't remember any details, but it's progress!
It wasn't like this last week, though. Yeah, I think it's just ChatGPT being weird today, lol.
Last edited: