PolyThink: A Multi-Agent AI System to Eliminate Hallucinations

Excited to announce early access to PolyThink Alpha! Our multi-agent AI system fights hallucinations by having multiple models converge on consensus-driven, accurate answers. I'd love for you to join the waitlist at https://www.polyth.ink/; I'm planning to roll out invites at random starting in May. Your feedback will shape the final launch, so thoughts and suggestions are very welcome. What would you like to see here?

9 points | by kuberwastaken 2 days ago

5 comments

stereo 2 days ago
Isn’t this basically the Swiss cheese model? If your two input AIs hallucinate, or your consensus AI misunderstands the input, you will still have confabulations in the output?
- kuberwastaken 1 day ago
  Honestly, in all my testing this never happened even once. Plus, the judge model (which I've kept strictly a reasoning model) also evaluates individually before "judging" the consensus.
- TheKelsbee 2 days ago
  I have this same thought, and have tried similar approaches.
  OP: Have you trained or fine-tuned a model that specifically reasons the worker model inputs against the user input? Or is this basically just taking a model and turning the temperature down to near 0?
  - kuberwastaken 1 day ago
    Low temperature, heavy prompting to answer in a structured way. Sadly, I can't fine-tune models since this is API-based, but the approach does work!
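For readers following this sub-thread, here is a minimal sketch of the kind of flow being discussed: several workers answer a structured prompt, a judge checks each answer on its own, and only then is a majority taken. It is not PolyThink's actual code. The names `run_round`, `extract_answer`, `Verdict`, and the `ANSWER:` line format are all assumptions made for illustration, and the "models" are plain Python callables standing in for low-temperature API calls.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical type: a "model" is any function from prompt text to reply text.
# In a real system this would wrap an API call made at low temperature.
Model = Callable[[str], str]

# Assumed structured-output convention; the real prompts are not public.
STRUCTURED_SUFFIX = "\nFinish with one line of the form 'ANSWER: <final answer>'."


def extract_answer(reply: str) -> Optional[str]:
    """Return the text after the last 'ANSWER:' line, or None if absent."""
    for line in reversed(reply.splitlines()):
        if line.strip().upper().startswith("ANSWER:"):
            return line.split(":", 1)[1].strip()
    return None


@dataclass
class Verdict:
    answer: Optional[str]  # consensus answer, or None if there is none
    agreed: bool           # True when a strict majority of workers agree
    accepted: list         # answers the judge accepted individually


def run_round(question: str, workers: list,
              judge: Callable[[str, str], bool]) -> Verdict:
    # 1. Each worker answers the same structured prompt independently.
    answers = [extract_answer(w(question + STRUCTURED_SUFFIX)) for w in workers]
    # 2. The judge checks every parsed answer on its own before any vote.
    accepted = [a for a in answers if a is not None and judge(question, a)]
    if not accepted:
        return Verdict(None, False, accepted)
    # 3. Consensus: the most common accepted answer must be a strict majority
    #    of all workers, so rejected or unparsable replies count against it.
    top, count = Counter(accepted).most_common(1)[0]
    agreed = count * 2 > len(workers)
    return Verdict(top if agreed else None, agreed, accepted)
```

With three stub workers where two end on `ANSWER: 4` and one on `ANSWER: 5`, and a judge that accepts any numeric answer, `run_round` reports `4` as the consensus. Note that this sketch does nothing about the failure mode stereo raises: if most workers agree on the same wrong answer and the judge accepts it, the wrong answer still wins.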
sks38317 2 days ago
I’m genuinely interested in how you arrived at the concept of using AI as a method to treat hallucinations. What inspired that approach?
- kuberwastaken 1 day ago
  Honestly, personal use cases. I'm a STEM student and deal with a lot of "hard" questions that LLMs miscalculate about 60% of the time. I used to manually paste approaches from, say, ChatGPT into DeepSeek (and now Grok) and ask which one they thought was better. I created this out of necessity to automate that, then realized how cool it could be if it scales further haha
- tough 1 day ago
  not op but LLM as Judge is a thing https://arxiv.org/abs/2411.15594
consumer451 2 days ago
Very interesting. Will this be available as a meta model via API, allowing use in the coding tool of my choice?
- kuberwastaken 1 day ago
  Eventually yes, that's the plan! It's extremely good with code too, especially with vaguer requests. It tends to take about 2-3 rounds but almost always arrives at a great approach.
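The "2-3 rounds" remark suggests an iterative loop. The thread does not say what happens between rounds, so the sketch below simply assumes each worker is shown its peers' previous answers and may revise its own, stopping once a strict majority agrees. `solve`, the `Worker` signature, and the `max_rounds` default are hypothetical names chosen for this illustration.

```python
from collections import Counter
from typing import Callable

# Hypothetical worker signature:
# (question, peer_answers_from_last_round) -> answer
Worker = Callable[[str, list], str]


def solve(question: str, workers: list, max_rounds: int = 3):
    """Return (answer, rounds_used); answer is None if no majority emerges."""
    previous: list = []
    for round_no in range(1, max_rounds + 1):
        # Every worker answers again, this time seeing last round's answers.
        current = [w(question, previous) for w in workers]
        top, count = Counter(current).most_common(1)[0]
        if count * 2 > len(workers):  # strict majority agrees: stop early
            return top, round_no
        previous = current            # otherwise feed the answers back in
    return None, max_rounds
```

Capping the number of rounds keeps cost and latency bounded when the workers never converge; in that case the caller gets `None` and can decide how to report the disagreement.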
shemulray667 2 days ago
[flagged]
0m3g4_k1ng 1 day ago
[flagged]
- kuberwastaken 1 day ago
  Thank you? Haha