5
retoor
4h

It's still in development. It often says the opposite from what is expected. Try Retoor1b chatbot at https://llm.molodetz.nl

This was result after building bot + chat website from scratch including training with embeddings. Design is generated by GPT, I tried my own but all ugly.

It's quite cool huh? Ask it to write some code for you. It's absolutely terrible. If it's down, try again in 5 minutes. I'm still working on it.

What's the result? I finally have a toolkit to make good/serious bots. Code could be bit better, but that's for other day.

Stack: self written webserver (and yes, you can post a gb to it or ddos it. Not sure if it survives the first one. I should limit requests to one mb anyway. Http headers may officially not be more than 4096 in total) since I know http protocol from my head anyway. Python websockets module. Asyncio, chromadb.

It could have xss issues. Don't care.

Let me know what you think

Comments
  • 1
    it's down already!
  • 1
    @jestdotty works fine here
  • 0
    @jestdotty, this is what I see:
  • 0
    @jestdotty dns is good. https://check-host.net/check-dns/...

    What a great site btw, you can directly use an unix shell for debugging. curl https://llm.molodetz.nl doesn't work. But I've screenshot from @feuerherz that it works from Germany.

    I did recently some name server changes. That was yesterday. Maybe smth is still a bit fucked. Will be fixed automatically
  • 1
    was making coffee coffee coffee

    on phone:

    and just checked pc and it hangs also (was hoping to see errors but I don't)
  • 1
    yeah maybe. interestingly I can't access Yandex despite website saying it's up... or couldn't the other day. lemme see... nope Yandex works now

    I know in Germany they ban people from being able to browse to whole websites. when I got COVID vaccine injured I was in a discord server and this guy didn't believe me and I kept sending him basically like... doctor whistleblower websites where doctors were collaborating showing scans of injuries and such and all those websites were 404 for him which was terrifying
  • 0
    @jestdotty I do not think Germans are aware of that. I do not expect from Germany to restrict access to public information. @Germans: is this true?
  • 0
    Fuck, my rant is not clear. This is the result of a day work. All together.

    @jestdotty: ask the bot "What says jestdotty about ostream?"
  • 1
    @retoor still down for me 😝

    idk what you said anyway. what's an embedding?
    what's it mean design is generated by gpt? like gpt wrote the CSS for the website?

    I think I'm alright with design but when I did it at school everyone hated it. may have been cuz they were just asshole dudes though. we nearly failed that project cuz they refused to work on it, and it was the capstone project for the whole degree lol
  • 1
    I used to do a lot of art. was that weird quiet kid that drew skulls and other weird shit back of the class, except then I transitioned to anime, generally of the violent kind where I'd morph humans and animals with technology

    and sometimes put stuff on alien planets with alien monsters / animals or plants, or plain in space amongst dissolving planets and whatever stuff

    but I was told art doesn't make any money so I went with computers
  • 0
    @jestdotty they kept having me tweak the website's design all the time so I kept wasting time on it

    which was bullshit cuz I did most of that project cuz they didn't do their parts -.-

    probably they were trying to burn me out so I'd leave them alone
  • 2
    Damn! What model is it running? Is it your own?
  • 2
    @12bitfloat customized version of gwen2.5b0.5. I enriched data with embeddings. The retoor personality is programmed in the model system. It runs on a 4Gb Dual Core VPS. I could probably up to qwen2.5b1.5 but there's more i want to do on that vps. I also run the web version of vscode there for example. What a great product is that. No difference with real vscode. Even codeium runs in the web version! I'm now trying to run qwen2.5-coder:0.5b locally in vscode. Let's see if it's much difference with codeium
  • 0
    @jestdotty an embedding is just a piece of data that you can store in a vector database. I mean that i converted all stuff about dr and stuff to vector thingies and it knows what you're talking about superfast while having quite a big dataset now
  • 1
    @12bitfloat did you try it?
  • 2
    @retoor It unfortunately didn't load for me, it just times out :(
  • 1
    @jestdotty, it's weird that my bot has this in memory: "jestdotty has said on devrant: ostream is just confused and throws his confusion on others. he wants validation / help for his confusion I think." if you ask "Did ostream delete himself?" and it's answer is: "No, ostream did not delete himself. He continues to exist outside of the context provided in the prompt."

    What the fuck does that answer have to do with what it had in memory?

    Embedding works like this: first you insert a bunch of data. Later you can query them. The prompt used for the model, is the same as you use for the embedding. The embedding data as result is the context for the model.

    Yes, the CSS is done by ChatGPT
  • 2
    @12bitfloat so weird, it works superfast here (on wifi and phone 5g) and my friend from Germany can access it as well. Probably we have to sleep a night over it. Still aftermath of the nameserver update i think. I can't run it on other domain either, I updated nameservers for all. I wanted to add cloudflare but than i realized that i don't want a man in the middle. Not scared for ddos anyway. I also have ssh open with password auth. Certificates are just extra for me. fail2ban and firewall active tho. Brute forcing over internet is not doable. Recently checked it. It takes 212321291 combinations to crack a simple word like 'retoor'. No one will do that over web
  • 0
    @retoor Oh that might be it. I use a specific dns provider, maybe the records haven't been flushed yet
  • 0
    @retoor oooh ok embeddings is a vector database that's serves as its knowledge base. so you can make them learn then! ok that's excellent
  • 0
    @retoor I don't understand the question of what does that have to do with what it has in memory. cuz it's about ostream?
  • 0
    @retoor cloudflare is pretty terrible. keeps banning IPs that don't deserve to be banned on some websites I've seen

    also when I was doing webscraping you can still get around their anti bot stuff, people have free tools for that out there

    and ok. it probably has to do with the domain name. when I stuck a website I had thru DNS servers it would take some days for them to update and it seemed to be regional. so Europe pulled your records maybe but not north America. they don't pull new domains frequently for whatever reason so maybe it's just lagging being pulled at other DNS resolver locations around the world
Add Comment