Join devRant
Do all the things like
++ or -- rants, post your own rants, comment on others' rants and build your customized dev avatar
Sign Up
Pipeless API
From the creators of devRant, Pipeless lets you power real-time personalized recommendations and activity feeds using a simple API
Learn More
Search - "openai's"
-
I have a new UNTRAINED bot on my site. It's based on openai now. And that's why it's blazing fast and blazing usless.
I can tell you why bots are so boring and will sure cause the dead internet theory. My datasets for example never contain real disturbing stuff ACCORDING TO NORMAL PEOPLE. EVERY TIME:
"The job failed due to an invalid training file. This training file was blocked by our moderation system because it contains too many examples that violate OpenAI's usage policies, or because it attempts to create model outputs that violate OpenAI's usage policies."
Now i'm really done. I gonna email them about their unusable training system.
In theory, i could test the message one by one if it is bad first. Don't want to do or pay for that. There should be an option to skip the data it considers disturbing instead of cancelling a whole data set for 0.1%. You also don't want to know how long it takes BEFORE he is finished validating you set. I think someone is doing it manually and clicks 'Uh uh..'-button..
Also, for the people who think they have gpt4o by having the API, you're lied to. The 'own gpt'-option on the paid openai is way more advanced than the ones you make locally.
They don't give us the real good stuff!
Oh, btw! The input data for my training is based on FORMER conversations with the bot. I automated a script to repeat a conversation I had and selected those messages and clicked 'train'. So it even complained about its OWN data! That data was already saying stuff like 'I can't help you with that' IN my training data. So, you 'corrected' and corrupted my data and now its still nog good enough for round 2?
I would really love to go back to local LLM's, but I can't imagine having ever a machine that generates as fast as the real GPT does. I also prefer to do it myself, but it's David vs. Goliath, even with a 5k computer. I'm sure.
Low quality rant, I know. I'm typing while still frustrated. For people who think censorship is needed often, this is the result! According to someone else, YOU are the one who has to be censored. Don't forget that.11 -
AI is more than just a model.
It's also tooling. Tooling can help to interpret data or solve a puzzle like Sudoku or parse a JSON file perfectly. Results of those tooling will be wrapped in AI response. That quality of tooling responses is high because it's made by classical code that works with literal data and outputs literal data. As long the competition of OpenAI doesn't have tooling like that, it won't be the same.
I do assume for now that DeepSeek doesn't have that. I tried it, it answers things well, but for bigger questions that would require tooling it just crashes and says it's too busy. So I can't verify 100%.
Will try again later and update under this Rant, but assume the DeepSeek stuff is very over hyped. To know what DeepSeek really is about without watching all the fake fan video's, take this quite objective response of the maker of Perplexity AI. Someone that knows where he's talking about: (40 minutes) https://youtube.com/watch/....
So when it comes to investments of a model, what does the stuff investment is incuded in? I mean, OpenAI was way more expensive than DeepSeek but DeepSeek borrowed all OpenAI's research that was made by very expensive processes. So DeepSeek didn't pay research costs like OpenAI did. Also it (I still assume) didn't spend money on tooling.
Also, i'm sure a less woke API would be way cheaper because it doesn't have to lie to himself causing it to keep reasoning until his given woke fact makes sense! Wokism destroys models, i'm sure.
I didn't check DeepSeek on wokism yet, but it's based op GPT4o so, probably it is.
But competition is always great, I can't imagine the price would even drop further for AI requests but if it does, it would be amazing. Maybe it also becomes free and we will be forced to pay to use it without adverts.7