Search - "voice recognition"
-
(Warning: kinda long && somewhat of a political rant)
Every time I tell someone I work with AI, the first thing to come out of their mouth is "oh but AI is going to take over the world!"
No.
It was only fairly recently that it became able to recognize what was in a picture from over 3 million images, and even that it's not that great at. Honestly, people always say "AI is just if-else" ironically, but it isn't really that far from the truth: we just multiply an input by weights and check the output.
It isn't some magical sauce, it's not being born and then exploring a problem, it's just glorified probability prediction. Even in "unsupervised" learning, the domain set is provided; in "reinforcement learning", which has gotten super popular lately, we just have the computer decide which policy is optimal and apply that to an environment. It's a glorified decision tree (and technically tree models like XGBoost outperform neural networks and deep learning on a large number of problems) and it isn't going to "decide" to take over the planet.
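To make the "multiply an input by weights and check the output" point concrete, here is a minimal sketch of a single artificial neuron in Python. The function name, weights, bias, and threshold are made-up values for illustration and are not taken from any real model; training (finding the weights) is left out entirely, this only shows the prediction step.

```python
def neuron(inputs, weights, bias=0.0, threshold=0.0):
    """Weighted sum of the inputs plus a bias, followed by a hard threshold."""
    if len(inputs) != len(weights):
        raise ValueError("inputs and weights must have the same length")
    total = sum(x * w for x, w in zip(inputs, weights)) + bias
    # The "check the output" step: fire (1) only if the sum clears the threshold
    return 1 if total > threshold else 0


# Hypothetical weights and bias that make the neuron behave like a logical AND
AND_WEIGHTS = [1.0, 1.0]
AND_BIAS = -1.5

for a in (0, 1):
    for b in (0, 1):
        print(a, b, "->", neuron([a, b], AND_WEIGHTS, AND_BIAS))
```

With these particular numbers the neuron only outputs 1 when both inputs are 1, which is about as close to a literal if-else as it gets.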
Honestly all of this is just born out of Elon Musk fans who take his word as truth and have been led to believe that AI is going to take over the world. There are a billion reasons why it can't! And to top it off this takes away a lot of public attention from VERY concerning ethical issues with AI.
Am I the only one who saw Google Duplex being unveiled and immediately thought "fraud"? Forget phone scammers, if you trained Duplex on the mannerisms of, for example, a famous politician's voice, you could impersonate them in an audio clip (or even a video clip with deepfakes). Or for example the widespread use of object detection and facial recognition in surveillance systems deployed by the DoD. Or the use of AI combined with location tracking and browsing analytics for targeted marketing.
The list of ethics breaches is endless, and I find it super suspicious that those profiting the most off of unethical AI are all too eager to shift public concern to some science fiction Terminator-style takeover that, if ever possible, would be a long way out and is not any sort of a priority issue right now.
-
While this wasn't technically a real client, it's still one of the most insane requests I've ever had.
I chose to specialize in software engineering for the last year and a half of my degree, which meant a lot of subjects were based around teamwork, proper engineering practises, accessibility, agile methods, basically a lot of stuff to get us ready to work in a proper corporate dev environment. One of our subjects was all about project management, and the semester-long coursework project (that was in lieu of a final exam) was to develop a real project for a real client. And, very very smartly, the professors set up a meeting with the clients so that the clients could tell us what they wanted with sixty-odd students providing enough questions. They basically wanted a management service for their day-center along with an app for the people there. One of the optional requirements was a text chat. Personally not something I'm super interested in doing but whatever, it's a group project, I'll do my part.
The actual development of the project was an absolute nightmare, but that's a story for another day. All I'll say is that seven juniors with zero experience in the framework we chose do not make a balanced dev team.
Anyway, like three months into the four-month project we've got a somewhat functional program, we just need to get the server side part running and are working our asses off (some more than others) when the client comes in and says that 'hey, nice app, nobody else has added the chat yet, but could you do voice recognition okay thanks?'.
Fucking.
Voice.
Recognition.
This was a fucking basic-ass management app with the most complicated tasks being 'make it look pretty' and 'hook up a DB to an API' and they want us to add voice recognition after sitting on their ass for three months??? The entire team collectively flipped its shit the second they were out of earshot. The client would not take no for an answer; the professor simply told us that they asked for it and it was up to us whether we delivered or not. Someone working on the frontend had the genius idea of 'just get them to use Google voice recognition' so we added the how-to in the manual and ticked the requirement box.
What amazes me about all that is how the client probably had no idea that their new last-minute request was even a problem for us, let alone that it was in a completely different ballpark in terms of implementing it from scratch.
-
Doing a distribution center job right now just to pay the bills and had to do voice order picking (with a headset and voice commands etc) today. Well, that didn't go ideally:
*girl walks by and gives me a tip/hint*
Me: *puts headset back on*
Headset voice: you've logged off. Truck could not be located by provided ID.
😓😮
I said "fair enough, thanks!" 😐
-
I taught an intro to programming class today, brought back memories of high school...
I remember when I started my first IT class in grade 10, it was a 50/50 split between IT theory and programming. Choices were Java or Delphi...I made the uninformed choice to do Java (thank goodness) and really enjoyed it. For some reason the logic and OOP concepts really made sense to me and I was well ahead of the class. I was always top 5 for maths/physics/chem and English literature but never enjoyed them for a second. On the other hand, programming was something I could do for hours and still enjoy. In my final year we had to do a project; most of my class was still struggling with very simple for loops and JFrames. The projects were terrible drag-and-drop NetBeans UIs that would convert meters to feet.
I remember being upset with the quality and ended up writing an entire client/server chat system with file sharing, voice notes, voice streaming, server admin controls, usernames and passwords (plaintext SQL of course 😂), admins/mods/guests etc...
Got 100% and a personal recognition from the headmaster...found out yesterday the staff at the college have actually been using it since the time I left.
I don't know why I typed this whole story, something about teaching the kids where I was myself made me feel warm and fuzzy inside
-
I've just spent the morning writing a C# program that pops up a message when I say hello. It's the most basic thing when I look at it now, but as I was writing it you'd swear I was breaking technological boundaries!
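For anyone curious what the core of a program like that boils down to, here's a rough sketch in Python rather than C#. It assumes a speech recognizer has already turned the audio into a text transcript, so only the "react to hello" part is shown; the function name and the reply text are made up for illustration.

```python
def respond_to(transcript):
    """Return a message if the recognized text contains the word 'hello'."""
    words = transcript.lower().split()
    # Strip basic punctuation so "Hello," still counts as "hello"
    cleaned = [w.strip(".,!?") for w in words]
    if "hello" in cleaned:
        return "Hello to you too!"
    return None


print(respond_to("Hello, computer"))
print(respond_to("Goodbye"))
```

The hard part, of course, is everything that happens before the transcript exists.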
-
Bloody scammers and bloody PayPal.
So I bought an Echo Spot just to see how good its voice recognition is, and also to see what the Spot does differently. I found out that it was like a hello world for AI, so I wanted to sell it on ebay-kleinanzeigen.de. It's a website from eBay here in Germany where you can easily sell stuff that you don't need anymore. I put it up there, and someone wanted it really badly: he said that he had broken his friend's Spot, that he had no money, and that he badly needed it cheaper. My price was 98€; I believed him and sold it for 85€. Now he has the device and wants a refund because the device supposedly doesn't match the description, and the things he mentioned weren't even in the description. The message you see in the pic says: it doesn't do Skype and that is impossible on that device. First, it is his responsibility to inform himself about the device's features. I'm not Amazon; I don't have to write something like that in the description, I just have to say how the device looks. Second, it does do Skype and it is possible, but both partners must have the same device and they have to connect it to their smartphones.
But the bad part isn't that my money got seized and I got owned by a scammer. The bad part is that I wanted to reply to his message but the bloody PayPal design wouldn't let me do that. Remind me, how old is PayPal now? It's been there for ages, and the footer is just stuck in the middle of the page and won't allow me to click on the reply button. Of course I later managed to write a reply, but PayPal shouldn't have these kinds of problems.
I'm so upset right now because these things are wasting my time. I have my final exam in a week and I have to develop a parameter-based multilingual CMS; just imagine how long the data structure alone will take.
-
Visited a store to buy an Amazon Kindle. Explored and found the Amazon Echo. Amazed by the quality of voice recognition at the price of 5000 INR. Also was shocked to find the Amazon Fire Stick at that price.
Amazon is the new Apple. Or Apple was an overhyped version of Amazon.
-
Created a GitHub open source project for voice recognition for React Native on Android.
Needed help developing some features, so I did my research and shared links to some articles.
Then some guy who showed interest in the specific issue just copied my repo and launched his own npm package with the new features.
Just threw my code in the garbage and felt so irritated. I did all the research and shared the research links, and he just duped me and left me to rot. No credit given.
Fuck that guy. This is not open source.
Rant over
-
Sweet, my motivation for coding my personal projects has started to come back.
Last night I set up my Personal Assistant project with text-to-speech and voice recognition.
Now I just have to get it to react to commands.
-
Finally, I made my first Android music player app. ☺️☺️
.
I don't know whether you guys will like it or not, but it has full voice recognition functionality.
It means you can run this app completely by voice commands 😋😋
https://play.google.com/store/apps/...
-
iPhone speech to text has come a long way. Definitely has improved. Real-time dictation rather than batching it.
I am currently doing approximately 50 percent of my rants by voice. In fact, the rant you are reading I did by voice.
You can easily do punctuation such as a period, new paragraph, new line, caps and lower case. The speech recognition is excellent even with my New York accent, and it learns the more you use it. Rarely does it get a word wrong.
Editing still has to be done manually and is a pain, but that may change, as Dragon already allows you to do in-line editing. iOS speech to text has already surpassed Dragon in some facets.
I do have to press the add new and post buttons at this time to post my rants. But that may change, as the enhanced dictation on the Mac allows you access to specific commands.
I will keep you informed of progress and I will be testing on Android over the next few days as well.
-
My colleague is trying to add voice recognition to the mobile application,
so each time he says "TEST" the mobile application answers with "The color is red", so he rages and screams,
and now I can't stop laughing
-
Try unchecking the voice recognition option for Google Assistant, then try saying "OK Google" after it's switched OFF....
-
A small request (This is a rant in my mind, phrased this way so that no more people get affected by this shit that corporations are doing.)
TL;DR: please please please visit https://voice.mozilla.org/en. They are the good people.
Amidst leaks of voice recordings of your personal activities, collected to improve their voice recognition and generation software,
why not donate some of your free time to improving Mozilla's software by speaking and verifying non-personal audio clips at https://voice.mozilla.org/en
Do visit. It's for the benefit of the society we live in
-
Still waiting for Google to add voice commands in my language (at least "OK Google" recognition).
The audio to text is working pretty well, I'll give them that...