Linux PDA w/Voice Recognition 69
Hotaine writes "Lernout & Hauspie is working on a Linux PDA with speech regognition built-in. " It can do things like read (and record) emails, find weather, do stock quotes or whatever. Of course, you can't go out and simply buy the vaporware yet, but it has potential.
Re:Linux PDA speech CLI (Score:1)
Great books, anyhow. Rick Cook also writes relatively clueful tech articles...
---
pb Reply or e-mail; don't vaguely moderate [152.7.41.11].
Re:Has anyone seen this? (Score:1)
And more importantly, I want to know if anyone knows if there are screenshots.
-Brent(Insert platform) already has voice recognition (Score:1)
My first voice recognition system was Voice Master for the Commodore 64.
I'm sure there is a long list of systems that allready has this feature.
However I'd like to see Linux on that list. A project exists to do just that but I'm unimpressed with the results so far
Re:L&H are MS friends... (Score:1)
Ehm... L&H already does cool stuff on Windows
Has anyone seen this? (Score:1)
Re:Only if you don't have adequate fundamentals. (Score:1)
--Robert A. Heinlein
Old old old! Do some research! (Score:1)
disappointed, and waiting to get moderated down :(.
The Microsoftication of Linux (Score:1)
Who needs ethics when we can just get down in the mud with Bill...
And, while we're at it, why not get rid of this whole Open Source concept for this Linux PDA - it's not like we're any different than the old guard
Re:Has anyone seen this? (Score:1)
Isdaron
Re:Nice...but is Linux what we really want? (Score:1)
Why have a talking head on screen? Thats about as useful as the dancing paperclip in everyones favorite office suite. If thats all you are going to do with the screen, why just not have a screen. I suppose the screen could be used for displaying data, but please, no talking heads when there is no data to display.
Spyky
Re:Has anyone seen this? (Score:1)
Cat whey eye cain evil hugh eat hit eats trance later.
old stuff (Score:1)
what if.... (Score:1)
Re:Here's an idea (Score:1)
A microphone device that can translate your voice to text, and it outputs text into where you plug your keyboard in.
OS independant, usable most anywhere a keyboard can be used. Boy would it fool typing tutors, and alt-ctrl-delete is no longer a three finger salute but
bind "reset" to alt-ctrl-delete, instead of saying "hold alt hold ctrl hold delete release" or something as silly..
Re:Here's an idea - Already in development! (Score:1)
Re:Linux PDA speech CLI (Score:1)
I found three on The Very Big Online Bookstore [amazon.com] and found the fourth on Abebooks [abe.com]. They are (maybe in order) The Wizardry Consulted, The Wizardry Cursed, The Wizardry Quested and The Wizard's Bane by Rick Cook. Fun stuff; and this PDA brings us on step closer to making SF^H^H fantasy a reality...
wtf with the corporate doublespeak? (Score:1)
The rest of that was just crap.
Re:This is NOT COOL! (Score:1)
Interesting note (Score:1)
Voice Recognition vs. handwriting recog. (Score:1)
--------
Give me a.... (Score:1)
Hmm...
Or better yet... Now with my PDA I can be like James Bond, and tell my PDA and my new car with three legged robot where to go while shooting my nerf gun at evil borg's in my back seat...and then I can get apache for PDA and stream it over the internet..
hmm....oh well.
BTW...Linuxhaven.com got updated... (Score:1)
Way cool... (Score:1)
That thing better come with a speaker... (Score:1)
Monotonic computer voice:
BASH Apt COMMAND NOT FOUNDBASH Hem COMMAND NOT FOUND
BASH A-HEM COMMAND NOT FOUND
BASH ! EVENT NOT FOUND
EXECUTING COMMAND DAMN
UNABLE TO DAMN WIRELESS MODEMS.
OOPS - Debian not found. S**t.
I can see it now... (Score:1)
Ahem... APT-GET update.
A-HEM! APT-GET UPDATE!!!
Damn wireless modems.
Re:Linux PDA speech CLI (Score:1)
Alex
Urban Myth (Score:1)
Thad
Why post it then? (Score:1)
SR not reliable enough (Score:1)
Re:I still wanna scream (Score:1)
ROOT AT LOCALHOST ARROW CURSOR
Re:Way cool... (Score:1)
Nice...but is Linux what we really want? (Score:1)
Now on the topic of PDAs. I think the ultimate PDA would be a little thing that looks like the PalmPilot but w/o any buttons or screen inputs. On the screen is a talking head (AT&T In development) and it speaks to you (AT&T in devel, but really fantastic quality alread). It interprets what you say via speach recognition and *looking for the right word* a low level AI language interpreter. You would talk to it, like "Can you remind me to pick up some food?" or "John's e-mail address it blah at blah dot com and he is a moron." And then you can recall the info by saying "Who was john again?" Etc. Etc...like a portable secretary.
Roy Miller
Re:Nice...but is Linux what we really want? (Score:1)
The Head can display data on the screen too, what did you think it was going to do speak the whole thing out? You've gotta be kidding...
Roy Miller
Re: on a screen is a talking head ... (Score:1)
Voice Operated Linux.... Vs Voice Operated Win... (Score:1)
Another thing... how much does it help to have a portable linux? Do you really care what operating system tells you that you have a doctors appointment in an hour?
Vaporware.. (Score:1)
Re:Here's an idea (Score:1)
Re:This is NOT COOL! (Score:1)
Ummm . . . you must mean voice recognition secruity.
For those things, as far as the computer is concerned, no, you can't do a perfect impression of Sean Connery. Its not looking for sound patterns that can nessarly be heard by humans. It looks for small subtilities which are considered impossible to fake.
However, this article is talking about a system that will allow you to give commands into a computer and it will act on them (a la Star Trek). In this case, yes, you might be able to pull off an impression of Sean Connery to the computer. But security isn't the point behind such a system, anyway.
----------
Re:L&H are MS friends... (Score:1)
Which means that Bill will wait for them to make something cool and then make a half-baked port of it to Windows ;)
----------
Speech Software (Score:1)
Re:Reminds me of a story I read... (Score:1)
While they were setting up the demonstration someone in the audience would have said "start" - "run" - "format c:" - "enter".
The thing is, that this would not work enless Windows was installed on the D: drive (which is rather onlikely). "deltree
Re:Linux PDA speech CLI (Score:1)
Re:How well does it work? (Score:1)
Re:It does work --NOT!!!! (Score:1)
If it really worked, you wouldn't HAVE to read the Fine manual. Instructions = talk into microphone! Fact is: It works the more you use it (therefore it can adapt to your voice) but the early going is too rough for most.
Right now, voice recognition just plain plain sucks. Learn how to use a DVORAK keyboard and you'll be far better off. And, you won't need a Coppermine with 512MB RAM!!!
Lost faith in those things (Score:1)
---------------
Re:Lost faith in those things (Score:1)
---------------
Cool (Score:1)
"I'm sorry Dave an error 1 has occured while you were compiling"
Damn...
Re:Speech Software (Score:1)
Either:
1) They made one hell of a breakthrough in voice-recognition technology that will have repercussions throughout the whole PDA and PC industry, with the elimination of the requirement that everyone who runs it has a Pentium III 800 or better system,
*or*
2) This is complete vaporware.
Re:patents? (Score:1)
Talk to it... (Score:1)
Re:patents? (Score:2)
And I believe they deserve a software patent. People have been working on voice recognition for a long time, and it doesn't work all that great so far... if they did something neat, this is what patents were created to protect...
How much fun would it be if this WAS integrated into the OS? You'd have to give it verbal OS calls instead of common-sense commands... lots of fun.
E.
Box (Score:2)
Anyway, the main character (a detective called Nathan Spring who was sent out to police a small base on the moon) had this really neat PDA that he called "Box". It was a computer, internet terminal, telephone, diary etc. all rolled into one, and it had a voice interface. And a kind of personality too.
I seem to remember in the end it self-destructed in order to take out some bad guy and save Nathan's life.
Ever since then I've been waiting for these devices to appear. I can't be bothered with handheld computers as they are now, the interface is just too narrowband. I want my own Box.
Consciousness is not what it thinks it is
Thought exists only as an abstraction
Re:Way cool... (Score:2)
Alternately, there's IBM's ViaVoice [ibm.com] SDK, but it's closed-source, and I think it's still in beta. There's also CVoiceControl [kiecza.de], but it doesn't look like a very sophisticated implementation. (Then again, I haven't tried either of these). There's still a need for a free-as-speech, smooth-as-glass voice recognition library.
ObOnTopic: Speech control is PERFECT for a PDA-type device. It gives you almost all the flexibility of a CLI, without the overhead of a keyboard. But methinks it'll be tricky to get it running smooth enough so I don't have to repeat commands, or otherwise utter them like two inches from the microphone
Re:Lost faith in those things (Score:2)
Re:Odd Move for Lernout&Hauspie (Score:2)
Re:How well does it work? (Score:2)
And it's not running on a Palm, it's running on a Strongarm II.
Choosing Safe Voice Commands (Score:2)
So you pick a few dozen commands that you do often and that can't do major damage without at least some safewords around them.
Voicetyping things into text input boxes isn't as dangerous - so you put some wrong stuff in a paper you're writing, and it's got the usual interjections from editing and real-world interruptions and phone calls. You can edit that later.
Picking context-sensitive sets of things you actually want to talk to your computer about can be tougher...
Only if you don't have adequate fundamentals. (Score:2)
Unix-like systems give you a lot of flexibility in letting things talk to each other, including keyboards and their substitutes, processes, etc.,
and they're not picky about what hardware really exists as long as it's not too weird.
The critical issues for porting voice to Unix are things like making sure the scheduler has enough hard real-time support that the voice recognizer doesn't starve, working within a multi-process environment, and dealing with kernel-userspace boundaries. For instance, if your voice recognizer provides context-sensitive vocabularies, how do you keep track of which context(s) belong to which processes, and if they're in the kernel or other protected storage space, how do you pass vocabularies to them efficiently?
But most of that's pretty straightforward, and application level, and can't be worse than adding X Windows or NeWS was, and certainly not as difficult as adding TCP/IP had been.
More importantly, if you want to build voice applications, what kind of services do you think an OS will need to support them? Are they available in Linux? Do you need to add them? Is there anything big you have to get rid of to make them work? Sure, you could junk it all and use a capability-based system like the
[upenn.edu]
Extremely Reliable Operating System
or some microkernel thing. But what do you need?
Re:Lost faith in those things (Score:2)
They talked for a minute and he asked a fwe questions, trying to figure out who the computer was for. He came to the conclusion that it was for her.
He was helping her around the store when curiosity finally overwhelmed him, and he had to ask her why a blind person was in a computer store buying hardware. As it turned out, everything we do, she did. She was a computer geek. She was even a programmer. I'm not sure whether or not she installed her own hardware:)
She used voice commands for everything. Including writing code.
This point of this diatribe is to point out that while the keyboard and mouse may be fine for you, there are those who need the software to be productive.
As for the exercise, theres always masturbation.
Here's an idea (Score:2)
patents? (Score:2)
Grtz, Jeroen
How well does it work? (Score:2)
Re:Shades of 2001 (Score:3)
'Holy mirror images, Batman!' -Dick Grayson.
Networked Gargoyles; Talking To Yourself In Public (Score:3)
One negative about Voice PDAs, like cellphones, is that you spend time in public talking to yourself while you walk down the street. (Here in my part of San Francisco, half the pedestrians do that because of cellphones, and half because of substance abuse...)
If you add visual displays, e.g. i-Glasses, you're adding extra volume and weight, and you get to walk down the street in Gargoyle mode :-)
Reminds me of a story I read... (Score:3)
Wonder if there could be an analogous action for this new Palm software.
Shades of 2001 (Score:3)
And when they come out with a marketable version, it will be called HAL.
Gonzo
Some more info... (Score:4)
I saw a demo of this thing a couple of weeks ago. Basically what they did was port the Voice Xpress engine to linux, and make it work on a Strongarm II processor. Right now all that was built was a single prototype, and some issues like battery usage are still to be worked out before the thing goes into production. (Estimate : end of this year)
What was also done, was building in the L&H Realspeak synthesized voice. This allows real dialogues to be held with this device. For instance :
-Check mail !
-You have five new messages
-Summarize !
-Message one, sender is J. Random Hacker, Subject : kernel hack. Message two, sender is Bill Gates, Subject is Make money Fast...
-Read the first message !
-Hi there... I just finished a patch for the latest kernel update... blablabla...
-Reply !
You could then use this to dictate an e-mail (because it uses a full Voice Xpress engine), or order books on Amazon (If you wanted to shop there) using WAP. All in all, a pretty nifty device. I want one myself
And no, it was not vapourware, it really works : with the VX engine you can already get 95% accuracy if you want to, with a large vocabulary. The smaller the vocabulary, the better the recognition. How many commands do you need to read your e-mail ? You do the math.
Oh, yeah, it also has a stylus and screen that fold away into it, just in case you really wanted to read or write stuff.
Adding voice recognition is the wrong idea (Score:4)
When Microsoft took DOS, a single-user operating system, and added Windows to it, they ended up with a really terrible hack that didn't run either, up until about version 3.1.
When Microsoft later took Windows and added pen support (not CE, but Pen Windows--read your history!), it flopped. Windows wasn't suitable.
When they attempted to port a subset of NT to pen-based devices, they got CE, which has underperformed.
On the other hand...
When Apple abandoned the Apple II and designed the Mac as a graphically-operated machine from the ground up, they got a great platform.
When Palm threw out common notions of a shrunken PC and designed their device from scratch, they won.
My point is that a good voice-controlled system needs to be designed as a good voice-controlled system. Strapping it on after the fact won't cut it.
Linux PDA speech CLI (Score:5)
Computer: "slash"
Me: "l-s"
Computer: "slash u-s-r, slash h-o-m-e, slash b-i-n, slash b-o-o-t, slash e-t-c..."
Me: "cd slash e-t-c"
Computer: *silent*
Me: "vi profile"
computer: (reads file contents)
Me: "quit"
computer: "error"
Me: "exit"
computer: "error"
Me: "wq"
computer: "file is read only"
Me: "q!"
That's my kinda computing
-josh