Voice-Op Linux PDA 104
Anonymous Coward writes "At http://www.the-times.co.uk/interface/dailyextra5.html is
news of a voice-operated Linux handheld computer to be announced at CeBit next week. Sounds cool!" Oh yeah. Until someone shouts out, "ARRR-EMMM ARRR-EFFF STAR!" Then we'll see who's laughing.
Theft of PDA (Score:2)
"Here! Here! he stole me! nasty brute!
The joke (Score:1)
But without the Userspace Apps? (Score:3)
The user space apps.
Things like the names/dates/call logging application.
And, face it, most of the apps like that under the modern Unixes need to go on a resource diet if they want to fit on a handheld.
Who's been writing the lo-resource version of Xcalendar? OR a database?
cell phones are bad enough (Score:2)
Thank You,
Troll King
Voice-recognition pitfalls (Score:4)
ARRR-EMMM ARRR-EFFF STAR! (Score:1)
rm: cannot remove `rf': No such file or directory - :P
L&H voice technology (Score:3)
The importance of platform diversity (Score:2)
I see that the concept of voice-operated devices is viewed largely as a laughable matter. Personally, I do not view the potensial for "voice-cracking" as the most important aspect of these gadgets...
However, I think the linux (and /.) community should welcome the prospect of an expanding platform base in this field with enthusiasm. With the focus in the PDA/handheld field on WinCE, it would be a shame if this carried over to the field of voice-operated devices.
Personally, I think that much of the "voice" functionallity will reside in the mobile phone networks, and as such be independent on the operating system of the handheld device. But it never hurts to give developers a choice of platform technology.
I am looking forward to this device, and the voice-enabled applications. Although it would be "nifty", I do not think the VoiceShell (vsh) will be the most useful application...
Not quite (Score:2)
--
Re:The joke (Score:1)
rm rf *
Which doesn't really do much (Unless you have a file called "rf" in the working directory).
Me, i'd be shouting "ARRR-EMMM DASH ARRR-EFFF STAR", which would make much more sense
Re:HELLO....I AM SAM... (Score:1)
Voice Recognition (Score:2)
Sorry if this seems off topic.
- AZ
Dilbert Strip (Score:1)
Wizzy, wiz mis-features (Score:2)
Zaphod Beblebrox is trying to listen to the radio. I say trying because the damn silly thing is a highly sophisticated computer computer that interprets body movement as a request to change channels. Having to remain rock still once you get the channel is hardly going to improve your listening enjoyment. ;)
Things like voice control are great for text file production, but this kind of thing is often hyped just way too far.
Just as importantly, their is the issue of training the voice recognition system. Once it's calibrated, it might be fine most of the time, but what about when your voice temporarily changes when you get the flue?
Finally, there is the noise pollution factor. Modern open floor plan offices are noisy and distracting enough with telephones and what not. I tend to suspect that the introduction of voice controlled computers is going to be a no go unless people are allowed to work at home in a relative state of quiet.
yes but (Score:1)
The future of OSs *is* voice recognition though... I want to see the end of keyboards. And mice.
Stupid trick of the day (Score:2)
The damn thing started flipping between channels and window/full screen every time it picked a recognised "command" out of the current program.
I eventually managed to shut it down by unplugging the microphone.
Ah well, you live and learn.....
(Voice operated medical equipment, anyone??)
Re:cell phones are bad enough (Score:2)
Re:yes but (Score:3)
Then there's the mouse thing, I remember the dilbert cartoon:
Pointy Haired Guy: Higher, higher, higher, ok click there. Now! No! Not There!
Dilbert:
I mean, for certain things voice is all well and good. "open
Now consider this: the gimp
explain voice control in that? "draw the mona lisa"?
Until (IF) our thoughts can be interpreted, I'm gonna support my old keyboard and mouse. I have ten fingers, but I can only make one sound come out of my mouth at once. And well, despite its dopiness, the mouse works great. Oh, and I'm convinced that m$ should have been a hardware company, not a software one. Look at the intellimouse explorer. Working optical mouse, great accuracy, you never need to clean the ball and rollers, and it never looses tracking unless on a mirror or very smooth white surface. I have no complaints with mine. Actually, I've never owned any m$ *hardware* that I really ever had complaints with.
Re:rm /*.* (Score:1)
Re:text to speech? (Score:2)
I do, as a matter of fact, because I work there
The voice in Voice Xpress is indeed the typical monotone voice. But the latest and greatest voice synthesis technology is our RealSpeak. A demo can be found here [lhs.com]. You can type in anything you want (up to 30 english words), and RealSpeak sends you a
Re:cell phones are bad enough (Score:1)
If voice-accessible PDA's become commonplace, we won't know who's talking to their PDA and who's just nucking futz!
Re:yes but Other Possibilities (Score:1)
The future of OSs *is* voice recognition though... I want to see the end of keyboards. And mice.
I agree with the keyboards and (especially) mice becoming a drag. There are other inputdevices possible as well. There are alternatives.
Thinking here about the computer seeing when the navigator window has to scroll just by looking at your eyes. Knowing to stop when your pupils changes size, etc etc.
And off course: combinations of all the new inputdevices. Interaction through question-and-answer with your computer (computer: "give me a smile for Gnome, cry if you want to start KDE").
Re:rm /*.* (Score:3)
So, it's not a Dilbert joke anymore, it happens for real...
Re:The joke (Score:1)
so remember... every time you enter a room full of geeks with voice operated PDA's shout "ARRR-EMMM DASH-ARRR-EFFF SLASH-STAR" amd they'll all panic and run away...
Re:Security (Score:2)
For slightly better articles, try here [upside.com] and here [upside.com].
Re: the demo (Score:1)
Re:The joke (Score:1)
I think not. Since you're spelling it out in letters, you'd probably get something like this:
rmrf*
What you'd have to say is:
"ARR-EMM SPACE ARR-EFF SPACE STAR"
I think it'd be easier to yell something like:
"TELINIT ZERO!"
;)
Re:The joke (Score:1)
Oops... stick a "HYPHEN" in there too.
voice control for gimp CAN and SHOULD work (Score:3)
No, not quite. Voice control won't replace any 2-dimensional manipulator interfaces any time soon (at least not for non-disabled users). No one is claiming that the mouse will be rendered useless. After all, "a picture is worth..." Well, ya know.
BUT. How much do you really enjoy clicking around the gimp toolbox? Or, worse yet, searching for a filter you don't normally use in 3- or 4- deep menu system while losing that exact pixel you were over in the image. Right there, a secondary interface via voice would be ideal. No need to lift hand off mouse or move the pointer at all. Just "Use filter A, settings 50%, 3, no." I'm generally against voice recognition, but this would be one of the few spots I'd definitely want to see it.
dictation (Score:2)
Dictation is a much more natural integration of person to computer. Slower, but better. A newbie can do it just as well as a nerd. Why learn to touchtype???
End of command lines. End of keyboard. End of mouse - use touch-sensitive screens. Sorted.
Re:yes but (Score:2)
Actually, if I could find a cheap (sub 1k) and not hugely bulky text only head mounted display, i'd setup a wearable with a twiddler, my thinkpad, a backpack and whatever hmd I can find.
As for desktop/laptop pointing/input devices, I prefer the ibm TrackPoint keyboards (the one with the little pointing nub between the g and h keys) and those ancient IBM AT keyboards (the ones with metal springs, that make such a racket) for my console-only systems. The trackpoint is really cool for the occasional mouse user like myself, because it is really close to my keyboard. I just need to move one finger, and it's there. But for longer mouse activities, like quake, a joystick or normal mouse might be a bit better. after a good 2-3 hour quake run my finger is pretty sore. But then when I used to use a normal mouse, my wrist hurt.
Maybe I'm on crack here, but i thought that "microsoft" hardware was hardware that a another company made, and paid royalties to microsoft for the use of the name.
Re:no, because (Score:2)
Okay... you sit in your home directory and type the following:
acoward$ rm rf *
We'll see who's laughing. (Hint: it might complain about there being no file named "rf", and it won't recurse, but I'd imagine the files deleted would be a real forehead slapper).
--
Evan
Heh, how about this? (Score:4)
Would *that* qualify as "free speech"?
Seriously, Voice interfaces probably have a very limited usage. Some disabled would benefit (much). Hands free applications are very useful in cars and such, but typing is generally less tiresome.
Sure many people type faster than they speak (at least if it is to be interpretable by a machine) but the main problem is that speaking for an hour is very tiresome (and irritating for those around), and commands by voice are difficult compared to mouse and keyboard. ("Swap those two words,... three sentences back" as opposed to drag and drop or the arrow key dance.).
Still cool is always cool...
Re:dictation (Score:2)
The other thing is gameing. I just dont think quake would be as cool if you just touched your oponents to splatter them.
keyboardless/touch monitors my have a chance in clueless newbie/ocasional useage/wearable/pda markets, but as for me, I need a decent interface that is comfortable to use for extended periods of time.
re:yes but (Score:1)
Re:Stupid trick of the day (Score:1)
Eavesdropping (Score:2)
Re:The joke (Score:1)
Re:Wizzy, wiz mis-features (Score:2)
Voice Recognition would be pretty useless in a "Call Centre" type situation, where you have the operator already talking to a customer and operating the computer at the same time. You can only talk to one or the other, and what if the computer thinks that you're talking to it, when you're talking to the customer?
Kind of makes voice recognition in many office enviroments useless, i would think.
Voice recognition, holy grail or waste of time.... (Score:3)
Have a read what Jakob Nielsen (one of the greats of User Interface design) says, he presents one of the better arguments as to why voice recognition just isn't that good a way of interacting with a machine. Most of the things that voice recognition is pushed forward for can be done better and with greater accuracy with your hands and a well thought out display. There are certain cases where it is the best option, and possibly a PDA is one of them (although I use a Psion and don't have any problem with it at all and I wouldn't want voice recognition) but for the most time its a gimmick that doesn't stand upto the demands of the user.
rm -rf .* is the most lethal (Score:1)
Voice recognition is useful for a PDA (Score:4)
The killer applications for a PDA are the contact info, schedule, and memos - in general, maintaining a database made of records with a small amount of data in each field. Short messaging (integrated with E-mail) too, I guess - still small amount of data. Everything else is bells and whistles. People do not write long texts on a PDA - they use laptops, or at least buy one of the nifty folding keyboards for their PDA. People do not run GIMP on a PDA.
For these killer apps, a voice API is great: "show today's schedule". "new meeting, March 14th, at 10, with L&H". "new memo: buy milk for santa". "new expense: the L&H account, 112$, business lunch". "show contact Joe". "Message to Jane: Lunch at 2?".
I'd expect you'll need to push a button to make the PDA listen - I wouldn't like one which listens all the time (it might make sense for a desktop system but not for a PDA). I also expect you'd still have a touch-sensitive display, and be able to use a stylus for menu navigation and writing. Just like desktop systems did not give up the keyboard when they got the mouse!
Something like the "Itsy" would be perfect for the above. Take my REX-PRO and add handwriting recognition like the Palm's and voice recognition like the above and you end up with the perfect PDA. The only possible improvement would be integrating it with a cellular phone, or maybe with a holographic projector
Obviously working on the voice UI would take a lot of effort to get right. I predict the initial offering - by L&H or whoever - will flop like the Newton, to be followed by a Palm-like successor which would get it right.
And both L&H and Compaq know this. Thats why they are both using Linux; writing a voice UI that works is a classical open source "itch to scratch". They'll be able to obsolete the first generation software and replace it with a second open-sourced generation - while maintaining the same hardware platform, escaping the Newton's fate. Good move for them, good move for us, bad news for Microsoft
correction, sorry (Score:1)
Re:rm /*.* (Score:1)
not that such pendantry really matters anyway ;-)
--
but maybe you missed the point (Score:2)
The concept of a desktop computer is so un-natural! Especially with a tube firing electrons down it, producing a flickering and raditation emitting output. Bah! I want to sit/lie in my bed, with my PDA, and read it or talk to it like a book. Not that I talk to books... tell me, how easy is it to type when lying on your back in bed?
It's about time we stopped adapting to computers with keyboards and CRT tubes and adapted them to us.
The other thing is gameing. I just dont think quake would be as cool if you just touched your oponents to splatter them.
Although I would agree with you, I think that to play games you would probably want a joystick? Or just stick to chess? :o
Voice Recognition and linux (Score:3)
I haven't checked in a while (may a bit outdated), but heres some linux speech apps [phil.muni.cz]
For those that really wanna play, check out ISIP 's ASR project [msstate.edu].
For those that are interested in aquiring speech corpa (training data) check out The LDC-online [upenn.edu]. Get the free guest account, use your perl skills and your imagination, and suddenly the TIMIT corpus is yours
Email me if you're interested in this kinda stuff (or want my timitgrab.pl script)... its not my primary address, but I check it from time to time.
Handwriting-text entry and audio in/out jacks (Score:1)
Re:yes but (Score:1)
What needs to happen is that our brothers in the AI field need to perfect natural language recognition (NLR) so that a computer can not only translate what we say into text, but UNDERSTAND what it means. That we we would be able to use the MEANING of a statement instead of the statement itself to tell the computer what we want it to do (think of Star Trek).
twinkle twinkle little *? (Score:1)
still won't work as it should translate to
rm dashrf star
which should cause no harm at all.
How would Metacharacters be entered anyway? ESCAPE STAR? LITERAL STAR? And how would the ESCAPE or LITERAL be escaped?
Seems to me that voice commanding some appliance is not so easy after all?!?
What's the problem? Why do you resist this? (Score:1)
Re:no, because (Score:2)
"Arrr Emmmm (pause) Dash Arrr Efff (pause) Star (pause) Enter"
Re:no, because (Score:2)
Re:Voice-recognition pitfalls (Score:1)
-dvorsd
Re:correction, sorry (Score:1)
This already exists, it's called linuxconf (Score:1)
Re:yes but (Score:1)
I agree that voice control alone is a dead end, but it has potential when integrated with other techniques for target designation.
Re:Voice-recognition pitfalls (Score:1)
A friend of mine was at a friends place when he said something which seemed to "activate" his Mac. He was told that it responded to voice commands and he simply had disengaged the screensaver (or something). So my friend being the genius that he is starts playing around with commmands. He cuts and pastes text and then says "This is cool, what would happen if I said shutdown?" and it proceeded to shutdown. To this day I won't let him near my computer.
I want voice recognition on my desktop at home (Score:2)
I have no fear of the keyboard though. I don't mind typing. In fact I often find it annoying to reach for the mouse. Some voice recognition capabilities would be nice though--especially at home.
Seriously, I can already picture how I can make my whole apartment voice activated. "Turn on fishtank" would turn on the light in the fish tank (X10/firecracker,) "dim lights 75%", "play sublime 40 badfish."
I doubt I'd ever use voice recognition instead of typing in the shell other than for that kind of thing. But could certainly use it in a standalone app that executes shell commands based on voice commands i've specifically taught it. One of these days I'll get around to doing something like this.
numb
Re:Security (Score:1)
Screw PDAs! (Score:1)
The ViaVoice SDK comes close, but I havn't found any well-done frontends to that, even. I wish Dragon or L&H would release a product for Linux, or at least one that works with Wine.
Re:Yes, you DO talk MUCH faster than you type!!! (Score:2)
Those are the figures for speaking at speeds that DNS can accurately transcribe. Yes, you do have to also account for correction, but if I'm writing I combine this step with my normal editing. It adds time but not too much, especially after you've used the system for a while and are getting good performance (98% or so).
When writing a report or whatever SR is easily faster than typing. It's not perfect, it doesn't work well for things other than text entry and command and control, but for what it does (which incidently is also nearly everything that people use their keyboard for) it does well and faster.
I just thought of an interesting idea... playing a 1st person shoot-em-up with mouse control for aiming and voice control for most other actions. I guess I'd still want the keyboard for movement, voce control of trying to run or whatever would still be kludgy. Oh well.
Command not complete.... (Score:1)
nothing would happen.
You all forgot to scream
"ENTER"!!!!!!!!!!
Re:Voice-recognition pitfalls (Score:2)
This, to me, suggests that these stories are urban legends. If they're not, then they are indicative of a horribly stupid implementation of voice recognition : In a moderately loud area, or an area where more than one person will be using voice rec, headset mics with 6" pickups should be used.
Don't blame on the voice recognition software what is in reality caused by inepitude and lack of foresight.
Re:no, because (Score:1)
arr emm (pause) dash arr eff (pause) star (pause) enter
Nope, didn't work... slashdot is still here!
Converse, as in conver-say? (Score:1)
Furthermore, when will that 11 node neural net the guys from USC came up with, be used in these kinds of products?
Re:yes but (Score:1)
Re:but maybe you missed the point (Score:1)
That's what I call adapting my computer to me:)
I guess I was refering to the mouse under quake... keyboard/mouse make a pretty cool combo.
Sometimes I think... (Score:1)
I submited this story I found on USENET and they
desided not to post it. It's about Samsung's
Linux PDA.
--- Story - Start ---
From: "Amandio J.S. Bacalhau"
Newsgroups: comp.sys.palmtops.pilot,
comp.sys.palmtops
Subject: NEW info about the new Linux PDA from
Samsung !
Date: Sat, 12 Feb 2000 13:21:41 -0800
I received this info about the Samsung Yopy
(http://www.sem.samsung.co.kr/eng/product/digit
[snip]
We are going to show Multimedia PDA YOPY at Cebit
show in Germany for the first time in the world
from Feb. 24 through Mar. 1, 2000.
In regards to specific information(like performance) will be available on the digital
website from the end of Feb.
Of course, we will provide you any new information
on YOPY when we are ready via e-mail.
YOPY will be available in the market from the
second quotor of this year. Then you can meet YOPY
in your area. We are working on launching plan for
the product such as price and sales channels.
[snip]
Anyone going to CeBIT ?
Amandio J.S. Bacalhau
--- Story - End ---
Make of it what you will.
Re:I want voice recognition on my desktop at home (Score:1)
"Computer, read the first line of the FooBar file to me"
"Computer, start downloading the Quake3 demo."
It'd be great.
Re:Stupid trick of the day (Score:1)
Make Seven
Welcome to Gear [TM] (Score:3)
Morning. Get up. Get dressed. Put on your Baldric, a Miss-Universe-style sash made of trendy-stereo-grey squares, roughly the size of cigarette packets. You're going for state-of-the-art, so your Baldric contains:
- a RAM RAID, four or five Gear Cells of high-capacity, non-volatile memory, redundantly copying each other so that nothing short of a flamethrower will cause memory loss.
- a Jack-In-The-Box, a cell containing a speaker, microphone, infrared and microwave tranceivers, all sorts of cable in/outs, and all the software necessary to allow your Gear to communicate with the mobile phone network, internet, infranet, and you.
- a Brain Cell, a pluggable, replaceable processor.
- an Eye Ball, a cell containing a digital camera and a projector; this does most of the visual display work, projecting on a nearby wall, or connecting to your optional heads-up display.
- a Handle, a slightly oversized cell with a chord keyboard _and_ a Palm-style stylus/graffitti-pad arrangement for quick, quiet text input.
You operate your gear using voice commands, mostly, but like most power users you don't only use English. GearCorp have followed the example of Palm Computing, whose Graffitti is not quite standard handwriting but rather a modified, streamlined version. Knowing that some sounds are easier to detect than others, they invented a language called Glish. So: a casual user might open a work file with the command "Menu File. Open. Section 'Work'. Section 'Memo'. Document 'DailyMemo'.", On the other hand, you, as a power user, would say "Fie Oh Dok At 'Work' At 'Memo' At 'DailyMemo'". Rolls off the tongue, and is much quicker for you and the Gear.
Go to work. That is, go to the park, sit there and conduct work in relaxed surrounds. Take calls, write programs or documents, "attend" meetings, all while sitting on a park bench watching the world go by. If you need confidentiality, use the Handle, or speak in Glish. In your briefcase you have a full-sized foldable keyboard and a foldable flatscreen with easel legs, so you can avoid using the Handle and the Eye Ball if you like.
I think it'd work. I think it'll be here within five years. And I think it'll change the computing world more than anything since VisiCalc.
: Fruitbat :
Re:rm /*.* (Score:1)
There's something worse (Score:1)
chmod -R 644 / dosdir/*
</TT>
Because if you do rm -rf
--------
"I already have all the latest software."
Re:text to speech? (Score:1)
--------
"I already have all the latest software."
The wrong approach (Score:1)
We don't take even partial AI seriously enough.
--------
"I already have all the latest software."
Carpal Tunnel, Good User Interfaces and Designs (Score:3)
Do you want the new user interface applications developed in open source on Linux, or only on MSWin3K and the occasional Macintosh? Yeah, I thought so... There's also the PDA-like devices that will come from the cell-phone makers, and it'd be nice to have good programming interfaces to them. Some things will be killer apps, others will be toys we get bored with quickly, but open development environments will make it easier for everybody to try things out.
Some user interfaces are just dumb replacements for keyboards on machines that have conventional-sized screens. There are a lot of problems for which this is adequate, including the typing-impaired but also applications where you want hands-free but don't need to be eyes-free, such as information kiosks ("mirror, mirror on the wall, where can I find beer in this airport?"), reference-finders for workers in messy environments ("zoom in on the picture of the carburetor"), etc.
Voice commands can also be mouse/menu substitutes, for people who like them. A long-known safety principal is to limit the commands to a relatively short set of very safe commands. You don't want to have "rm -fr *" there, but "mail" and "phonebook bob smith - yes - dial" are pretty safe. (Ok, there are still risks like that web site with the background sounds saying "phonebook 1-900-RIP-OFFF - dial", but you can decide how much risk management you want. And you want it to ignore almost anything after the keyword "Daddy".) One of my coworkers had a PC-based application; we'd be on a conference call, and he'd occasionally interrupt to tell his computer to fetch a file. He doesn't use it much any more - I'm not sure if the novelty wore off or if he decided to cut down his weirdness quotient on the phone.
If you're willing to do voice input and output, portability becomes more practical, and computers can be a lot smaller because they don't need screens and keyboards, and more flexible because you can stick them in a pocket or backpack and use a headset. Sure, people will look at you funny walking down the street talking to yourself, but here in San Francisco, half the people on the streets are either talking to their cellphones or their liquor bottles, and society has adjusted to it. A hands-free voice portable makes an interesting combination with a GPS system and datacomm; it can give you while you're driving, tell you about nearby restaurants and traffic jams, and maybe let you call nearby cars ("Hey, CA123456, use your &^%&^% turn signal!").
MP3 Players can also benefit from voice interfaces, since it mainly requires adding a bit of storage to the computer you're already carrying. ("Computer, play Dark Side Of The Moon three times, volume low, speakers, order large pizza from Foobaros.").
oh god... (Score:1)
Re:cell phones are bad enough (Score:1)
Nothing about spilling your personal life to anyone who'd listen, but what's a typical citizen to do when seeing someone looking at a black box, talking out loud? (I.e., that person's talking to himself! Nuts!)
Heheh. ALthough, I do see a use of vocal "rm -rf
How about foreign language SR.. (Score:1)
Re:This already exists, it's called linuxconf (Score:1)
As for Linuxconf, I know it (of course I do) and I use it too. Changing things like default runlevel or the services started at boot-time is a lot quicker with Linuxconf than having to change bootscripts, like in the old days with Slackware. Good thing that -for example- RedHat ships with it.
Re:no, because (Score:1)
Scott Taylor
Re:twinkle twinkle little *? (Score:1)