Monday, August 8, 2011

Rise of the Machines (Part 1)


Part one of a multi-part series commenting on next steps in the evolution of human-machine interfaces and the rise of “apparent” machine intelligence.

We all remember The Terminator and Skynet – the evil machine intelligence that attempted to retroactively (via time travel) kill its nemesis, John Conner.  http://en.wikipedia.org/wiki/The_Terminator
 
What we might not think about is that many of the concepts in the movies are based upon science fact, not science fiction.  And, no, I’m not talking about time travel – as attractive as the concept might be.
I’m talking about the human-machine interfaces and autonomous robotics concepts that were demonstrated both directly and indirectly in the series.

In the fictional not-too-distant future, Skynet began “life” as a military expert system, designed to control robot military systems.  Many of these systems were simply unmanned autonomous or remotely operated robotic killing and spying machines – not unlike what the military is developing and deploying today.  The most interesting, though, was the Terminator.  The Terminator was an “infiltration unit”, designed to interact with humans.  Its mission was to spy on the remaining population, infiltrate their positions, and then bring in killing robots to eradicate them.

The Terminators were designed to interact with humans without being detectable as agents of Skynet.  As such, they looked human, acted human, and even smelled human – “bad breath and all”.  In the story, they were capable of autonomous operation within defined mission parameters.  They were capable of voice-based communication, and they were capable of gesture-based communication.  They used what might be called expert system technologies to interpret what was going on around them so as to fulfill their assigned individual missions.

Much of what enabled these infiltration units to operate is being developed today – albeit in an uncoordinated manner.

EXPERT SYSTEMS

IBM is perhaps preeminent in the development of expert systems that are publically visible – that is, unclassified.  Their Deep Blue chess systems play at the Grand Master level.  Their Jeopardy system, Watson, plays at an expert level – in a field that requires not only massive information levels, but also the ability to make sense of complex and occasionally obscure relationships.  

One can be sure that IBM and other companies have developed and continue to develop expert systems for commercial and military applications. One example - airplanes that can land themselves.

A particularly interesting kind of expert system, one that most of us already use on a regular basis, is the navigation system commonly called a “GPS”.  This system is tightly coupled to its operational domain.  It has knowledge of location, direction, speed, and a database that describes the surrounding environment, including roads and commercial facilities.  It also has access to real-time updates concerning road conditions and traffic conditions.  Within its limited domain of operation, these units can make complex decisions to choose optimal navigation paths.

Perhaps more interesting is that these systems, like the other expert systems described above, are relatively self contained.  They can be effectively embedded into other higher-level expert systems.   
The concept of “nested” or embedded expert systems (with higher level expert systems invoking lower level “components” to solve problems in specific domains) gives us what might be the beginning of machine intelligence – at least the raw material of expertise in multiple domains and the concept of expert systems that can identify when a sub-domain is being referenced (like in Jeopardy).

MACHINE-HUMAN INTERFACES

In the Terminator movies, the ability to interact with humans was critical to the success of the evil intelligence.  The infiltrators had to be able to pass something like an extended Turing Test to be successful.  http://en.wikipedia.org/wiki/Turing_test
 
Intelligence is extremely difficult to concisely describe – like Turing, we often simplify by using a negated test that shifts the burden of definition to someone else – if we cannot tell it is not human, it therefore must be human.  If we cannot tell that it is not intelligent, it therefore must be intelligent.

One key to this concept is the human interface.

In order to pass this extended Turing test concept, a machine intelligence, like the Terminator, must be able to pass for human.  A successful human interface for a machine intelligence, like the Terminator, must address all aspects of human-to-human communication.  It must be able to recognize and convey voice, facial, gesture, and “body language” information simultaneously.   It may even convey information by sweating and smelling bad.

Of course, we are a long way from producing a machine infiltrator with this complete capability set.  We are, however starting to explore many aspects of this complete human-to-human communication.

Products such as Ford SYNC provide limited voice interaction within the strictly defined domain of in-vehicle communications and vehicle-based entertainment system control.  Advanced avionics systems track what a pilot is looking at and interpret commands using this information as context.  University and commercial research is being done on reading and conveying facial expressions.

Predictions from NostraDavis

The seeds are being planted, with much more to come.  In subsequent articles, we’ll look at what has been done and what is on the drawing board, and how it might all come together.  Some predictions:

a) Expert systems for specific domains will continue to be developed and ultimately commoditized.  Just as we have navigation systems in a box and chess playing programs on a chip today, we can expect advanced medical expert systems, law systems, vehicle operation, weapons systems, sensing and surveillance systems, and other specialized knowledge domains to be addressed and ultimately ‘packaged’.

b) Human interfaces are moving from theory to practice.  Though simplistic, Ford SYNC is an example of an agent with control over multiple domains – in car entertainment, voice communication, navigation to name the basics.  Like Watson, it has the ability to recognize the change in context from one domain to another, and to invoke the expert in that domain.

c) Products that track eye movements and facial expression are just down the road.

Wednesday, August 3, 2011

Self-driving Vehicles

Modern airliners can fly and land under control of on-board computers.

Google has been experimenting with cars that can drive themselves.  Many others, including a number of automobile manufacturers are also heavily investing in this technology.

I noted a few weeks ago that Nevada has started the process of investigating how laws should treat cars that can operate autonomously. 

I think that's great.  We can't get there soon enough.  Here's why...

THE PROBLEM

A quick scan of the web turned up the following statistics:

1. Approximately 30,000 people die in traffic accidents per year - just in the US.  The number is 1.2 million across the world - or 1/4 of all accidental deaths.

2. Drunk driving accounts for 1/3 of that total - staggering - not in a funny way

3. There are about 8 collision coverage claims per 100 insured per year, with an average payout of $4100 per claim. (US numbers)

4. Because of all of this, people (you and I) must pay the insurance companies (just in the US) about $110 million per year, with about 70% of that going for the actual claims and 7% going to profits.

That's the cost of   PEOPLE   DRIVING !

OK.  Now let's consider other costs.

a) How about inefficient utilization of the roads ?  Inefficiency means more road investment than necessary.

b) How about inefficient use of commuting time ?  US residents spend more than 100 hours per year commuting alone.

c) How about the price of fuel ?  Yeah, we all understand that one!


Putting it simply, autonomous vehicles (self-driving cars and trucks) will have a powerful effect:

1. Reduction of accidents, in particular reduction of drunk-driving related accidents
2. More efficient use of roadways - dramatically increasing vehicle packing means paying for less roadbuilding
3. More efficient use of fuel - yup, computers can do a better job that us
4. Reallocation of time spent traveling for business or personal use - who couldn't use an extra 100 hours per year ?

and

A large portion of the 1.2 million people identified above simply won't die and many, many more won't be injured.

SO.  HOW DO WE GET THERE ?

First, the policy makers must start treating this as a priority.  Call your representative.

Second, the law makers must resolve how to treat this new technology so that it can be deployed.

MONEY

There is lots of money to be made in this new industry.   Lots !

Just as the cell phone market is reaching saturation, this will spawn a new industry for vehicle-based communication and computing technologies. The autonomous vehicle will become the next focus for innovation.  

There most definitely WILL be an App for that !


Predictions from Nostradavis

a. New cars will come equipped for autonomous operation
b. New cars, not your friends, won't let drunk drivers drive drunk
c. Roadways will start to become designated for autonomous vehicles
d. Insurance incentives will be given for autonomous vehicles
e A new wave of "apps" will spring up to support entertainment in autonomous vehicles
f. Every autonomous vehicle will be equipped with mobile computing and TV
g. Providing wireless telecommunications to autonomous vehicles will provide a new wave of business for communication providers (what used to be called cable companies and telecom companies)
h. A business for retrofitting older cars will spring up
i. Eventually, most roadways will become restricted to autonomous vehicles
j. The resultant shifts in spending will be initially resisted, but the tide is inevitable.

Got more ?    let me know.

Got ideas on how to get this going?   please share.