Hi guys...
This is Prasun...
Last night I received an invitation to write for this blog..
So here I present my first post for the CYSIASTS...
Its all about an AI program known as
Never Ending Image Learner (NEIL)
Hope U'll like it...!!
This is Prasun...
Last night I received an invitation to write for this blog..
So here I present my first post for the CYSIASTS...
Its all about an AI program known as
Never Ending Image Learner (NEIL)
Hope U'll like it...!!
The real progress in AI will only occur when the different areas come together to work on understanding and interacting with the world.
NEIL is a program that scans the web 24/7 and looks at photographs to build up common sense knowledge of the world. It’s machine vision meets semantic graph.
There have been many attempts to teach computers about the common sense relationships we take for granted.
Cyc :
The most direct way that we interact with the world and discover common sense is via vision. We also have the advantage that we can interact with what we see, but there is a lot of knowledge to be gained about what there is in the world and how things relate simply by looking.
NEIL – Never Ending Image Learner
For example…
If you want to recognize a hat you could search for pictures labeled “hat” and train the classifier using them. In practice things are a bit more complicated and the overall method includes a clustering step on the returned images to select groups that really do represent good interpretations of the label.
Once the classifiers are trained they are used to examine what objects, attributes and scene types occur in general images downloaded from the web.
The program doesn’t try to understand every image it is more interested in the statistical relationships. It currently extracts object-object relationships -
Scene – Object relationships – “Bus is found in Bus depot”.
Scene – Attribute relationships – “Ocean is blue”.
For example, if the car detector detects something new that it labels as a car but it has no wheels and isn’t found on a road then it is unlikely to be a car. The whole thing starts to feedback to extend and improve the object recognition. In particular the detectors can be tuned to detect sub-categories of object such as particular makes of car.
The big problem is that this is a big data processing task. The object recognizers use 3912 dimensional feature vectors and a range of 1152 object categories and growing. It has also examined 5 million images to date and extracted 3000 common sense relationships. You can view the current state of things on the NEIL website.
When AI starts to find ways of improving its performance and extending its abilities this is when the real payoffs of the approach start to become apparent. NEIL teaches itself common sense that grows in sophistication as it is exposed to new images of the world.
NEIL is a program that scans the web 24/7 and looks at photographs to build up common sense knowledge of the world. It’s machine vision meets semantic graph.
There have been many attempts to teach computers about the common sense relationships we take for granted.
Cyc :
- a project to build a knowledge base by typing in information
- started in 1984 by Douglas Lenat.
- problem – labour intensive.
The most direct way that we interact with the world and discover common sense is via vision. We also have the advantage that we can interact with what we see, but there is a lot of knowledge to be gained about what there is in the world and how things relate simply by looking.
NEIL – Never Ending Image Learner
- This is the creation of Xinlei Chen, Abhinav Shrivastava, Abhinav Gupta of Carnegie Mellon University with funding from the Office of Naval Research and Google.
- By looking at pictures stored on the web the program is attempting to extract not only objects but their relationships, and from these the underlying concepts.
- To do this needs the ability to recognize particular objects – car, aeroplane, person and so on. Training such recognizers is time-consuming, but again the web can be used.
- The detectors are trained by using Google Image search to return photos tagged with a particular label. These are then used to train classifiers for the object.
For example…
If you want to recognize a hat you could search for pictures labeled “hat” and train the classifier using them. In practice things are a bit more complicated and the overall method includes a clustering step on the returned images to select groups that really do represent good interpretations of the label.
Once the classifiers are trained they are used to examine what objects, attributes and scene types occur in general images downloaded from the web.
The program doesn’t try to understand every image it is more interested in the statistical relationships. It currently extracts object-object relationships -
- “Eye is part of baby”
- “BMW is a kind of car”
- “swan looks similar to goose”.
Scene – Object relationships – “Bus is found in Bus depot”.
Scene – Attribute relationships – “Ocean is blue”.
- You can readily see how these relationships can be extracted if you have enough images and if your object detection is good enough.
For example, if the car detector detects something new that it labels as a car but it has no wheels and isn’t found on a road then it is unlikely to be a car. The whole thing starts to feedback to extend and improve the object recognition. In particular the detectors can be tuned to detect sub-categories of object such as particular makes of car.
The big problem is that this is a big data processing task. The object recognizers use 3912 dimensional feature vectors and a range of 1152 object categories and growing. It has also examined 5 million images to date and extracted 3000 common sense relationships. You can view the current state of things on the NEIL website.
To do all this it runs on two clusters of 200 processing cores.
When AI starts to find ways of improving its performance and extending its abilities this is when the real payoffs of the approach start to become apparent. NEIL teaches itself common sense that grows in sophistication as it is exposed to new images of the world.
No comments:
Post a Comment