Prior to joining Google, I spent a lot of time trying to get computers to recognize objects in images. At Jetpac my colleagues and I built mustache detectors to recognize bars full of hipsters, blue sky detectors to find pubs with beer gardens, and dog detectors to spot canine-friendly cafes. At first, we used the traditional computer vision approaches that Id used my whole career, writing a big ball of custom logic to laboriously recognize one object at a time. For example, to spot sky Id first run a color detection filter over the whole image looking for shades of blue, and then look at the upper third. If it was mostly blue, and the lower portion of the image wasnt, then Id classify that as probably a photo of the outdoors.
Id been an engineer working on vision problems since the late 90s, and the sad truth was that unless you had a research team and plenty of time behind you, this sort of hand-tailored hack was the only way to get usable results. As you can imagine, the results were far from perfect and each detector I wrote was a custom job, and didnt help me with the next thing I needed to recognize. This probably seems laughable to anybody who didnt work in computer vision in the recent past! Its such a primitive way of solving the problem, it sounds like it should have been superseded long ago.
Thats why I was so excited when I started to play around with deep learning. It became clear as I tried them out that the latest approaches using convolutional neural networks were producing far better results than my hand-tuned code on similar problems. Not only that, the process of training a detector for a new class of object was much easier. I didnt have to think about what features to detect, Id just supply a network with new training examples and it would take it from there.
Those experiences converted me into a deep learning enthusiast, and so when Jetpac was acquired and I had the chance to join Google and work with many of the stars of the field, I couldnt resist. What impressed me more than anything was the teams willingness to share their knowledge with the rest of the world.
Im especially happy that weve just managed to release TensorFlow, our internal machine learning framework, because it gives me a chance to show practical, usable examples of why Im so convinced deep learning is an essential tool for anybody working with images, speech, or text in ML.
Given my background, my favorite first example is using a deep network to spot objects in an image. One of the early showcases for the new approach to neural networks was an annual competition to recognize 1,000 different classes of objects, from the Imagenet data set, and TensorFlow includes a pre-trained network for that task. If you look inside the examples folder in the source code, youll see “label_image”, which is a small C++ application for using that network.
The README has the instructions for building TensorFlow on your machine, downloading the binary files defining the network, and compiling the sample code. Once its all built, just run it with no arguments, and you should see a list of results showing "Military Uniform" at the top. This is running on the default image of Admiral Grace Hopper, and correctly spots her attire.
![]() |
Image via Wikipedia |
The only things it will spot are those that are in the original 1,000 Imagenet classes, and it will always try to find something, which can lead to some funny results. There are no people categories, so on portraits youll often see objects that are associated with people like seat belts or oxygen masks, or in Lincoln’s case, a bow tie!
![]() |
Image via U.S History Images |
Its unlikely that the set of labels it produces is exactly what you need for your application, so the next step would be to train your own network. That is a much bigger task than running a pre-trained one like this, but one of the things I like about TensorFlow is that it spans the whole lifecycle of a machine learning model, from experimentation, to training, and into production, as this example shows. To get started training, Id recommend looking at this simple tutorial on recognizing hand-drawn digits from the MNIST data set.
I hope that sharing this framework will help developers build amazing user experiences we’d never even think of. We’ve been having a massive amount of fun with TensorFlow, and I can’t wait to see what interesting image tools you build using it!
Related Post:
classify
how
- How to allow users to Run only specified programs in Windows 7
- How to measure translation quality in your user interfaces
- How to add Read More link to feedburner feed
- How good you are in SQL Simple SQL Quiz
- How Google Translate squeezes deep learning onto a phone
- How To Bypass Megaupload Wait Time And Download At Maximum Speed !!!
- 20 Resources for Teaching Kids How to Program Code
- How to Copy or Hide a File inside an Image
- How to remove New Folder exe Virus manually or using software
- How to display codes in blogger post
- How to put a flash mp3 player in blogger post
- How to recover any file from corrupted or broken HDs SSDs or SD cards
- Compare How London Looks on Google vs Paintings From the 1700s
- How to extract or get email id from orkut
- Map of Life A preview of how to evaluate species conservation with Google Earth Engine
- How to Copy Text from Images or protected web pages
- The Computer Science Pipeline and Diversity Part 1 How did we get here
- Ever wondered how Amazon makes money
- How to delete all your computer Virus for free
- How To Keep Safe Your Gmail Account
- What is Recycler Folder and How to remove it
- How to Turn On or Enable Hibernation in Windows 7
- Spam email watch How to protect yourself from spam emails
- How to add favicon icon to blogger
computer
- Take a better selfie with Lily
- Free Lecture The Psychology of Computer Insecurity
- MOOC Research and Innovation
- Calculating Ada The Countess of Computing
- When can Quantum Annealing win
- Creating a templated Binary Search Tree Class in C
- Projecting without a projector sharing your smartphone content onto an arbitrary display
- Will a robot take your job
- Facebook Introduces ‘Hack ’ the programming language of the future
- High Resolution Scary Haunted House Wallpapers for Desktop
- TYBSC IT Sem V Question Papers 2009 Mumbai University
- Home automation update
- Very easy to download youtube videos audio mp3 format
- HD Dark Desktop Background Wallpapers Download
- Launching the Quantum Artificial Intelligence Lab
- Syrias children learn to code with the Raspberry Pi
- Running omxplayer from the command line easily using alias
- Largest collection of Google Logos on the web Set 7
- Collection of SQL queries with Answer and Output Set 2
- Prevent access to specific partition or drive
- Summer Games Learn to Program
- PiAUISuite Update and Voicecommand v3 1
- Sign in to edx org with Google and Facebook and
- Large Scale Machine Learning for Drug Discovery
- Hacker Tricks from Insiders A Threat to ERP Systems
0 comments:
Post a Comment