Reports from Tesla suggest it is gathering huge amounts of driving data from logs in its cars – 780 million miles of driving, and as much as 100 million miles in Autopilot mode. This contrasts with the 1.6 million miles of test operations at Google.
Huge numbers, but what do they mean now, and in the future?
As I’ve written before, testing is one of the biggest remaining challenges in self-driving car development – how do you prove to yourself and to others that you’ve reached the desired safety goals? Tons of miles are a very important component to that. If car companies are able to get their customers to do the testing for them, that can be a big advantage. (As I wrote last week, another group that can get others to do testing are companies like Uber and even operators of large commercial and taxi fleets.) Lots of miles mean lots of testing, lots of learning, and lots of data.
Does Tesla’s quick acquisition of so many miles mean they have lapped Google? The short answer is no, but it suggests a significant threat since Google is, for now, limited to testing with its small fleet and team of professional testing drivers.
Tesla is collecting vastly less data from its cars than Google does. First of all, the Tesla has a lot fewer sensors and no LIDAR, and according to various sources I have spoken to, Tesla is only collecting a fraction of what its sensors gather. To collect all that it gathers would be a huge data volume, not one you would send over the cell network, and even over the Wi-Fi at home it would be very noticeable. Instead, reports suggest Tesla is gathering only data on incidents and road features the car did not expect or did not handle well. However, nothing stops Tesla in the future from logging more, though the company might want to get approval from owners to use all that bandwidth.
Dateline NBC reports on self-driving cars, including an interview with Brad Templeton. (scroll forward to second segment)
Tesla wants to make a self-driving car for people to buy today. As such, it has no LIDAR, because a car today, and even the Autopilot, can be done without LIDAR. Tomorrow’s LIDARs will be cheap but today’s production LIDARs for cars are simple and/or expensive. So while the real production door-to-door self-driving car almost certainly uses LIDAR, Tesla is unable and unwilling to test and develop with it. (Of course, they can also argue that in a few years, neural networks will be good enough to eliminate the need for LIDAR. That’s not impossible, but it’s a risky bet. The first cars must be built in a safety-obsessed way, and you’re not going to release the car less safe than you could have made it just to save what will be only a few hundred dollars of cost.)
As noted, Google has being doing its driving with professional safety drivers, who are also recording a lot of data from the human perspective that ordinary drivers never will. That isn’t 100 times better but it’s pretty important.
Tesla is also taking a risk, and this has shown up in a few crashes. Their customers are beta testing a product that’s not yet fully safe. In fact, it was a pretty bold move to do this, and it’s less likely that the big car companies would have turned their customers into beta testers – at least no until forced by Tesla. If they do, then the big automakers have even more customers than Tesla, and they can rack up even more miles of testing and data gathering.
When it comes to training neural networks, ordinary drivers can provide a lot of useful data. That’s why Commma.ai, which I wrote about earlier, is even asking volunteers to put a smartphone on their dash facing out to get them more training data. At present, this app does not do much, but it will not be hard to make one that offers things like forward collision warning and lane departure warning for free, paid for by the data it gathers.
This article was republished with permission from Brad Templeton’s Robocars Blog.