daily californian logo

BERKELEY'S NEWS • MARCH 30, 2023

Welcome to the (March) Madness! Read more here

Campus artificial intelligence research group creates data set for self-driving cars

article image

BERKELEY DEEP DIVE | COURTESY

SUPPORT OUR NONPROFIT NEWSROOM

We're an independent student-run newspaper, and need your support to maintain our coverage.

JUNE 08, 2018

The Berkeley Artificial Intelligence Research Lab, or BAIR, released a study on BDD100K — a driving database that can be used to train algorithms of self-driving cars — May 12.

The data set can be used to train self-driving cars’ artificial intelligence programs, according to BAIR’s website. The study concluded that the data set can help researchers understand how different scenarios affect current self-driving car programs.

A study by the research team that created the data set described two contributions to self-driving cars, one of which is the data set and the other its video annotation system. According to BAIR’s website, BDD100K is “the largest and most diverse driving video dataset,” containing 100,000 driving clips. Each clip in the data set is about 40 seconds long for various scenarios and driving conditions in different locations.

“It is great to have more open data out there for real-world domain problems, in this case for the autonomous driving space,” Cathy Wu, a doctoral student in UC Berkeley’s department of electrical engineering and computer sciences, said in an email. “This will allow the research community to really see if their methods really generalize or apply in real-world domains, and allows for the development of new methods.”

Other than the 100,000 videos, BDD100K includes multiple cities, weathers, times of day and scene types, according to BAIR’s website. The videos can be useful for imitation learning, or artificial intelligence learning through mimicking human behavior. Locations videoed range from New York to the Bay Area.

To help understand the diversity and number of objects, BAIR placed two-dimensional bounding boxes after the first 10 seconds of each video. Those boxes were used to identify buses, traffic lights, traffic signs, people, bikes, other vehicles and riders. Most notably, the 100,000 videos included over 1 million cars, according to the website.

“This dataset is much larger and diverse than other available academic datasets for autonomous driving — which is great,” Wu said. “This is also a dataset that is grounded in a truly real-world problem, rather than a dataset collected from images on the internet.”

According to the study, the researchers also created a system to allow objects to be identified with more accuracy and efficiency. This system is used to find the diversity in objects and object detection by adding curved boundaries.

The study states that the data set comes with comprehensive annotations, and can serve as a “benchmark” for other research initiatives. Wu added that many researchers can benefit from this data set, as it allows researchers to test their techniques for self-driving cars.

“This dataset may also provide a means of comparing machine learning techniques in a standardized manner,” Wu said in the email. “The subsequent research results may in turn shape the development of autonomous driving in industry as well.”

Contact Suryan Bhatia and Yao Huang at [email protected].
LAST UPDATED

JUNE 08, 2018


Related Articles

featured article
The grant will be used over the next five years for research in artificial intelligence and machine learning.
The grant will be used over the next five years for research in artificial intelligence and machine learning.
featured article
featured article
Electrical Engineering and Computer Science faculty members and researchers from UC Berkeley’s Real-Time Intelligent Secure Execution Lab, or RISELab, released a report Monday outlining challenges facing the progress of artificial intelligence technologies, as well as some ways to address them.
Electrical Engineering and Computer Science faculty members and researchers from UC Berkeley’s Real-Time Intelligent Secure Execution Lab, or RISELab, released a report Monday outlining challenges facing the progress of artificial intelligence technologies, as well as some ways to address them.
featured article
featured article
Siri — a form of artificial intelligence used in iPhones — can understand a lot; however, if you ask Siri if it "grasps" the concepts lobbed its way, the intelligence reveals its artificiality.
Siri — a form of artificial intelligence used in iPhones — can understand a lot; however, if you ask Siri if it "grasps" the concepts lobbed its way, the intelligence reveals its artificiality.
featured article