Databricks releases free data for training AI models for commercial use

Stephen Nellis and Krystal Hu

Updated 12 April 2023 at 9:08 am

(Reuters) - Databricks, a San Francisco-based startup last valued at $38 billion, released a trove of data on Wednesday that it says businesses and researchers can use to train chatbots similar to ChatGPT.

The data, based on questionnaires completed by Databricks employees, fills an important gap in the company's efforts to create commercially usable tools to train AI systems that could offer alternatives to Microsoft-backed OpenAI.

Databricks said it spent the past several weeks gathering 15,000 questions and responses from its 5,000 employees in 40 countries and then vetted the data for quality, an effort Chief Executive Ali Ghodsi estimated cost the company millions of dollars.

Databricks sells software tools for building AI systems.

Ghodsi told Reuters that the company is releasing the free training data in the hope that other companies will use it to make their own AI systems, possibly using Databricks to do so.

The free dataset came after Databricks last month released Dolly, an open source large language model, the technological basis for chatbots. But Dolly could not be used in commercial products because the data used to train the model was generated by OpenAI's ChatGPT, whose terms of service forbid using its data to develop commercial AI systems that could compete with OpenAI.

Using data generated by AI to train other AI systems has become common. New chatbots published by Stanford University and the University of California, Berkeley this year, for example, used such machine-generated data from ChatGPT, but both made clear that their models could not be used for commercial purposes.

Ghodsi acknowledged the dataset is far from perfect because it draws only on Databricks' employee base, which he said skews male. Users will be able to examine the training data themselves, which they cannot do for models such as ChatGPT or Alphabet Inc's Bard, whose training data has not been released.

"We're not claiming that this is an unbiased dataset," Ghodsi said. "We're just trying to push the community to go in this direction of more transparency, and more of everyone owning their own models instead of just a few that we have to trust."

(Reporting by Stephen Nellis in San Francisco and Krystal Hu in New York; Editing by Robert Birsel)
