"How do you know about all this AI stuff?"
I just read tweets, buddy.


OpenAI transcribed over a million hours of YouTube videos to train GPT-4

#training data   #youtube   #openai   #ethics   #link  

How Tech Giants Cut Corners to Harvest Data for A.I.

If it exists online, they're gonna take it.

Some OpenAI employees discussed how such a move might go against YouTube’s rules, three people with knowledge of the conversations said. YouTube, which is owned by Google, prohibits use of its videos for applications that are “independent” of the video platform.

Ultimately, an OpenAI team transcribed more than one million hours of YouTube videos, the people said. The team included Greg Brockman, OpenAI’s president, who personally helped collect the videos, two of the people said. The texts were then fed into a system called GPT-4, which was widely considered one of the world’s most powerful A.I. models and was the basis of the latest version of the ChatGPT chatbot.

Doesn't matter what original license you might have granted or what makes sense, it's alllll theirs.

Last year, Google also broadened its terms of service. One motivation for the change, according to members of the company’s privacy team and an internal message viewed by The Times, was to allow Google to be able to tap publicly available Google Docs, restaurant reviews on Google Maps and other online material for more of its A.I. products.

I think my favorite point in the piece is how Meta came from a weaker position because Facebook users don't post essay-like content.

@decka227 on August 01, 2023

#audio   #youtube   #deepfakes   #translation   #tweets  

Forget subtitles: YouTube now dubs videos with AI-generated voices – the background to know for this one is the role of dubbing in YouTube fame, featuring MrBeast conquering the world through the use of localized content.

There's definitely a need: "as much as two-thirds of the total watch time for a creator’s channel comes from outside their home region." But what does the scale look like for those who make the transition?

The 9-year-old Russian YouTuber, Like Nastya, has created nine different dubbed channels, expanding into Bahasa Indonesia, Korean, and Arabic to reach over 100 million non-Russian subscribers

To make things personal: my boring, mostly-technical English-language YouTube channel has India as its top consumer, currently beating out the US by just under one percentage point (although India was twice as common as the US a couple years ago).

Although I realize that yes, that's cheating, since almost 90% of those with a bachelor's degree in India can speak English.