aifaq.wtf

"How do you know about all this AI stuff?"
I just read tweets, buddy.

#training

Page 1 of 1

@Abebab on February 07, 2024

#common crawl   #data sources   #training data   #training   #tweets   #Abebab   #Abeba Birhane  

@Yampeleg on September 27, 2023

#behind the scenes   #law and regulation   #training   #tweets  

fast.ai - Can LLMs learn from a single example?

#behind the scenes   #training   #models   #link  

According to Betteridge's law of headlines:

Any headline that ends in a question mark can be answered by the word no.

@NiemanLab on August 17, 2023

#journalism   #business of AI   #training   #ethics   #law and regulation   #labor   #tweets  

@natanielruizg on July 14, 2023

#models   #fine-tuning   #training   #generative art and visuals   #tweets  

Introducing Aya: An Open Science Initiative to Accelerate Multilingual AI Progress

#translation   #low-resource languages   #under-resourced languages   #models   #training   #fine-tuning   #link  

Looks great!

Multilingual AI is a vey real issue, with literal lives on the line. Mostly because Facebook wants to use AI to moderate hate speech instead of using actual human beings (although that has problems, too). Ignoring content moderation on social media in non-English countries goes much worse than you'd imagine.

Lots of ways to contribute, from the Aya site:

Screenshot of what you can do with Aya

@tomgoldsteincs on July 07, 2023

#models   #training   #tweets  

May 4, 2023: @structstories

#custom models   #training   #models   #fine-tuning  

Not that I know the details, but I have my doubts that BloombergGPT was even worth it. I think "maybe look at" is a little too gentle – if you think you need your own model, you don't.

Prompt engineering and even somewhat thoughtful engineering of a pipeline should take care of most of your use cases, with fine-tuning filling in any gaps. The only reason you'd train from scratch is if you're worried about the copyright/legal/ethical implications of the data LLMs were trained on – and if you're worried about that, I doubt you have enough data to build a model.