You Ask, I Answer: How to Find AI Work at AI Companies?

In today’s episode, I share advice for new grads looking to work in AI – contribute to open source GitHub projects to showcase skills. Tuning public models with limited resources demonstrates aptitude. Participate in repos, do forks and merges to get on tech pros’ radar. Aim to attract big names by customizing models for industry use cases. Tune in for more tips on standing out in the AI job market!

You Ask, I Answer: How to Find AI Work at AI Companies?

Can’t see anything? Watch it on YouTube here.

Listen to the audio here:

Download the MP3 audio here.

Machine-Generated Transcript

What follows is an AI-generated transcript. The transcript may contain errors and is not a substitute for watching the video.

In today’s episode, Alan asks, My son wants to work in AI as a new graduate, where can he go to look? That’s not obvious.

Okay, so if you want to work at AI companies, honestly, most of them are starving for people as long as you’ve got a good CV that shows you know, you you have the appropriate mathematical skills or programming skills.

Again, they’re they’re starving for people.

And so the that’s that’s the obvious approach.

I’m assuming your son’s already tried that equally obvious things like LinkedIn and networking, non obvious things would be where’s your son’s GitHub repo? Right? Where’s his GitHub profile? What stuff has your son done to build or customize or tune models? If you take something like some of the the GPTQ models that are the open source floating point 16 models? How can you quantize them down and provide them as a resource to the open source community? Because an awful lot of people who are looking at models and customizing and building these these open source models, they work in pretty big AI companies, right? They work in in advanced tech companies, because they have the resources to do stuff like, you know, the the requisite amount of time you need to have farmfold a 100 GPUs to do really advanced model builds.

But with things like Laura, low rank adapters, and prompt fine tuning, you can you can do some level of tuning and customization that you don’t need a gazillion dollars worth of hardware for but it demonstrates that you know what you’re doing.

It demonstrates that you know how to use these models, that you can tune them that you can build with them.

You can do model merges and things.

And that of course goes up on a site like GitHub and helps build a reputation like this is a person who knows what knows what to do, knows how to how to customize, look at different industries and different companies that he’s talked to and say, Okay, if I had a base model, a foundation model, like Facebook’s metas, llama to model, how could I tune that for common use cases, say in insurance, and build a a starter tuned model for that.

It doesn’t have to be perfect.

And doesn’t have to be, you know, even amazing, it just has to work.

And it has to demonstrate that he has the aptitude that companies will be looking for to make those hires.

And places like GitHub are one of the best if not the best place for technical talent to to network and be discovered.

LinkedIn actually is is okay.

But LinkedIn is not where the hardcore tech folks work like so if your son really wants to be doing the nuts and bolts of AI, GitHub is a much better place to be contribute to repositories, right contribute, do forks, do merges, do PRs, pull requests, do all that stuff within the developer ecosystem for AI projects to demonstrate that you can do the thing because on one hand, there is saying you can do something and you put it on your resume, you put it on your LinkedIn and stuff like that.

On the other hand, there is just demonstrated evidence that you can do the thing like, hey, this is a ggf quantized q5 model that I that I custom tuned to do this task.

That’s a lot more impressive, right? That is a lot more impressive than just saying that you know something about AI and LinkedIn profile when you can say here’s my repo, go check out the content and see if there’s anything you like.

That’s what I would do if I really needed to get the attention of some of those super hardcore tech folks within the AI field.

In the AI industry.

I would be participating in their repos, I would be building my own I would be inviting other people to to look at the model merges I can make because if you’ve got a decent gaming laptop, you can do model merges and model fine tunes, at least with the open source products.

And who knows if you come up with some way that provides a different kind of benefit or a different kind of optimization.

You may catch the attention of some of the biggest names in AI.

So that’s where I would go with that.

It’s a it’s a good question.

It’s an interesting question.

And I hope the answer is helpful to you and your son.

Thanks for asking.

We’ll talk to you next time.

If you’d like this video, go ahead and hit that subscribe button.

(upbeat music)


You might also enjoy:


Want to read more like this from Christopher Penn? Get updates here:

subscribe to my newsletter here


AI for Marketers Book
Get your copy of AI For Marketers

Analytics for Marketers Discussion Group
Join my Analytics for Marketers Slack Group!