Four Steps to Start Learning Artificial Intelligence

Introduction

Many people ask, “Where should I start learning artificial intelligence?” This question is broad, as AI is a vast field, similar to asking, “Where should I start with sports?” Are you interested in basketball, swimming, or running?

However, I will break down this question into four concrete steps based on my experiences and those of friends who successfully transitioned into AI. After reading this, you’ll see that it’s not as daunting for beginners to enter the AI field.

Step 1: Don’t Let Math Scare You, Start with “Swapping”

Many people panic at the mention of AI, thinking, “Oh no, I’m not good at math!” While deep learning involves linear algebra, probability, and calculus, the truth is that beginners don’t need to master all the math before they start.

Modern AI frameworks like PyTorch, TensorFlow, and Keras encapsulate complex mathematical operations. With just a few lines of code, you can build a neural network without needing to manually derive matrices.

For example, I know a liberal arts student who studied English in college and had never encountered advanced math. He learned the basics of Python in two weeks and then ran an image classification code written by someone else, successfully creating a program to recognize handwritten digits. He was thrilled to discover he could also do AI.

Practical advice:

Spend 1-2 weeks learning basic Python syntax (variables, loops, functions, classes are enough).
Install Anaconda and use Jupyter Notebook to follow tutorials and write code.
Successfully run your first MNIST handwritten digit recognition program; this is more effective than reading ten books.

Step 2: Choose a “Small Niche” and Don’t Try to Learn Everything at Once

AI has many directions: computer vision, natural language processing, speech recognition, recommendation systems, reinforcement learning, etc.

A common mistake for beginners is to want to learn everything. One day, facial recognition seems interesting; the next day, chatbots look cooler; then they dive into autonomous driving. After six months, they realize they haven’t mastered anything.

The correct approach is to select a scenario you can immediately apply.

For instance, if you work in operations, you can learn to use AI for sentiment analysis of user comments; if you’re in design, you can learn to use Stable Diffusion to generate materials; if you enjoy stock trading, you can learn to use LSTM to predict stock price trends (though it may not be accurate, it’s good practice).

Data supports this: According to a 2022 Kaggle survey, 74% of successful beginners started with a single task-oriented project rather than a comprehensive theoretical study. My own experience mirrors this. Initially, I focused on natural language processing to develop an automated customer service system, concentrating only on text classification and intent recognition. Within three months, I had a prototype, while colleagues who aimed to “learn AI comprehensively” were still stuck in math textbooks.

Step 3: Focus on Practical Skills; Theory is Secondary

Many people approach AI as an academic discipline, buying numerous textbooks like “Statistical Learning” and “Deep Learning,” taking extensive notes, but then feel lost when they open a code editor.

This is a classic case of putting the cart before the horse.

Industry and academia are entirely different. Companies hiring for AI roles want you to solve practical problems—cleaning data, tuning parameters, deploying models, and handling bad cases—rather than deriving BP algorithm formulas.

A sobering statistic: A job platform found that only 32% of candidates interviewing for AI positions could write code to run a simple model on the spot. This means over two-thirds of applicants are just “talking the talk.”

So my advice is:

80% hands-on, 20% reading. Look up concepts as you encounter them; don’t read just for the sake of reading.
For every algorithm you learn, implement it in code immediately. Try out logistic regression, decision trees, CNNs, etc., by coding them yourself.
Once your project reaches a certain point, revisit the math. By then, you’ll have specific questions, making your study ten times more efficient.

A good analogy is that learning AI is like learning to cook. You don’t need to master food chemistry or thermodynamics before you can cook. Follow a recipe, if you burn something, check why, and gradually you’ll understand timing and seasoning.

Step 4: Use the “Copy-Improve-Create” Method to Build Projects

Many beginners get stuck in the awkward phase of “not having projects to work on”—they understand some theory and can write a few lines of code, but when it comes to creating something from scratch, they feel lost.

The solution is simple: Copy, Improve, Create.

Phase 1: Copy (1-2 months) Find a popular project on GitHub or Kaggle, such as cat-dog classification, movie review sentiment analysis, or housing price prediction. Copy the code line by line and understand the purpose of each line. You don’t need to innovate; reproducing someone else’s results is a victory.

Phase 2: Improve (2-3 months) Make small modifications based on your replication. For example, if someone used ResNet50, try switching to EfficientNet; if they used the Adam optimizer, experiment with SGD. Record the effects of your changes. This process will help you truly understand the impact of parameters and architectures.

Phase 3: Create (Long-term) Combine your work and interests to create a project that solves a real problem. For example:

Automatically organize photos on your computer by face classification.
Scrape second-hand housing data in your city to predict reasonable prices.
Write a script to automatically reply to emails (only for template scenarios).

A real-world example: A friend in HR created a script using the GPT API to automatically screen resumes, scoring and ranking them based on job descriptions and resumes. This tool has been used in his company for six months, saving him ten hours of work each week. This is the best entry-level project—solving your own pain points.

Final Thoughts

Artificial intelligence does have a learning curve, but it’s not as mystical as training programs claim, nor is it as terrifying as some detractors suggest.

What truly discourages most people is not the difficulty of math or the complexity of code, but information overload and perfectionism—the constant desire to learn all prerequisite knowledge before taking action, leading to endless preparation without ever starting.

If you want to get started right now, here’s a simple “30-day launch plan”:

Days 1-7: Learn basic Python; being able to write loops and functions is sufficient.
Days 8-14: Set up your environment and run an image recognition code written by someone else.
Days 15-21: Change the dataset and run the same model again (for instance, switch from handwritten digits to cat-dog images).
Days 22-30: Try modifying one or two parameters and observe the changes in results.

After 30 days, you’ll find that you’ve become a “beginner-level AI practitioner.” The rest of the journey can be learned through practice.

Remember: AI is not about understanding or memorizing; it’s about running code and becoming proficient.

Now, close this article and start installing Python!