These are the nine best methods for data analysis

These are the nine best methods for data analysis

Introduction: The author of this article mainly introduces several methodologies for data analysis, from the basics to single indicators, and gradually extends to multiple indicators. The article has clear ideas and contains many practical methods. It is worth reading for those who want to further expand their data analysis capabilities.


"What is your methodology for data analysis?" Whether it's work or an interview, this question will stump many students. Oh my god, you can figure it out by running numbers every day, but what the hell is methodology! How to summarize it! Today I will introduce to you nine basic methods, which are used for interviews, job presentations, and bragging!

01 What is the basic method

In essence, almost all jobs are related to data and require some data analysis methods. However, data itself has a threshold. Many people were afraid of math classes when they were in school, not to mention complex theories.

Therefore, the so-called basic data analysis method should be:

  • It does not involve advanced mathematics, statistics, operations research, and algorithm principles.
  • Does not involve complex business logic or causal inference logic
  • Not limited to specific business scenarios, universal

In short, the basic method is the one that everyone can use. Based on this concept, we have compiled nine basic analysis methods for you, which are simple and easy to use.

02 Start with an indicator

1. Periodic analysis method

The most basic analysis method can start with an indicator, which is the "periodic analysis method". The so-called "periodic analysis method" is very simple in operation, which is to extend the observation time of an indicator to see if it has a periodic change pattern.

This method is simple to analyze, but very practical, because novices often make jokes because they don't know how to read the cycle changes.

Such as:

"I noticed that the indicator fell sharply yesterday" - Yesterday was the weekend, so it should have fallen (natural cycle changes)

“I found that product A sells very well” – Product A has just been launched, so it should sell well (product life cycle)

Often, the indicators we look at are overall indicators, which are composed of several parts, such as:

Head office - Branch A, Branch B, Branch C Total sales - Product A, Product B, Product C

Therefore, after seeing an overall indicator, you can break down the whole according to its components and understand the composition of each part. This is the structural analysis method (analyzing the internal structure of the whole).

2. Structural analysis method

The structural analysis method is useful in many cases. For example, if you ask: "Why is the performance declining?" Answer: "Because XX area is not done well!" By looking at the structure, you can quickly find the person responsible.

3. Hierarchical analysis method

In addition to simply looking at the structure, people also like to make rankings and distinguish between high, medium and low levels. This is the hierarchical analysis method.

Many students confuse stratification and structure. Just remember the following two sentences:

Structure exists objectively, just ask clearly. Stratification is subjective, and you need to determine the high and low.

These three methods are the most basic of the basics. First, they are all analyzing an indicator. Second, they are all based on factual statements and do not require any calculations. When we first arrive at a company or come into contact with new data, we can use these three methods to establish basic cognition.

03 From one indicator to two indicators

1. Matrix analysis method

When the number of indicators increases from one to two, the best method is matrix analysis. Matrix analysis constructs an analysis matrix by crossing two indicators, and uses the average value to cut out four quadrants to find problems (as shown below).

The biggest advantage of the matrix analysis method is that it is intuitive and easy to understand. It is easy to find problems from the cross-comparison of two indicators. Especially when the two indicators are input/cost indicators, the two categories of high cost + low income and low cost + high income can directly indicate the direction of improvement for the business, thus greatly avoiding the problem of "not knowing how to evaluate good and bad".

Many consulting companies like to use this method, which is similar to the KANO model or the Boston Matrix. The essence is to find two good evaluation indicators, construct a matrix by cross-constructing the two indicators, and classify the business. The classification effect is very good, so it is widely circulated.

04 From 2 indicators to multiple indicators

When there are more analytical indicators, the most important task is to figure out the relationship between these indicators. There are two typical relationships.

1. The first type: parallel relationship

Several indicators are independent of each other and are components of the upper-level indicators. For example, we often say: Performance = Number of customers * Consumption rate * Average customer price In this formula

  1. Primary indicator: performance
  2. Secondary indicators: number of customers, consumption rate, average order value
  3. The number of customers, consumption rate, and average order value are independent of each other

At this point, the number of customers, consumption rate, and average order value are three parallel indicators, and are all sub-indicators of performance.

2. The second type: serial relationship

Several indicators are interrelated and have a sequential relationship. For example, we often say: Number of new registered users = number of ad viewers * landing page conversion rate * registration page conversion rate.

  1. Primary indicator: Number of newly registered users
  2. Secondary indicators: number of ad views, landing page conversion rate, registration page conversion rate
  3. Users must first see the ad, then click on the ad to enter the landing page, and then complete the registration

At this point, the indicators of the advertising page, landing page, and registration page are interrelated, and users have to proceed step by step.

These two relationships correspond to two basic analysis methods:

Parallel relationship: indicator decomposition method, by decomposing a first-level indicator, problems can be found from the second-level indicators.

Funnel analysis method: By observing a series of processes, understand the process conversion rate and find out the shortcoming of the conversion rate.

3. Indicator decomposition method

The indicator decomposition method is generally used more in business analysis. For example, a small program mall had sales of 1.5 million last month and 1.2 million this month. If you only look at the results, you will not know anything except that there is a loss of 300,000. However, after the indicator is decomposed, you can find a lot of things (as shown below)

After breaking it down, it is obvious that although the number of registered users increased this month, the consumption rate dropped significantly, so the income was less. In the future, we can further think about how to increase the consumption rate.

4. Funnel analysis

The funnel analysis method is more commonly used in Internet product/promotion/operation analysis, because Internet products can record more user data, and thus can present the entire user conversion process for analysis.

For example, when we see a product advertisement online, we are very interested and click to buy it. We need to go through the steps of advertisement page → details page → shopping cart → payment. With each additional step, some users will be lost, just like a funnel.

At this point, you can use a conversion funnel to graphically represent this relationship (as shown below).

With the conversion funnel, we can further analyze the funnel to guide business improvements:

  • Which link misses the most users and needs improvement?
  • What is the funnel shape for different products, and which one is more suitable for promotion?
  • Has the number of missed users been reduced after the new product revision?

5. Correlation analysis

Of course, there are some indicators that may not have a direct parallel/serial relationship, but at work, I would also like to know if they are related, such as:

  • Advertising investment and sales performance
  • Rain, wind and store traffic
  • User clicks and consumption behavior

At this point, you need to master the correlation analysis method. Note: There may be a natural correlation between indicators. Common natural correlations have three forms:

  • In structural analysis, the relationship between the overall index and the partial index
  • In the indicator decomposition method, the relationship between the main indicator and the sub-indicators
  • In the funnel analysis method, the relationship between the previous and next step indicators

These three situations are called: direct correlation . Direct correlation does not require data calculation, and the relationship can be clearly seen through indicator combing. Correlation analysis method is more about using scatter plots/correlation coefficients to find potential correlations (as shown below).

But please note: correlation does not equal causation. How to interpret the correlation coefficient needs to be combined with the specific business meaning, and you cannot draw conclusions randomly.

05 From indicators to business logic

1. Label analysis method

All of the above methods are based on data indicator calculations, but in actual business, many relationships cannot be directly expressed by data indicators.

for example:

  • Do community stores do better business than pedestrian street stores?
  • Does private domain traffic convert better than public domain traffic?
  • Are sales better when it is windy and rainy than when the sky is clear?

It is difficult to measure community stores/private domain traffic/wind and rain with a single data indicator. However, these factors do have an impact on business operations. How can we analyze them? This requires the use of label analysis.

Let's take a simple example. In a southern province, heavy rain often occurs in August. Everyone thinks that rain will affect store performance. So how do we analyze it? According to the five-step method, we can analyze the stores in this province as shown in the following figure:

So we can conclude that rain has little impact on performance, and that’s it.

Note that in the example above, the labels are very crude, with only two simple categories: rain or no rain. In addition to rain, there may be typhoons, hail, high temperatures, and so on. Therefore, the precision of labeling determines the accuracy of label analysis. Whether the appropriate label can be selected is a test of the analyst's understanding of the business.

So far, a total of eight basic methods have been introduced. In actual work, multiple methods are generally used in combination. Because the questions raised by the business are very complex, they may involve multiple indicators and multiple tags. At this time, to sort out the ideas, we have to use the ninth method: MECE method.

2. MECE Method

MECE is the abbreviation of (Mutually Exclusive Collectively Exhaustive), which refers to the classification principle of "mutually independent and completely exhaustive". Classifying problems through the MECE method can be clear and accurate, making it easy to find answers.

The MECE method is the watershed between basic analysis and advanced analysis, and is also the path from basic to advanced. All complex problems need to be carefully sorted out and decomposed to become small problems that can be solved. The so-called business analysis model is actually the MECE decomposition of business problems.

Of course, some students are definitely curious: After mastering these nine methods, how can we go deeper? Generally, there are three routes for in-depth analysis.

06 After mastering the basic methods

1. Route 1: Business Analysis Model

Business models are used to solve problems such as vague definitions, poor data, and the need for business guidance. For example, the business is struggling with:

  • What exactly does it do that is better for users?
  • How can we motivate sales?
  • How much of an impact did the activity have?

These questions sound simple, but in fact, the definitions are very vague. What is good? What does it mean to be effective? Are incentives without money really effective? Various questions are complicated and may be mixed with the business department’s own little thoughts. Therefore, it is necessary to carefully sort out the business logic and derive feasible solution logic.

2. Route 2: Algorithm Analysis Model

Algorithmic models are used to solve problems with clear definitions, rich data, and complex computational processes.

For example, the problem of identifying high-value users has been clearly defined in the business:

  • What indicators are used to measure user value?
  • What criteria are used to evaluate the value of "high"
  • Rich data collected (gender, age, interests, related products, interactions, comments, etc.)
  • There is a group of manually labeled "high-value" users as positive samples

At this point, various algorithms can be used to build models. The purpose of modeling is not to increase the depth of analysis, but to improve the efficiency from analysis to business application. With relatively accurate model judgment, the business can automatically trigger marketing rules through CDP+MA, without having to write a long PPT every time.

3. Route 3: Statistical Inference

Statistical inference methods are used to solve problems that are clearly defined, have no data, and require testing and data collection.

For example, if you want to release a new version of a product, the business has already defined that the new version should increase the average online time of users (mean problem). Now you need to do a test and choose one from two preliminary versions. At this time, you need to use the double population mean comparison hypothesis test method.

Of course, actual problems are more complicated, requiring consideration of various control variables, assumptions, system development, and data collection plans, and cannot be solved solely by analysts.

Author: Down-to-earth Teacher Chen

Source: WeChat public account "Down-to-earth Teacher Chen"

<<:  299 yuan per set, 24 hours a day, AI anchor enters the live broadcast room

>>:  Full scale, more delicious than Zibo barbecue

Recommend

Advertisers, please check: 6 love letters from brands

What new styles will brand copywriting have this V...

Will the cool and addictive “Wandans” be more popular than the skit?

What impact will the explosive popularity of live-...

MINISO, the king of IP collaboration

In 2024, MINISO successfully consolidated its lead...

Without products, there is no growth.

Do you know the few key decisions about products? ...

How is Amazon's review rate calculated? What is it generally?

As one of the world's largest e-commerce platf...

How is Wish's cross-border logistics calculated? What logistics can be sent?

Now, to register a store on the Wish platform, sel...

WeChat store incentive policy is here!

In order to encourage more merchants to join WeCha...

How to run cross-selling ads on eBay? Is it effective?

On the eBay platform, cross-promotion ads are an e...

Should WeChat open a first-level entrance for video accounts?

This article elaborates on the view that WeChat sh...

What should a novice individual seller do on Amazon? Some experience sharing

As a new Amazon individual seller, you may face ma...

The underlying logic of product homogeneity

When it comes to product homogeneity, the first th...