📊 Real-World Aggregation Examples in Python: Sales, Analytics & Student Data

 ๐Ÿ” Introduction

In the age of big data, simply collecting information isn't enough; you need to analyze and summarize it to uncover patterns, trends, and insights. That's where data aggregation comes in.

Aggregation is the process of grouping data and applying summary functions such as sum, mean, count, and max. It's widely used in business intelligence, analytics, and education to make data manageable and actionable.
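As a quick illustration before the real-world examples, here is a minimal sketch (with a small made-up dataset) of what grouping and summarizing looks like in pandas:

import pandas as pd

# Tiny made-up dataset: one row per order
orders = pd.DataFrame({
    'Region': ['East', 'West', 'East', 'West'],
    'Amount': [120, 80, 200, 150]
})

# Group rows by Region and summarize Amount with several functions at once
print(orders.groupby('Region')['Amount'].agg(['sum', 'mean', 'count', 'max']))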

In this post, we'll walk through real-world aggregation examples using Python and pandas, including:

1. Sales Data

2. Website Analytics

3. Student Academic Records
 

๐Ÿ›️ 1. Sales Data Aggregation

A retail company records every transaction, including product, store, quantity sold, and revenue.

Goal:

● Total sales by product

● Average quantity per store

● Top-selling products

📊 Aggregation Examples:

1. Total Revenue per Product

2. Average Quantity per Store

3. Top-Selling Products by Quantity


import pandas as pd

sales_data = {
    'Product': ['Shirt', 'Shirt', 'Shoes', 'Shoes', 'Hat'],
    'Store': ['NY', 'LA', 'NY', 'LA', 'NY'],
    'Quantity': [10, 5, 8, 7, 3],
    'Revenue': [200, 100, 400, 350, 75]
}

sales_df = pd.DataFrame(sales_data)
print("\nSales Data:\n", sales_df)

# Total Revenue per Product
print("\nTotal Revenue per Product:")
print(sales_df.groupby('Product')['Revenue'].sum())

# Average Quantity per Store
print("\nAverage Quantity per Store:")
print(sales_df.groupby('Store')['Quantity'].mean())

# Top Selling Products by Quantity
print("\nTop Selling Products (by Quantity):")
print(sales_df.groupby('Product')['Quantity'].sum().sort_values(ascending=False))
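If you want several summaries per product in a single call, .agg() with a column-to-function mapping is a compact variant of the calls above (a sketch using the same sales_df):

# Revenue total plus quantity total and average, per product, in one call
print("\nRevenue and Quantity Summary per Product:")
print(sales_df.groupby('Product').agg({'Revenue': 'sum', 'Quantity': ['sum', 'mean']}))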

๐ŸŒ2. Website Analytics Aggregation

A site tracks users, sessions, and conversions per channel (Organic, Email, Paid Ads).

Goal:

● Total sessions by channel

● Conversion rate by channel

📊 Aggregation Examples:

1. Total Sessions by Channel

2. Conversion Rate per Channel


web_data = {
    'Channel': ['Organic', 'Email', 'Paid', 'Organic', 'Paid'],
    'Users': [150, 80, 120, 130, 110],
    'Sessions': [200, 100, 160, 180, 150],
    'Conversions': [30, 15, 40, 35, 45]
}

web_df = pd.DataFrame(web_data)
print("\nWeb Analytics Data:\n", web_df)

# Total Sessions by Channel
print("\nTotal Sessions by Channel:")
print(web_df.groupby('Channel')['Sessions'].sum())

# Conversion Rate (%) per Channel
print("\nConversion Rate per Channel (%):")
web_grouped = web_df.groupby('Channel').sum()
web_grouped['Conversion Rate (%)'] = (web_grouped['Conversions'] / web_grouped['Sessions']) * 100
print(web_grouped[['Conversion Rate (%)']])
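The same result can also be computed with named aggregation, which keeps the summed columns and the derived rate together; this is just an alternative sketch, not a change to the method above:

# Alternative: named aggregation, then derive the rate
rates = web_df.groupby('Channel').agg(
    Sessions=('Sessions', 'sum'),
    Conversions=('Conversions', 'sum')
)
rates['Conversion Rate (%)'] = rates['Conversions'] / rates['Sessions'] * 100
print(rates[['Conversion Rate (%)']])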

🎓 3. Student Data Aggregation


A school tracks students, their subjects, grades, and attendance.


🧠 Goal:

1. Average grade per subject

2. Attendance rate per student

student_data = {
    'Student': ['Alice', 'Bob', 'Alice', 'Bob', 'Charlie'],
    'Subject': ['Math', 'Math', 'Science', 'Science', 'Math'],
    'Grade': [85, 78, 90, 82, 88],
    'Attendance (%)': [95, 88, 92, 85, 97]
}


📊 Aggregation Examples:

1. Average Grade per Subject

2. Attendance Rate per Student

student_df = pd.DataFrame(student_data)
print("\nStudent Data:\n", student_df)

# Average Grade per Subject
print("\nAverage Grade per Subject:")
print(student_df.groupby('Subject')['Grade'].mean())

# Average Attendance per Student
print("\nAttendance Rate per Student:")
print(student_df.groupby('Student')['Attendance (%)'].mean())
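As a small extension (a sketch using the same student_df), a pivot table shows the grades per student and subject in a single view:

# Grade matrix: one row per student, one column per subject
print("\nAverage Grade by Student and Subject:")
print(student_df.pivot_table(index='Student', columns='Subject', values='Grade', aggfunc='mean'))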


🔮 Future Scope of Data Aggregation

As data continues to grow exponentially across industries, the future of data aggregation is set to become even more impactful. Here’s how aggregation is expected to evolve in the coming years:


1. Integration with AI and Machine Learning

Aggregated data serves as the foundation for AI models.

More intelligent systems will use automated aggregation pipelines to prepare real-time data for predictive analytics, anomaly detection, and personalization.
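For example, a minimal sketch (with hypothetical customer transaction data) of aggregating raw records into per-customer features that a model could consume:

import pandas as pd

# Hypothetical transactions rolled up into per-customer features for a model
transactions = pd.DataFrame({
    'CustomerID': [1, 1, 2, 2, 3],
    'Amount': [50, 20, 300, 150, 75]
})

features = transactions.groupby('CustomerID')['Amount'].agg(
    total_spend='sum', avg_order_value='mean', num_orders='count'
)
print(features)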


2. Real-Time Aggregation at Scale

With the rise of IoT, e-commerce, and digital platforms, there's an increasing demand for real-time dashboards powered by streaming aggregation (using tools like Apache Kafka, Spark, or Flink).

This enables faster decision-making in areas like fraud detection, live analytics, and dynamic pricing.
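Production streaming pipelines typically run on tools like Kafka, Spark, or Flink, but the core idea of windowed aggregation can be sketched in plain pandas (made-up timestamps, not a streaming engine):

import pandas as pd

# Made-up event stream rolled up into 1-minute windows
events = pd.DataFrame({
    'timestamp': pd.to_datetime([
        '2024-01-01 10:00:05', '2024-01-01 10:00:40',
        '2024-01-01 10:01:10', '2024-01-01 10:01:55'
    ]),
    'value': [10, 20, 5, 15]
})

print(events.set_index('timestamp').resample('1min')['value'].sum())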


3. Self-Service BI and No-Code Tools

Non-technical users will increasingly use drag-and-drop platforms (like Tableau, Power BI, and Google Looker Studio) to perform complex aggregations without writing code.

This democratizes data access and empowers more teams to make data-driven decisions.


Name: Pravin Talawar

University: Sri Balaji University

Class: BCA2302341














