Introduction
Data visualization isn't just about making "pretty pictures." It is about communication. In TECH3100, you move beyond the basic charts you might see in Excel. You learn to use the R programming language to create high-quality, reproducible, and interactive graphics.
R was built by statisticians, for statisticians, which means its ability to handle complex data is unmatched. Throughout this course, you will learn how to take a raw dataset, clean it up, and choose the perfect visual format to reveal hidden trends. By the end of the term, you won’t just be a "coder"; you’ll be a digital storyteller.
Subject Objectives
What will you actually be able to do by the end of TECH3100? Kaplan designed this course with very specific goals:
- Master the R Environment: You will learn to navigate RStudio, the standard "workspace" for R developers.
- Implement the Grammar of Graphics: You will understand how to build plots layer-by-layer using the ggplot2 package.
- Wrangle Messy Data: Before you can visualize data, you have to "clean" it. You’ll learn how to filter, sort, and reshape data using the tidyverse.
- Select Appropriate Visuals: You will learn the logic behind choosing a bar chart vs. a scatter plot vs. a heat map.
- Tell a Story: You will learn how to add labels, themes, and annotations to make your charts easy for non-technical bosses to understand.
Core Topics & Concepts
This course focuses heavily on the Tidyverse, a collection of R packages designed for data science. Here are the big pillars of the syllabus:
A. The Grammar of Graphics (ggplot2)
The "GG" in ggplot2 stands for Grammar of Graphics. Instead of just clicking a button to make a chart, you build it using layers.
- Data: The actual numbers.
- Aesthetics (aes): Mapping variables to things you can see (like putting "Age" on the X-axis and "Salary" on the Y-axis).
- Geometries (geoms): The shape of the data (points for scatter plots, bars for bar charts).
- Themes: The "vibe" of the chart (background colors, fonts, and gridlines).
B. Data Wrangling (dplyr)
You can't visualize a mess. You will spend a lot of time learning "The Pipe" operator %>% or |>. This allows you to take a big dataset and "funnel" it through filters until it's ready to be plotted. For example:
- filter(): Pick only certain rows (e.g., only sales from 2026).
- mutate(): Create new columns based on old ones.
C. Choosing the Right Plot
TECH3100 teaches you that different data needs different "homes."
- Distribution: Use Histograms or Box Plots to see how spread out your data is.
- Relationships: Use Scatter Plots to see if two things (like $X$ and $Y$) are correlated.
- Trends: Use Line Graphs for time-series data (like stock prices over a year).
D. Interactive & Advanced Viz
Later in the course, you might touch on:
- Plotly: Turning static charts into ones you can hover over and zoom in on.
- Shiny: Building a web app where users can play with your data in real-time.
- Maps: Using the sf or leaflet packages to visualize geographic data.
Assignments & Assessment Tips
The assessments in TECH3100 are usually practical. You won't just take multiple-choice quizzes; you’ll submit "R Scripts" and reports.
- The Case Study Analysis: Often, you are given a dataset (like customer churn or health data) and told to find three insights. Tip: Don't just show the chart; explain why it matters to a business.
- Reproducibility is King: Your professor should be able to run your code on their computer and see the same chart you made. Use R Markdown to combine your code, your charts, and your writing into one clean PDF or HTML file.
- Follow the Rubric: Kaplan rubrics usually award points for "Code Cleanliness." Use comments (# like this) to explain what your code is doing.
- Label Everything: A chart without a title or axis labels is like a map without names. Use the labs() function in R to add titles, subtitles, and captions.
Common Challenges & Solutions
Challenge 1: "The Coding Learning Curve"
R can be intimidating if you've never coded before. One missing comma can make the whole script crash.
- Solution: Use the Help tab in RStudio. Typing? ggplot in the console will bring up the official manual with examples you can copy and paste.
Challenge 2: "Messy Data"
Sometimes your data has "NA" (missing values) or strange formatting.
- Solution: Master the na.omit() function or use drop_na() from the tidyr package to clean up your data before you try to plot it.
Challenge 3: "Over-Visualizing"
Students often try to put too much information into one chart, making it unreadable.
- Solution: Use Faceting. Use facet_wrap() to split one giant, messy chart into five small, clean ones based on a category (like "Country" or "Gender").
Recommended Resources
Textbooks & References:
- "R for Data Science" (2e) by Hadley Wickham: This is essentially the R "Bible." It is a free online resource that covers exactly what you need for TECH3100.
- ggplot2: Elegant Graphics for Data Analysis: Great for deep-diving into the "Grammar of Graphics."
- The R Graph Gallery: A website full of "recipes." If you want to make a specific type of chart, you can find the code here and adapt it.
Online Datasets:
Practice makes perfect. Use these sites to find data for your assignments:
- Kaggle: A massive repository of datasets on everything from sports to finance.
- TidyTuesday: A weekly social data project in R. Great for seeing how other pros design their charts.
- Built-in R Data: Type data() in your R console to see datasets already built into R, like iris or mtcars.
Conclusion
TECH3100 – Data Visualisation in R is a challenging but incredibly rewarding course. It moves you from "consuming" data to "creating" insights. Remember, the goal isn't to write the most complex code; it's to make the data speak.
Take it one layer at a time. Start with a simple bar chart, then add colors, then add themes, and before you know it, you’ll be building professional-grade dashboards that would impress any CEO.
FAQs
Q1: Is R harder than Python for data visualization?
For visualization, many find R easier because of ggplot2. While Python is a general-purpose language, R was specifically designed for data, making its visualization libraries more intuitive for "data-first" thinkers.
Q2: Do I need to be a math genius for this course?
No! While understanding basic statistics helps (like knowing what a "mean" or "median" is), the course is more about logic and design than doing manual calculations.
Q3: What is RStudio?
Think of R as the "engine" of a car and RStudio as the "dashboard." You write your code in RStudio because it makes it easy to see your files, your plots, and your errors all in one window.
Q4: Can I use these skills in a real job?
Absolutely. Data Visualization is a top-five skill for Data Analysts, Business Intelligence (BI) Developers, and Market Researchers. Companies like Google, Facebook, and the New York Times all use R for their data visuals.
Q5: What if my code shows a bunch of red text?
Don't panic! In R, red text in the console often just means "I don't understand." Read the error message carefully; it usually tells you exactly which line has the mistake.