# Setting Up Environment
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
Insight through Data
Crafting Visual Narratives for Corporate Strategy
In my role as the Data Insights Specialist II on the Learning and Impact team at the Center for Employment Opportunities, I directed the overall strategy for each quarterly board report. I identified key performance indicators (KPIs) by analyzing data and aligning them with the strategic goals of the organization. I coordinated the narrative, deciding which insights to present, and shaped the story to highlight our achievements and areas for improvement. I’d communicate these insights through visualizations that were clear and impactful.
Here, I’ll share a glimpse into how I’d transform raw data [usually obtained from various departments] into engaging visual stories that inform strategic and operational decisions. It’s all about turning complex data into clear, useful insights, even for those who aren’t familiar with data science or statistical jargon.
Each quarter, I’d set these goals for myself:
- Show Org-Wide Performance: Ensure strategic alignment with fiscal year objectives.
- Share Our Strategy for Improvement: Highlight key initiatives and outcomes
- Delivering Actionable Insights: Provide clear, impactful information to the board.
Wrangling the Data
At the end of each quarter, I’d reach out to various departments to gather updates on programs and activities. As the Data Insights Specialist II, I stayed informed on organizational activities year-round, as our team was heavily involved in evaluations and research. Proactively engaging with pilots, program performance, evaluations, research, and analyses allowed me to know exactly what to ask for when compiling the board report, ensuring relevant insights were included.
The Data
Usually, these updates came back to me as rudimentary figures, spreadsheets, or dashboards from which I’d pull data points. I would then download these into .csv files and process them using my favorite tool: pandas.



In addition to incorporating relevant program updates, we also included findings from the Learning and Impact team’s internal analyses. One quarter, we included insights derived from an evaluation of one of our intervention methods. These insights revealed specific characteristics and intervention dosages that significantly increased the likelihood of success for our participants.
“Likelihood”—there’s a term you’ll often hear in data science. Within the Learning and Impact team and in discussions across departments, terms like probabilities and likelihoods were straightforward to discuss, thanks to the context we could provide. However, translating these and other statistical terms accurately into language and visuals for the board—without the benefit of interactive dialogue—posed a unique challenge.
In the code blocks that follow, which I’ve anonymized for confidentiality, I’ll briefly demonstrate how I transformed data into visuals for our board updates. The data cleaning, processing, and manipulation steps shown below are routine practices I used to prepare the data points received from various departments. These steps were crucial for converting raw data, whether received through G-chat, Google Sheets, or emails, into clear, impactful board-level visual presentations.
# Read in Data
= pd.read_csv("anonymized_all.csv")
df1 = pd.read_csv("anonymized_fy24_only.csv") df2
# First few rows of raw data
df1.head() df2.head()
Unique Identifier | Start Date | Client Name | Job ID | |
---|---|---|---|---|
0 | 581013 | 9/25/23 | 581013 Record | job-193451 |
1 | 583586 | 7/17/23 | 583586 Record | job-198299 |
2 | 585047 | 7/11/23 | 585047 Record | job-198331 |
3 | 576444 | 7/9/23 | 576444 Record | job-198365 |
4 | 585401 | 9/18/23 | 585401 Record | job-198436 |
# Cleaning and transforming the data
def clean_df(df, date_col, drop_col):
= pd.to_datetime(df[date_col])
df[date_col] = df.drop([drop_col], axis = 1) # Drops Unnecessary Columns
df = df.columns.str.lower() # Standardize Column Names
df.columns
return df
= clean_df(df1,'Start Date', 'Client Name')
cleaned_df1 = clean_df(df2,'Start Date', 'Client Name') cleaned_df2
def fiscal_year(date): # Fiscal Year Calculation
if date.month >=7:
return date.year + 1
else:
return date.year
'fiscal year'] = cleaned_df1['start date'].apply(fiscal_year)
cleaned_df1['fiscal year'] = cleaned_df2['start date'].apply(fiscal_year) cleaned_df2[
# Check FYs in dataset
'fiscal year'].unique() cleaned_df1[
array([2021, 2022, 2023, 2024])
= cleaned_df1[cleaned_df1['fiscal year'].isin([2024])]
cleaned_previous # cleaned_previous.head()
= (
previous_weekly_performance
cleaned_previous'fiscal year', pd.Grouper(key='start date', freq='W')]).size()
.groupby([= 'previous_weekly_performance')
.reset_index(name
)# previous_weekly_performance
'cumulative_performance'] = (
previous_weekly_performance[
previous_weekly_performance'fiscal year')['previous_weekly_performance']
.groupby(
.cumsum()
) previous_weekly_performance.head()
fiscal year | start date | previous_weekly_performance | cumulative_performance | |
---|---|---|---|---|
0 | 2024 | 2023-07-02 | 15 | 15 |
1 | 2024 | 2023-07-09 | 77 | 92 |
2 | 2024 | 2023-07-16 | 99 | 191 |
3 | 2024 | 2023-07-23 | 71 | 262 |
4 | 2024 | 2023-07-30 | 72 | 334 |
= (
current_weekly_performance
cleaned_df2'fiscal year', pd.Grouper(key='start date', freq='W')]).size()
.groupby([= 'current_weekly_performance')
.reset_index(name
)'current_cumulative_performance']= (
current_weekly_performance[
current_weekly_performance'fiscal year')['current_weekly_performance']
.groupby(
.cumsum()
)# current_weekly_performance.head()
= pd.merge(previous_weekly_performance, current_weekly_performance, how = 'outer')
merged_df # merged_df.head()
Designing Clear Visuals
After extensive data manipulation—cleaning, pivoting, and generating initial visuals like bar charts—I begin to see potential ways to make the data understandable for the board.
For the quarterly board presentation that used the anonymized code blocks introduced earlier, I constructed the two charts below using the resulting merged_df
. Given the available variables, focusing on temporal dynamics and comparative metrics provides a valuable direction to uncover significant trends and irregularities to highlight in the final product.
Now, with some initial visualizations created, I can focus on refining and enhancing the clarity of the data presentation and insight delivery. One best practice I adhere to in my visualization work is the “30-second rule” (sometimes it’s more like 2 minutes). The idea is that viewers should quickly grasp the intent of a chart or graph. This contrasts with the more abstract visualizations by my favoured artists like Federica Fragipane and Alberto Cairo, who target a different audience.
For board presentations, I prioritize straightforward designs and avoid overly complex or abstract visuals. Sketching with pencil and paper after extensive screen time helps me push the boundaries of basic charts just enough to enhance clarity without becoming too abstract. This physical connection with my work makes the creative process more tangible and refreshing. It allows me to identify where strategic annotations, adjustments to bar charts, or repositioning elements in a scatterplot can significantly improve the visual clarity and effectiveness.
With these principles in mind I refine and enhance the clarity of the data presentation to produce final visuals.
Final Product
Below are some examples of the final visualizations I have created for corporate board presentations. These visuals are drawn from various projects, each tailored to address specific strategic questions. From demonstrating engagement levels across regions to highlighting the efficacy of strategic efforts over time, these visuals represent the culmination of thorough data analysis and creative design.
Intervention Visualization: This chart illustrates regional engagement levels following strategic interventions. Each number represents the number interactions per intervention type aimed at enhancing interaction effectiveness.
Performance Trend Chart The line graph depicts the effectiveness of strategic efforts, showing a notable increase in outcomes following specific interventions. Key moments, like the significant improvement marked in the chart, are pivotal discussion points in strategy meetings.
Thanks for reading :)