SAS-SCHULZ-MC1

VAST Challenge 2022

Challenge 1

Team Information

Team members:

Student team?

No

Analytics tools used:

Approximately how many hours were spent working on this submission in total?

100 hours

May we post your submission in the Visual Analytics Benchmark Repository after VAST Challenge 2022 is complete?

Yes

Video

Watch Video on YouTube

Questions

Question 1 | Question 2 | Question 3 | Question 4

Demographics and Relationships involves understanding the city's demographics. Given social networks and other information about the city, you will analyze the available data to prepare a one-page fact sheet about the city's demographics, its neighborhoods, and its business base.

NOTE

The answer form contains interactive visualizations that rely on content hosted on GitHub pages and CDNs. If you encounter issues with these visualizations first try viewing in Chrome and disabling any ad blocking. If the interactive visuals still do not work a static fallback version can be forced by clicking this link:

Static Answer Form



1Assuming the volunteers are representative of the city's population, characterize what you can about the demographics of the town. Provide your rationale and supporting data.

Limit your response to 10 images and 500 words.

Age

Age ranges from 18 to 60. Average Age is 39. Most cases (704 of 880) have an age between 22 and 56.

Map of Age Group Distributions Throughout the City

Apartment Location

Central is the most common value representing 45.68% (402 of 880) of Apartment Location. Other common values are Northwest (at 29.43%) and East (at 14.09%).

Rental Costs in Different Areas of the City

Apartment Cost

"$$" is the most common value representing 31.97% (485 of 1.5K) of Apartment Cost. Other common values are "$$$" (at 27.88%) and "$" (at 24.65%).

Correlated Groupings of Education Level, Household Size, and Families with Children

Households Size

Household Size ranges from 1 to 3. Average Household Size is 1.9.

Children (yes/no)

"no" is the most common value representing 71.93% (633 of 880) of Children (yes/no). Another common value is "yes" (at 28.07%).

Education Level

"HighSchoolOrCollege" is the most common value representing 47.95% (422 of 880) of Education Level. Other common values are "Bachelors" (at 26.36%) and "Graduate" (at 19.32%).

Location of Schools, Restaurants, and Pubs

Schools

Enrollment (max) ranges from 242 to 418. Average Enrollment (max) is 360. Most cases (3 of 4) have an Enrollment (max) between 242 and 418.

Restaurants

Food Cost ranges from $4.1 to $5.9. Average Food Cost is $5. Most cases (16 of 20) have a Food Cost between $4.2 and $5.9.

Pubs

Hourly Cost ranges from $6.4 to $15. Average Hourly Cost is $11. Most cases (9 of 12) have an Hour Cost between $7.5 and $14.

Job Availability and Education Requirements for Employers

Employer Job Count

Employer Job Count ranges from 2 to 9. Average Employer Job Count is 5.2. Most cases (202 of 253) have an Employer Job Count between 2 and 8.

Employer Education Requirements

"HighSchoolOrCollege" is the most common value representing 72.56% (2.2k of 3K) of Education Level (min req). other common values are "Bachelors" (at 12.64%) and "Low" (at 8.93%).

Breakdown of Job Types Available

Wage Range

$10 - $20 is the most common value representing 67.24% (893 of 1.3K) of Wage Range. Other common values are $20 - $50 (at 29.59%) and $50 - $100 (at 3.16%).

Hourly Rate

Hourly Rate ranges from $10 to $100. Average Hourly Rate is $19. Most cases (1.1K of 1.3K) have an Hourly Rate between $10 and $33.

Job Start Time

8:00 to 8:29 is the most common value representing 26.20% (348 of 1.3K) of Job Start Time.

Job End Time

16:00 to 16:29 is the most common value representing 26.20% (348 of 1.3K) of Job End Time.

Examination of Distribution of Jobs Not in The Study

Job in Study

"In Study" is the most common value representing 89.6% (1.2K of 1.3K).

2Consider the social activities in the community. What patterns do you see in the social networks in the town? Describe up to ten significant patterns you observe, with evidence and rationale.

Limit your response to 10 images and 500 words.

There are several anomalies in the first few days of the study, such as people having multiple jobs with some lasting only 5 minutes. The study started with 1,011 participants with 32 dropping out after 4 days and another 99 dropping out after 5 days. None of the participants who dropped out of the study lived in an apartment and they all traveled to School. Our conclusion is these participants stayed in the dorms at college/university and traveled home at the end of the school year. The remaining 880 participants completed the study, which ran 450 days.

Impact of Anomalies Found in First Week of Study

The analysis in this report is only on those that participated in the entire study.

Impact of Anomalies Found in First Week of Study

On weekends, the longest average travel time was between Apartments and Pubs. On weekdays, the longest average travel time was between Apartments and Employers. Participants also had a longer average time staying at their Employer on weekdays versus weekends.

Comparison of Weekday and Weekend Travel Times

Using SAS Viya's network analytics procedure, we searched for various travel patterns to better understand the daily social activities of the participants. When analyzing patterns, we took into account the travel end location, purpose and check-in time. Using pattern match algorithm, we are able to determine the frequency of given patterns in the data. These were the primary patterns included in our search:

workDays
Number of days where the participant goes to work.
pubEvents
Number of times where the participant goes to a pub.
eatEvents
Number of times where the participant goes to a restaurant.
goingToEatAtWork
The participant goes to work, eats in the middle of work, then goes home.
twoJobs
The participant goes to work at one job then goes to work at a different job.
goingToPubAtWork
The participant goes to work, goes to a pub in the middle of work, then goes home.
goingToPubAfterWork
The participant goes to work, comes home, then goes to a pub.
goingToEatAfterWork
The participant goes to work, comes home, then goes to a restaurant.
multiplePubsOneDay
The participant goes out to at least two pubs.

These patterns can be grouped into Working, Pub, and Restaurant visits to better describe their daily social activities.

Distribution For Number of Days Spent at Work

Eating out is a common activity after work and in general. The median participant ate out 469 times in total and ate out 130 times after a work day (40% of the work days).

Restaurant Visit Details

Visiting pubs were a common activity even though 13 participants refrained from going to pubs altogether. Thankfully no participant went to a pub during a work shift. However going to pubs after work and on non-work days was common, often multiple times in one day. The median participant went to a pub 364 times in total, 71 times after work (22% of the work days), and had 100 days that contained multiple pub trips.

Pub Visit Details

3Identify the predominant business base of the town, and describe patterns you observe.

Limit your response to 10 images and 500 words.

The citizens in the study primarily went to two locations during their free time: pubs and restaurants. Pubs were the dominant place to engage in social gatherings. There are five primary locations of pubs and restaurants:

Pub and Restaurant Sales Across the City

Throughout the study, two pubs dominated all sales in the central part of town: 1342 and 1344. These two pubs made up over 20% of all sales alone. Additionally when performing a network analysis of the travel network involving business pubs 1342 and 1344 had nearly twice the centrality score of any other business, implying that they are a common gathering location.

Nearly 78% of all pub and restaurant sales occurred between the Northwest, Southwest Central, and Central parts of town.

Sales Breakdown by District and Unit

Restaurants had the lowest number of sales within the study. Sales tended to be distributed throughout the city relatively evenly, but the central part of the city had the lowest number of sales. The best performing restaurant, 1801, is located in the northwest part of the city. The poorest performing restaurant, 1349, is located in the central part of the city.

Restaurant Sales Across the City

Throughout the course of the study, Restaurants and Pubs were shown to have very different behavior in terms of the times that participants spent money. Regardless of the location of the pub, participants tended to spend the most money on weekends between 9AM and 12:00AM, and weekdays from 5PM to 12:00AM. Pubs tended to have fewer sales on weekdays between 6AM and 4PM.

Daily Pub Sales Patterns

Restaurants told another story. Restaurants, however, generate most of their revenue between 7AM and 8AM on weekends and 11AM and 3PM on weekdays.

Daily Restaurant Sales Patterns

Sales for pubs across all regions exhibit identical seasonality, with the most sales occurring on Saturday and the least sales occurring on Wednesdays. There is no growth in spending for either restaurants or pubs based on these participants' habits. Sales declined between March and May before stabilizing to an overall average of approximately $340,000/month.

Pub Sales Pattern by District

Restaurants, however, tell a different story. Seasonality is rarely exhibited within the identified districts, but individual restaurants often exhibit their own seasonality. When grouped together in districts, restaurants tend to have erratic seasonality. This is due to each individual restaurant's patterns not matching up with other restaurants. For example, Restaurant 1804 has very clear seasonality that peaks on Sundays, but this seasonality is not identical to other restaurants in the Southern district.

Restaurant Sales Pattern by District
Sales Pattern for Restaurant 1804
Sales Pattern for Northwestern Restaurants

We discovered 6 clusters of weekly sale patterns by restaurants and pubs. There is no clear relationship between weekly sales patterns and location within the city. The y axis represents normalized total sales. The higher the value, the greater the sales at that time. The clusters below are zoomed in to a representative date range and is designed to show the general shape of sales patterns over time.

Cluster 1
Consists almost entirely of pubs with the exception of a single restaurant (895) that has similar demand patterns to pubs. Saturday and Sunday both have sharp increases in sales. Weekdays tend to have much fewer sales.
Cluster 2
Restaurants whose sales tend to be greatest during weekdays. Weekends have fewer sales.
Cluster 3
Restaurants whose sales tend to be greatest on Saturday followed by Sunday. Weekdays have fewer sales.
Cluster 4
Two restaurants whose sales are relatively evenly distributed across weekdays with few peaks in-between.
Cluster 5
Four restaurants whose sales are generally higher on weekdays but tend to have slightly higher sales on weekends compared to clusters 2, 3, and 4. Sales tend to be more erratic throughout the week compared to other clusters.
Cluster 6
Four restaurants whose sales tend to peak on Sundays, with Saturday close behind.
Comparison of Time Series Cluster Patterns

4From your answers to questions 1-3, assemble a one-page summary that provides the key information to share with residents about the town.

COME VISIT
Engagement Ohio



Our town has lots to offer those looking to relocate or to visit!


Nestled between the convenience of Dayton and the smaller charm of Marion, Engagement Ohio is a perfect place to visit for a day of fun and relaxation. The town is home to a variety of activities and events that are sure to make your stay a memorable one.