The results below are likely only meaningful to subject matter experts because the source dataset employs abbreviations, jargon and/or otherwise non-obvious labels. You may get in touch to help improve the source data, or you may browse Analyst-2 to find more accessible datasets.

Craft Beers Dataset

2K+ craft canned beers from the US and 500+ breweries in the United States. (nickhould/craft-cans) [more]

Context

It's a great time to be a craft beer fan in the U.S.! There are a ton of beer styles and brands to choose from and breweries have become very successful in the last several years. Breweries owe it all to beer lovers around the world! This dataset contains a list of 2,410 US craft beers and 510 US breweries. The beers and breweries are linked together by the "id". This data was collected in January 2017 from CraftCans.com. The dataset is an a tidy format and values have been cleaned up for your enjoyment.

Content

beers.csv - Contains data on 2000+ craft canned beers

breweries.csv - Contains data for 500+ breweries in the United States

Acknowledgements

If you are interested in learning more about how this dataset was acquired, I wrote an extensive blogpost about it http://www.jeannicholashould.com/python-web-scraping-tutorial-for-craft-beers.html.

Inspiration

Can you predict the beer type from the characteristics provided in the dataset?

What is the most popular beer in North Dakota?

Enjoy!

Data summary

File 'beers.csv'
- Table ‘beers’ consists of 2410 data rows along nine dimensions: ‘Column #1’, ‘abv’, ‘ibu’, ‘id’, ‘name’, ‘style’, ‘brewery_id’, ‘ounces’ and ‘Column #9’

File 'breweries.csv'
- Table ‘breweries’ consists of 558 data rows along four dimensions: ‘Column #1’, ‘name’, ‘city’ and ‘state’

Size: 53.8 KB Source: Kaggle Last updated: 2022-01-28 14:27

Analysis completed in 11 minutes 53 seconds ()

View

The quick Introduction provides a quick view into the dataset automatically analyzed by Inspirient.

View

The most Relevant Insights report presents the ten most relevant insights automatically selected by Inspirient. The insights were chosen because they have a high dimension priority and highlight a relevant pattern.

View

The data Quality Assessment assesses the quality of the input data and prioritizes mitigation steps based on analytical impact and ease of implementation

Explore all 117 insights

Explore all insights

Explore more datasets

Analyst-2 explores entire data repositories and data lakes, autonomously analyzing each dataset using the Inspirient Automated Analytics Engine.

If you would like Analyst-2 to surface insights in your company's data repository or data lake, please get in touch!

These analysis results by Inspirient GmbH are licensed under a Creative Commons Attribution 4.0 International License in conjunction with the licence of the source dataset.

Dimension	Priority
For table beers* in lines 1–2411 of input file beers.csv…*
Column #1 Integer Number
abv Floating-Point Number
ibu Integer Number
id Integer Number
name String
style String
brewery_id String
ounces Floating-Point Number
Column #9 Integer Number

Dimension	Annotations
For table beers* in lines 1–2411 of input file beers.csv…*
Column #1 Integer Number
abv Floating-Point Number
ibu Integer Number
id Integer Number	ID
name String
style String
brewery_id String	ID
ounces Floating-Point Number
Column #9 Integer Number

Craft Beers Dataset

Context

Content

Acknowledgements

Inspiration

Data summary

File 'beers.csv'

File 'breweries.csv'

Analysis fidelity reduced to speed things up

Quick introduction to this dataset

Most relevant results discovered in this dataset

Data quality assessment of this dataset

All insights currently in focus, ranked by relevance

Search Tags

User Tags

Source dimensions

Generated dimensions

Methods

Patterns

Hotspots

Stories

Craft Beers Dataset

Context

Content

Acknowledgements

Inspiration

Data summary

File 'beers.csv'

File 'breweries.csv'

Analysis fidelity reduced to speed things up

Quick introduction to this dataset

Most relevant results discovered in this dataset

Data quality assessment of this dataset

All insights currently in focus, ranked by relevance