The results below are likely only meaningful to subject matter experts because the source dataset employs abbreviations, jargon and/or otherwise non-obvious labels. You may get in touch to help improve the source data, or you may browse Analyst-2 to find more accessible datasets.

Craft Beers Dataset

2K+ craft canned beers from the US and 500+ breweries in the United States. (nickhould/craft-cans)   []

Context

It's a great time to be a craft beer fan in the U.S.! There are a ton of beer styles and brands to choose from and breweries have become very successful in the last several years. Breweries owe it all to beer lovers around the world! This dataset contains a list of 2,410 US craft beers and 510 US breweries. The beers and breweries are linked together by the "id". This data was collected in January 2017 from CraftCans.com. The dataset is an a tidy format and values have been cleaned up for your enjoyment.

Content

beers.csv - Contains data on 2000+ craft canned beers

breweries.csv - Contains data for 500+ breweries in the United States

Acknowledgements

If you are interested in learning more about how this dataset was acquired, I wrote an extensive blogpost about it http://www.jeannicholashould.com/python-web-scraping-tutorial-for-craft-beers.html.

Inspiration

Can you predict the beer type from the characteristics provided in the dataset?

What is the most popular beer in North Dakota?

Enjoy!

Data summary

  • File 'beers.csv'

    • Table ‘beers’ consists of 2410 data rows along nine dimensions: ‘Column #1’, ‘abv’, ‘ibu’, ‘id’, ‘name’, ‘style’, ‘brewery_id’, ‘ounces’ and ‘Column #9’
  • File 'breweries.csv'

    • Table ‘breweries’ consists of 558 data rows along four dimensions: ‘Column #1’, ‘name’, ‘city’ and ‘state’

Size: 53.8 KBSource: KaggleLast updated: 2022-01-28 14:27

Quick introduction to this dataset

The quick Introduction provides a quick view into the dataset automatically analyzed by Inspirient.

Most relevant results discovered in this dataset

The most Relevant Insights report presents the ten most relevant insights automatically selected by Inspirient. The insights were chosen because they have a high dimension priority and highlight a relevant pattern.

Data quality assessment of this dataset

The data Quality Assessment assesses the quality of the input data and prioritizes mitigation steps based on analytical impact and ease of implementation

All insights currently in focus, ranked by relevance

Analyst-2 explores entire data repositories and data lakes, autonomously analyzing each dataset using the Inspirient Automated Analytics Engine.

If you would like Analyst-2 to surface insights in your company's data repository or data lake, please get in touch!

Creative Commons License

These analysis results by Inspirient GmbH are licensed under a Creative Commons Attribution 4.0 International License in conjunction with the licence of the source dataset.