Analyst-2 explores entire data repositories and data lakes, autonomously analyzing each dataset. Browse analysis results or search for topics of interest below!
Size: 287.0 KB
Size: 5.7 KB
Air quality and health (marprezd/air-pollution-exposure-and-effects)   [more]
Air pollution Air pollution is one of the most pressing environmental and health issues across OECD countries and beyond.
Air pollution exposure and air pollution effects Fine particulate matter (PM2.5) in the air pollutant that poses the greatest risk to health globally, affecting more people than any other pollutant. Chronic exposure to PM2.5 considerably increases the risk of respiratory and cardiovascular diseases in particular. Data refer to population exposure to more than 10 micrograms/m3 and are expressed as annual averages.
Fine particulate matter (PM2.5) can be inhaled and cause serious health problems including both respiratory and cardiovascular disease, having its most severe effects on children and elderly people. Exposure to PM2.5 has been shown to considerably increase the risk of heart disease and stroke in particular. Cost estimates represent only the cost of premature mortalities. They are calculated using estimates of the “Value of a Statistical Life” (VSL) and the number of premature deaths attributable to ambient particulate matter.
Size: 5.2 KB
Size: 12.6 KB
Air and GHG Emissions (cemalcemtastan/air-and-ghg-emissions-oecd)   [more]
Greenhouse gases refer to the sum of seven gases that have direct effects on climate change : carbon dioxide (CO2), methane (CH4), nitrous oxide (N2O), chlorofluorocarbons (CFCs), hydrofluorocarbons (HFCs), perfluorocarbons (PFCs), sulphur hexafluoride (SF6) and nitrogen trifluoride (NF3). The data are expressed in CO2 equivalents and refer to gross direct emissions from human activities. CO2 refers to gross direct emissions from fuel combustion only and data are provided by the International Energy Agency. Other air emissions include emissions of sulphur oxides (SOx) and nitrogen oxides (NOx) given as quantities of SO2 and NO2, emissions of carbon monoxide (CO), and emissions of volatile organic compounds (VOC), excluding methane. Air and greenhouse gas emissions are measured in thousand tonnes, tonnes per capita or kilogrammes per capita except for CO2, which is measured in million tonnes and tonnes per capita
Size: 9.3 KB
Alcohol Consumption by countries in Average serving sizes per person (codebreaker619/alcohol-comsumption-around-the-world)   [more]
Contains the data behind the story "Dear Mona Followup: Where Do People Drink The Most Beer, Wine And Spirits?"
Units: Average serving sizes per person Source: World Health Organisation, Global Information System on Alcohol and Health (GISAH), 2010
Size: 2.5 KB
List of PlayStation 4 games (grafstor/ps4-games)   [more]
I wanted a dataset with games
Size: 88.6 KB
Size: 1.3 KB
Detect Android Malware using Machine Learning (shashwatwork/android-malware-dataset-for-machine-learning)   [more]
"Mobile malware is malicious software that targets mobile phones or wireless-enabled Personal digital assistants (PDA), by causing the collapse of the system and loss or leakage of confidential information. As wireless phones and PDA networks have become more and more common and have grown in complexity, it has become increasingly difficult to ensure their safety and security against electronic attacks in the form of viruses or other malware."
Dataset consisting of feature vectors of 215 attributes extracted from 15,036 applications (5,560 malware apps from Drebin project and 9,476 benign apps). The dataset has been used to develop and evaluate multilevel classifier fusion approach for Android malware detection, published in the IEEE Transactions on Cybernetics paper 'DroidFusion: A Novel Multilevel Classifier Fusion Approach for Android Malware Detection. The supporting file contains the description of the feature vectors/attributes obtained via static code analysis of the Android apps.
Yerima, Suleiman (2018): Android malware dataset for machine learning 2. figshare. Dataset. https://doi.org/10.6084/m9.figshare.5854653.v1 Data Source - https://figshare.com/articles/dataset/Android_malware_dataset_for_machine_learning_2/5854653 Literature URL - https://ieeexplore.ieee.org/document/8245867
Size: 417.4 KB
Complete Blood Count Anemia Diagnosis (saurabhshahane/anemia-diagnosis-dataset)   [more]
This data set presents the prevalence of different types of Anemia including it’s severity and association with age and gender of the study population with CBC data set parameters as variables. We generated dataset from complete blood count test performed by Hematology analyzer to determine the prevalence of different types of Anemia treated at the Eureka diagnostic center in Lucknow, India. All the procedures for the CBC test were done following standard operating protocols defined for the Hematology analyzer. For CBC investigation, 400 patient samples were randomly selected to compute the dataset from the patients who visited the Eureka diagnostic center in Lucknow for various clinical examinations. The diagnostic center performs 4 – 8CBC investigations a day on average. During the data collection period between September 2020 to December 2020, 1000 CBC investigations were performed, out of which 400 random samples were selected. We included adult males and females who are not pregnant and older than 15 years of age in the study population. Infants, young children less than 10 years old and pregnant women were excluded from the study due to various factors like variable CBC test values and other factors. After excluding the above stated persons from the randomly chosen sample of 400 patients, we were left with 364 patients in the final data set.
Vohra, Rajan; pahareeya, jankisharan; Hussain, Abir (2021), “Complete Blood Count Anemia Diagnosis”, Mendeley Data, V1, doi: 10.17632/dy9mfjchm7.1
Size: 7.6 KB
These analysis results by Inspirient GmbH are licensed under a Creative Commons Attribution 4.0 International License in conjunction with the license of the respective source dataset.
To suggest additional public data repositories for Analyst-2 to analyze, please get in touch!