Analyst-2 explores entire data repositories and data lakes, autonomously analyzing each dataset. Browse analysis results or search for topics of interest below!
Dataset of 50 Online Services Advertised in the Internet Marketing Forum (saurabhshahane/50-online-services-dataset)   [more]
Dataset of organized services contains the top 50 services offering products or services in the forum. The services were the most discussed in their own forum categories. The data was gathered throughout 2020 and January 2021.
For every service the dataset contains the following features: * User Forum Name: forum user that posted about the service. * User Forum Rating: the rating of the user in the forum. * User Forum Registration: the date the user registered in the forum. * Type of Service Offered: personal, professional, training, computer programs, conferences and events. * Types of Advertisement: individuals, groups, organizations * Service Overall Maturity Level: low, medium, high * Type of User Advertising: on their own behalf, on behalf of a group, on behalf of an organization * Service Label: personal service, emerging service, organized service * searchengines.guru Category: forum main category where the service was posted * searchengines.guru Sub Category: forum sub-category where the service was posted * Country of Legal Terms: which country laws they abide * Domain Registration: year the domain name of the service was first registered * Domain Registrar: registrar organization associated with the domain name * Site ASN: autonomous system information associated with the website * Site ISP: internet service provider associated with the website * Site Hosting Country: country where the website is hosted * Site Alexa Ranking: Alexa website popularity global ranking * Web Reputation: Cisco Umbrella Security Score: security score based on the domain name. The final risk scores to assess a domain's reputation are Low Risk, Medium Risk, and High Risk. * Web Reputation: BrightCloud® TI Risk: risk level based on the domain name (Trustworthy, Low Risk, ModerateRisk, Suspicious, and High Risk) * Web Reputation: Suspicious Activity from VirusTotal Intelligence Indicators: a positive value (yes) means there is at least one indicator retrieved from VirusTotal that associated the domain with malicious behavior. * Web Reputation: OSINT Reports on Suspicious Behavior: a positive value (yes) if there is at least one OSINT report tying the domain to malicious behavior. * Trustworthiness Label #1: untrustworthy if at least one of the web reputation indicators is positive. Otherwise trustworthy. * Trustworthiness Label #2: untrustworthy if at least two of the web reputation indicators is positive. Otherwise trustworthy. * Trustworthiness Label #3: untrustworthy if at least three of the web reputation indicators is positive. Otherwise trustworthy. * Trustworthiness Label #4: untrustworthy if at least four of the web reputation indicators is positive. Otherwise trustworthy.
Valeros, Veronica; Garcia, Sebastian (2021), “Dataset of 50 Online Services Advertised in the Internet Marketing Forum searchengines.guru”, Mendeley Data, V1, doi: 10.17632/48gyrs6y37.1
Size: 4.1 KB
Predict which companies to invest for maximizing profit (amineoumous/50-startups-data)   [more]
Size: 1.3 KB
65 Years of Weather Data Bangladesh Preprocessed (emonreza/65-years-of-weather-data-bangladesh-preprocessed)   [more]
The dataset is a combination of several datasets. Most of them were taken from various sources and combined together. The data is from the Bangladesh Meteorological Department (BMD).
The dataset contains Monthly average of Maximum Temperature, Minimum Temperature, Rainfall, Relative Humidity, Wind Speed, Cloud Coverage and Bright Sunshine of Bangladesh from the period 1948 to 2013 of specific ares. It also has the Weather Station Numbers, the X, Y coordinates, Latitude, Longitude and Altitude. I processed the data to fill the null rows with some algorithms.
The data is taken from Bangladesh Meteorological Department (BMD) from various sources.
The weather and climate of Bangladesh is changing very rapidly. This country has one of the most diverse weather conditions in the world but a very little research has been done so far. Feel free to use this dataset for any scientific work or predicting the future weather condition.
Size: 391.7 KB
Nutritions Insecurity (nitishvig007/80cerealdataset)   [more]
What does the people like to have in cereals. Is it hot or cold? Does it matter if they take high sugary or high protein ones?
Fields in the dataset:
Name: Name of cereal mfr: Manufacturer of cereal A = American Home Food Products G = General Mills K = Kelloggs N = Nabisco P = Post Q = Quaker Oats R = Ralston Purina type: cold hot calories: calories per serving protein: grams of protein fat: grams of fat sodium: milligrams of sodium fiber: grams of dietary fiber carbo: grams of complex carbohydrates sugars: grams of sugars potass: milligrams of potassium vitamins: vitamins and minerals - 0, 25, or 100, indicating the typical percentage of FDA recommended shelf: display shelf (1, 2, or 3, counting from the floor) weight: weight in ounces of one serving cups: number of cups in one serving rating: a rating of the cereals (Possibly from Consumer Reports?)
These datasets have been gathered and cleaned up by Petra Isenberg, Pierre Dragicevic and Yvonne Jansen.
Eat too much sugary cereal? Ruin your appetite with this dataset! Does the brand name matter before taking a good cereal
Size: 311.0 KB
Size: 64.5 KB
Size: 18.4 KB
ALS Projected Life Expectancy (mpwolke/cusersmarildownloadsprevalencecsv)   [more]
Four comparisons are shown for the years 2020 and 2066, which is the last year for which projected life expectancy data is available.
Predicting the future of ALS: the impact of demographic change and potential new treatments on the prevalence of ALS in the United Kingdom, 2020–2116 https://doi.org/10.1080/21678421.2019.1587629
Table: Summary of prevalence estimates for males, females, and both at baseline and for each modeled scenario, showing first and last year of available data.
Authors: Alison Gowland, Sarah Opie-Martin, Kirsten M. Scott, Ashley R. Jones, Puja R. Mehta, Christine J. Batts, Cathy M. Ellis, P. Nigel Leigh, Christopher E. Shaw , Jemeen Sreedharan & Ammar Al-Chalabi - https://doi.org/10.1080/21678421.2019.1587629
Amyotrophic Lateral Sclerosis.
Size: 0.5 KB
From JetBrains Datalore (pascalhouba/amd-radeon-and-nvidia-gpu-specifications)   [more]
Size: 7.0 KB
Predicting the age of abalone from physical measurements. (hurshd0/abalone-uci)   [more]
Abalone is common name for any group of small to very large sea snails, commonly found along the coasts across the world, and used as delicacy in cusinies and it's leftover shell is fashioned into jewelery due to it's iridescent luster. Due to it's demand and economic value it's often harvested in farms, and as such the need to predict the age of abalone from physical measurements. Traditional approach to determine it's age is by cutting the shell through the cone, staining it, and counting the number of rings through a microscope -- a boring and time-consuming task.
From the original data examples with missing values were removed (the majority having the predicted value missing), and the ranges of the continuous values have been scaled for use with an ANN (by dividing by 200).
Number of instances:
Number of attributes:
Features: Sex, Length, Diameter, Height, Whole weight, Shucked weight, Viscera weight, and Shell weight
> Note: Number of rings is the value to predict: either as a continuous value or it can be converted to classification problem.
Given below is attribute name, type, measurement, and brief description.
Name Data Type Meas. Description ----- --------- ----- ----------- Sex nominal M, F, and I (infant) Length continuous mm Longest shell measurement Diameter continuous mm perpendicular to length Height continuous mm with meat in shell Whole weight continuous grams whole abalone Shucked weight continuous grams weight of meat Viscera weight continuous grams gut weight (after bleeding) Shell weight continuous grams after being dried Rings integer +1.5 gives the age in years
Dataset comes from UCI Machine Learning repository: https://archive.ics.uci.edu/ml/datasets/Abalone
Size: 52.3 KB
Size: 761.6 KB
These analysis results by Inspirient GmbH are licensed under a Creative Commons Attribution 4.0 International License in conjunction with the license of the respective source dataset.
To suggest additional public data repositories for Analyst-2 to analyze, please get in touch!