28 Jan: Understand and describe the nature and structure of the selected dataset. A brief description of the dataset. Identify the features of the dataset.
computer science multi-part question
Follow the instructions written inside the file. I do not want the work to match (be flagged as similar to) other submissions.
Requirements:
Description and Instructions
Students can form groups of three and send their names to the instructor before 3rd January, then select one dataset from the datasets provided at the link below.
“10 free public datasets for EDA”
Use only one dataset and analyze the data using Microsoft Excel to discover the structure of data, trends, patterns, or any anomalies in the data based on your own hypothesis. Perform the following tasks. You should use visualization to aid your answer.
The final project report must incorporate all of the following 5 tasks and be written using the provided template (14 marks distributed among the tasks below).
==========================================================
Task 1: Understand and describe the nature and structure of the selected dataset. (3 marks)
Give a brief description of the dataset.
Identify the features of the dataset.
Propose a hypothesis or assumption (between 2 variables) to validate.
Task 2: Reduce the dimensions of the dataset to support the hypothesis validation. If necessary, preprocess the data to handle missing values, duplicate values, etc. You can also generate a new feature from any of the provided features if it supports your hypothesis. Because of the limited processing power of some devices, you can reduce your dataset to 1000 tuples. (3 marks)
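The assignment itself must be done in Microsoft Excel, so the following is only a minimal Python sketch of the Task 2 steps, useful for cross-checking a cleaned sheet. The column names (`height_cm`, `weight_kg`), the derived `bmi` feature, and the helper names `preprocess` and `add_bmi` are hypothetical and not taken from any of the provided datasets.

```python
import random

def preprocess(rows, required, max_rows=1000, seed=0):
    """Drop rows with missing required values, remove exact duplicates,
    and cap the result at max_rows by random sampling."""
    seen = set()
    cleaned = []
    for row in rows:
        # Treat None and empty or whitespace-only strings as missing.
        if any(row.get(col) is None or str(row.get(col)).strip() == ""
               for col in required):
            continue
        # A row counts as a duplicate only if every column matches.
        key = tuple(sorted(row.items()))
        if key in seen:
            continue
        seen.add(key)
        cleaned.append(dict(row))
    if len(cleaned) > max_rows:
        # A fixed seed keeps the 1000-row sample reproducible.
        cleaned = random.Random(seed).sample(cleaned, max_rows)
    return cleaned

def add_bmi(rows):
    """Generate a new feature from two existing ones (hypothetical example)."""
    for row in rows:
        height_m = row["height_cm"] / 100
        row["bmi"] = row["weight_kg"] / height_m ** 2
    return rows

# Hypothetical sample data: one duplicate row and one row with a missing value.
rows = [
    {"height_cm": 170, "weight_kg": 65},
    {"height_cm": 170, "weight_kg": 65},
    {"height_cm": None, "weight_kg": 70},
    {"height_cm": 180, "weight_kg": 81},
]
cleaned = add_bmi(preprocess(rows, required=["height_cm", "weight_kg"]))
```

In Excel, roughly the same steps are available through Remove Duplicates, filtering on blank cells, and a formula column for the new feature.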
Task 3: Provide descriptive statistics for some features using statistical methods to understand the dataset better, and answer the following analysis questions: (4 marks)
Compare different attributes (features). What trend did you find?
Include any of the measures of central tendency, such as the mean, median, and mode.
Describe the spread of your data. This may include measures such as variance, standard deviation, skewness, and kurtosis.
(You are encouraged to impose other analysis questions based on any trend you notice in the dataset).
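As a cross-check for the Task 3 figures produced in Excel, the measures listed above can be sketched in Python using only the standard library. The function name `describe` and the sample values are illustrative assumptions. Variance and standard deviation here are the sample versions (comparable to Excel's VAR.S and STDEV.S); skewness and kurtosis are computed from population moments, so they may differ slightly from Excel's SKEW and KURT, which apply small-sample corrections.

```python
import statistics

def describe(values):
    """Return central-tendency and spread measures for a list of numbers."""
    n = len(values)
    mean = statistics.fmean(values)
    # Central moments about the mean (population form, divided by n).
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    m4 = sum((x - mean) ** 4 for x in values) / n
    return {
        "mean": mean,
        "median": statistics.median(values),
        # With several equally common values, the first one seen is returned.
        "mode": statistics.mode(values),
        "variance": statistics.variance(values),  # sample variance (n - 1)
        "std_dev": statistics.stdev(values),      # sample standard deviation
        "skewness": m3 / m2 ** 1.5,               # 0 for symmetric data
        "kurtosis": m4 / m2 ** 2 - 3,             # excess kurtosis
    }

# Hypothetical feature values.
summary = describe([2, 4, 4, 4, 5, 5, 7, 9])
```

A positive skewness indicates a longer right tail, and a negative excess kurtosis indicates a flatter distribution than the normal curve.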
Task 4: Validate the hypothesis in Task 3 by investigating the relationship between two quantitative variables you have chosen using correlation, regression and R-squared with possible conclusions. (3 marks)
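The three quantities named in Task 4 are closely related for a simple (one-predictor) linear regression: the slope and intercept come from least squares, and R-squared equals the square of the Pearson correlation coefficient. The sketch below illustrates this in plain Python; `linear_fit` and the sample values are hypothetical, and in Excel the comparable functions are CORREL, SLOPE, INTERCEPT, and RSQ.

```python
import math

def linear_fit(xs, ys):
    """Least-squares line y = slope * x + intercept, plus Pearson r and R-squared."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squared deviations and cross-products about the means.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    # For one predictor, R-squared is the squared correlation.
    return {"slope": slope, "intercept": intercept, "r": r, "r_squared": r * r}

# Hypothetical paired observations of two quantitative variables.
fit = linear_fit([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
```

An R-squared near 1 means the line explains most of the variation in the second variable; a value near 0 suggests the hypothesised linear relationship is weak, although correlation alone does not establish causation.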
Task 5: Show visual representation of your analysis (hint: use the right chart/graph for your data analysis). (1 mark)