09 Jan IT350M6-6: Explore non-relational database alternatives
Big Data is the term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management
tools or traditional data processing applications. The challenges include capture, curation, storage, search, sharing, transfer, analysis and
visualization. The trend to larger data sets is due to the additional information derivable from analysis of a single large set of related data, as
compared to separate smaller sets with the same total amount of data, allowing correlations to be found to delineate business trends, determine
quality of research, prevent diseases, link legal citations, combat crime, and determine real-time roadway traffic conditions.
Big Data analytics is a topic fraught with both positive and negative potential. Big Data is defined not just by the amount of information involved but
also by its variety and complexity, as well as the speed with which it must be analyzed or delivered. The amount of data being produced is already
enormous, and current developments suggest that the rate of production will only increase in the near future. Improved service should result as companies
better understand their customers, but it is also possible that this data will create privacy problems. Thus, Big Data is important not only to students
who hope to gain employment using these techniques and to those who plan to use it for legitimate research, but also to everyone who will be living
and working in the 21st Century.
Assessment Instructions
This competency assessment is divided into two tasks covering non-relational database facets. You will generate two separate report documents,
each addressing a specific task, for this assessment.
It is very important that you watch the Module 6 videos associated with SQL prior to completing the assessment. You will need to install and use
Microsoft SQL Server Express and Microsoft SQL Server Management Studio (SSMS) for this course. You can download the latest versions of
these free software products here:
Microsoft SQL Server Express
Microsoft SSMS
Navigate to the Academic Tools area of this Module and select Library, then Required Readings, to access your texts and videos.
Task 1 – Big Data Use Cases
Perform research on Big Data use cases via the Internet and the Purdue University Global library. Use the article at the following website as a starting
point for your research:
Big Data Use Cases
Select one use case from the list below to be the topic of your paper.
1. 360° View of the Customer
2. Fraud Prevention
3. Security Intelligence
4. Data Warehouse Offload
5. Price Optimization
6. Operational Efficiency
7. Recommendation Engines
8. Social Media Analysis and Response
9. Preventive Maintenance and Support
10. Internet-of-Things (IoT)
Write a 3-page expository paper, not including title page or references, that addresses the following:
Describe the use case and how it makes use of Big Data.
2022/01/07 14:38 Purdue University Global
https://purdueglobal.brightspace.com/d2l/le/content/198702/viewContent/13257482/View 2/3
Explain the V’s of Big Data within the context of your chosen use case.
Volume
Velocity
Variety
Veracity
Task 2 – Exploring the Hadoop Environment
You will download and install software products that will allow you to use and explore the Hadoop environment. You will then perform five specified
exercises with the installed environment.
Cloudera is a software company that provides a platform for data analytics, data warehousing, and machine learning. Initially, Cloudera started as
an open-source Apache Hadoop distribution project, commonly known as Cloudera Distribution for Hadoop or CDH. It contains Apache Hadoop and
other related projects, and all of its components are 100% open-source under the Apache License.
The Cloudera QuickStart virtual machine (VM) includes everything that you would need for using CDH, including Impala, Cloudera Search, and
Cloudera Manager. The Cloudera QuickStart VM uses a package-based install that allows you to work with or without the Cloudera Manager. It has
a sample of Cloudera’s platform for “Big Data.”
You are required to complete the subtasks listed below. Generate a Microsoft Word report incorporating the specified artifacts from the subtask
work.
Task 2.1 – Install Oracle VirtualBox and the Cloudera QuickStart VM
Install Oracle VirtualBox and the Cloudera QuickStart VM using the following guidance document:
Installation Instructions for the Cloudera Quickstart Virtual Machine
Take screen captures to prove that you accomplished the installation tasks. Incorporate the screen captures into your Microsoft Word assessment
document. Describe your experience with installing and operating these software programs via a minimum of two paragraphs of content.
Task 2.2 – Complete Tutorial Exercise 1
Complete Exercise 1 (pages 1-11) contained in the following tutorial document:
Cloudera Quickstart Beginner Tutorial
Take screen captures to prove that you completed this exercise. Incorporate the screen captures into your Microsoft Word assessment document.
Describe your experiences in completing this tutorial exercise via a minimum of two paragraphs of content.
Task 2.3 – Complete Tutorial Exercise 2
Complete Exercise 2 (pages 12-20) contained in the following tutorial document:
Cloudera Quickstart Beginner Tutorial
Take screen captures to prove that you completed this exercise. Incorporate the screen captures into your Microsoft Word assessment document.
Describe your experiences in completing this tutorial exercise via a minimum of two paragraphs of content.
Task 2.4 – Complete Tutorial Exercise 3
Complete Exercise 3 (pages 21-26) contained in the following tutorial document:
Cloudera Quickstart Beginner Tutorial
Take screen captures to prove that you completed this exercise. Incorporate the screen captures into your Microsoft Word assessment document.
Describe your experiences in completing this tutorial exercise via a minimum of two paragraphs of content.
Task 2.5 – Complete Tutorial Exercise 4
Complete Exercise 4 (pages 27-36) contained in the following tutorial document:
Cloudera Quickstart Beginner Tutorial
Take screen captures to prove that you completed this exercise. Incorporate the screen captures into your Microsoft Word assessment document.
Describe your experiences in completing this tutorial exercise via a minimum of two paragraphs of content.
The exercise entailed examination of log records, which indicated the occurrence of distributed denial-of-service (DDoS) attacks. Describe what
DDoS is and how it can be damaging to an organization via a minimum of one paragraph of content.
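As a starting point for thinking about how log records can point to a DDoS attack, the following is a minimal Python sketch, not the method used in the tutorial. It assumes web access logs in Common Log Format, and the sample lines, function names, and threshold are all hypothetical. It counts requests per source address and per minute, then flags sources at or above a chosen threshold. Because a real DDoS attack is distributed across many sources, per-address counts alone are a simplification; the per-minute totals give a second view of overall request volume.

```python
from collections import Counter

# Hypothetical access-log lines in Common Log Format; not data from the tutorial.
SAMPLE_LOG = [
    '203.0.113.5 - - [07/Jan/2022:14:38:01 +0000] "GET / HTTP/1.1" 200 512',
    '203.0.113.5 - - [07/Jan/2022:14:38:01 +0000] "GET / HTTP/1.1" 200 512',
    '198.51.100.7 - - [07/Jan/2022:14:38:02 +0000] "GET /cart HTTP/1.1" 200 734',
    '203.0.113.5 - - [07/Jan/2022:14:38:02 +0000] "GET / HTTP/1.1" 200 512',
    '192.0.2.10 - - [07/Jan/2022:14:39:10 +0000] "GET /about HTTP/1.1" 200 301',
    '203.0.113.5 - - [07/Jan/2022:14:39:11 +0000] "GET / HTTP/1.1" 200 512',
]


def requests_per_ip(lines):
    """Count requests by source address (the first field of each line)."""
    counts = Counter()
    for line in lines:
        fields = line.split()
        if fields:
            counts[fields[0]] += 1
    return counts


def requests_per_minute(lines):
    """Count requests by minute, using the bracketed timestamp field."""
    counts = Counter()
    for line in lines:
        fields = line.split()
        if len(fields) > 3 and fields[3].startswith("["):
            # "[07/Jan/2022:14:38:01" -> "07/Jan/2022:14:38"
            counts[fields[3][1:18]] += 1
    return counts


def flag_heavy_hitters(counts, threshold):
    """Return the sources with at least `threshold` requests, sorted."""
    return sorted(ip for ip, n in counts.items() if n >= threshold)


if __name__ == "__main__":
    per_ip = requests_per_ip(SAMPLE_LOG)
    print(flag_heavy_hitters(per_ip, threshold=3))
    print(requests_per_minute(SAMPLE_LOG).most_common())
```

In practice the threshold would depend on normal traffic levels for the site, so treat the value of 3 here as a placeholder for illustration only.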
Task 2.6 – Complete Tutorial Exercise 5
Complete Exercise 5 (pages 37-43) contained in the following tutorial document:
Cloudera Quickstart Beginner Tutorial
Take screen captures to prove that you completed this exercise. Incorporate the screen captures into your Microsoft Word assessment document.
Describe your experiences in completing this tutorial exercise via a minimum of two paragraphs of content. Also, provide the benefits of using data
visualizations like that established in this exercise via a minimum of one paragraph of content.