Data Science

FINAL PROJECT REQUIREMENTS: For the Data Science final project, students will work individually to analyze a problem in their field of interest using tools from the course. Address a data-related problem in your professional field or in a field you're interested in. Pick a subject that you're passionate about; if you're strongly interested in the subject matter, it'll be more fun for you and you'll probably produce a better project! (You can also choose a Kaggle competition.)

In the course of the project, we expect you to complete the following tasks:
1) Gather, preprocess, and visualize a dataset. What can you learn from a high-level analysis?
2) Apply modeling techniques (regression, recommendation, classification, etc.) and data analysis principles (cross-validation, caution against overfitting, etc.) and report your results.
3) Plan out how you would implement what you've done in (2) as a live system. Where would the data live? How would it be represented? How would end-users access it? How often would you have to redo your analysis?

You will need to vet your project with the instructional team to make sure the scope is suitable for this course.
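As an illustration of the cross-validation principle named in task (2), here is a minimal sketch using only the Python standard library. The data is synthetic and the helper names (`fit_line`, `k_fold_mse`) are hypothetical, chosen for this example; your project will likely use a library such as scikit-learn and a real dataset instead.

```python
import random
import statistics


def fit_line(xs, ys):
    """Ordinary least squares for one feature: returns (slope, intercept)."""
    mean_x, mean_y = statistics.fmean(xs), statistics.fmean(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def k_fold_mse(xs, ys, k=5, seed=0):
    """Return the held-out mean squared error for each of k folds."""
    indices = list(range(len(xs)))
    random.Random(seed).shuffle(indices)
    # Every index lands in exactly one fold.
    folds = [indices[i::k] for i in range(k)]
    scores = []
    for held_out in folds:
        held = set(held_out)
        train = [i for i in indices if i not in held]
        # Fit on the training folds only, then score on the held-out fold.
        slope, intercept = fit_line([xs[i] for i in train],
                                    [ys[i] for i in train])
        errors = [(ys[i] - (slope * xs[i] + intercept)) ** 2 for i in held_out]
        scores.append(statistics.fmean(errors))
    return scores


# Synthetic data (made up for illustration): y = 2x + 1 plus Gaussian noise.
rng = random.Random(42)
xs = [float(x) for x in range(50)]
ys = [2.0 * x + 1.0 + rng.gauss(0, 1.0) for x in xs]

fold_scores = k_fold_mse(xs, ys, k=5)
print("MSE per fold:", [round(s, 3) for s in fold_scores])
print("Mean held-out MSE:", round(statistics.fmean(fold_scores), 3))
```

A held-out error that is much larger than the error on the training data is one common sign of overfitting, which is why the error is reported on data the model did not see during fitting.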

OUTLINE (DUE JULY 17; PRESENT AND DISCUSS JULY 22):
• What problem are you solving?
• Description of the data set and how you will obtain it
• Hypothesis
• Statistical methods you plan to use and why
• What business applications do you think your findings will have?

PRESENTATIONS (LAST DAY OF CLASS): On the last day of class, all students are required to give a 5 to 7 minute presentation that summarizes their results. The presentations should target a non-technical audience and give students practice in the highly sought-after communication skills that data scientists need.

What to cover in the presentation:
• Overview of problem and hypothesis
• Overview of data
• Any visualizations or overview you created
• Modeling techniques used and why
• What decisions your findings allow you to make
• Your implementation plan (or any hurdles there would be)

GRADING:

EXCELLENT: Student's presentation is engaging, clear, and informative, describing the project, approach, and conclusions, and is suitable for a non-technical audience.

GOOD: Student's presentation is as above but falls short on one of the three criteria: engaging, clear, and informative.

FAIR: Student's presentation falls short on two of the three criteria: engaging, clear, and informative.

POOR: Student's presentation falls short on all three criteria or is off-topic with respect to his or her paper.

***Additional open-ended feedback will be provided to each student

PAPER (4-6 PAGES): Students are also required to submit a 4 to 6 page paper that describes the project's technical details. The paper should target a technical audience.

What to cover in the paper:
• Description of problem and hypothesis.
• Detailed description of your data set.
  o How did you decide what features to use in your analysis?
  o What challenges did you face in terms of obtaining and organizing the data?
  o What did you learn from the initial exploration phase?
• Describe the statistical methods you used, any others you considered but did not use, and how you decided what to use.
• What business applications do your findings have?
• Describe the implementation plan in detail, from the ingestion of data to how end-users would access it.

GRADING:

EXCELLENT: Student's paper demonstrates thorough understanding of statistical techniques, data management, and the application of these in programming, and is clearly communicated to a reasonably technical audience.

GOOD: Student's paper demonstrates the above knowledge, but lacks some necessary rigor, detail, and/or exploratory depth, or is not well communicated.

FAIR: Student's paper demonstrates some learning of principles taught in class, but is clearly lacking in rigor and/or depth.

POOR: Student's paper is incomplete or does not conclusively demonstrate understanding of statistics or programming.

***Additional open-ended feedback will be provided to each student

IMPORTANT DATES (deliverable: deadline):
• Outline of Project: July 17
• Meet with GA instructional team to discuss project idea: TBD
• Final Presentations/Paper: Paper due Aug 14; Presentations Aug 14 and Aug 17

The instructor and TAs will be checking in with you periodically to make sure you are making good progress on your projects. Please use office hours to obtain additional help.
