Corporate Learning at Scale: Lessons from a Large Online Course at Google Arthur Asuncion, Jac de Haan, Mehryar Mohri, Kayur Patel, Afshin Rostamizadeh, Umar Syed, Lauren Wong Google 76 9th Ave., New York, NY 10011 {arta, jacis, mohri, kayur, rostami, usyed, laurenbw}@google.com ABSTRACT

Google Research recently tested a massive online class model for an internal engineering education program, with machine learning as the topic, that blended theoretical concepts and Google-specific software tool tutorials. The goal of this training was to foster engineering capacity to leverage machine learning tools in future products. The course was delivered both synchronously and asynchronously, and students had the choice between studying independently or participating with a group. Since all students are company employees, unlike most publicly offered MOOCs we can continue to measure the students’ behavioral change long after the course is complete. This paper describes the course, outlines the available data set and presents directions for analysis. Author Keywords

MOOCs; connectivist MOOCs; corporate training; distance learning; online learning. ACM Classification Keywords

of these developments, it is important to determine what features of a MOOC, if any, lead to the greatest educational gains in a corporate setting. In the fourth quarter of 2013, Google Research produced a massive online class for company engineers around the world on the topic of machine learning with an emphasis on Google-specific machine learning software tools. We collected student-reported data about the course content and delivery method. As code written by Google engineers is stored in a central repository and all code execution is logged, we are able to directly measure the course’s effect on student usage of the technologies taught. This rich data set may answer several important questions about the effectiveness of MOOCs: Are students who attended live lectures more likely to apply what they learned in the course than students who watched recorded lectures? Do a student’s intentions to apply course content correlate with future actions? Does engagement with ancillary instructional channels (such as forums and office hours) have any impact on students’ post-MOOC behavior?

K.3.1. Computer Uses in Education: Distance learning COURSE DESCRIPTION INTRODUCTION

Massive Open Online Courses (MOOCs) have generated a great deal of excitement for their potential to make traditional university material accessible to a very wide audience. However, despite the growing popularity of MOOCs, there is considerable skepticism about their success and efficacy. A recent study showed that MOOC completion rates are very low, often in the single digits [2, 5], and some MOOC providers have shifted their focus to offering job training for corporations [1]. While there is an increasing focus on examining MOOC effectiveness in various contexts (see [3] for a recent example, and [4] for an excellent survey), relatively little analysis has been published in the corporate context. In light

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s). Copyright is held by the owner/author(s). L@S 2014, March 4–5 2014, Atlanta, Georgia, USA. ACM 978-1-4503-2669-8/14/03. http://dx.doi.org/10.1145/2556325.2567874

Approximately 6,500 students registered for an optional course on machine learning, representing a large fraction of all full-time employees distributed across more than 80 offices worldwide. The 10-week course was a hybrid of theory-based lectures and Google-specific implementations. Each class was devoted to a single machine learning topic and was divided into three components. First, an internal machine learning expert delivered a lecture on the theory behind the weekly topic. The lecture was followed by one or more case studies, where experts explained how the techniques taught in lecture had been applied to solve important problems at Google. Finally, time was spent answering student questions from around the world. At the conclusion of each week, students were directed to an optional programming assignment to gain handson experience using internal libraries and technologies to reinforce core concepts from the class. The course was offered in three formats. Each week for 10 weeks, a live video feed of the class was streamed to viewing rooms in many Google offices where students watched the content together in small groups and, in some cases, held dis-

cussions immediately after class. Students could alternately watch the live stream individually from their own computers. Finally, recordings of all 10 classes (as well as links to all slides, exercises, supplementary material and relevant external resources) were posted on the course website for asynchronous access. DATA AND PRELIMINARY RESULTS

Per employee consent, all student page impressions were captured with a timestamp, length of page visit and unique student identifier. Individual student data was also captured when assignments were opened and code executed. Most software written at Google is stored in a central repository, therefore this corpus can be crawled to locate files that reference machine learning function calls and algorithm libraries. Students were surveyed 3 times over the 10 weeks. A precourse survey was used to understand the prior level of experience with machine learning and students participation goals. A mid-class survey was used to gather feedback on class format and content pacing. A post-class survey was sent to collect student feedback for course improvements and understand how engineers plan to implement machine learning in future projects. Forty-six percent of post-class survey respondents are “planning to use machine learning as a result of this class,” and six percent report that they are already “using machine learning as a result of this class.” Of final survey respondents, 62% report having machine learning conversations with others (managers, teammates, other students) as a result of the class.

job title, level of education, etc) to answer our research questions: • Which predictors (variables) are most likely to result in a student’s transition from learning to implementation? – What type of employees are more likely to adopt machine learning after taking the class? – Is there a difference in post-course satisfaction ratings based on a student’s reported pre-class experience with machine learning? • Which course components were most successful? – Are students who attempted the interactive assignments more likely to implement machine learning? – As students begin using machine learning in real projects, does their perceived value of course components change over time? – Does the student-selected viewing method (independent or group setting, synchronous or asynchronous) have a measurable impact on course completion or future implementation of machine learning? • Viewing Google as a social network, what is the density, reachability, and centrality of this content throughout the organization? While the course served as a catalyst to build machine learning awareness, to raise visibility of leading experts and their work, and to foster dialogue across the company, the impact to engineering performance is yet to be determined. Rather than assessment of employee knowledge recall or recognition through online assessments, the use of machine learning concepts and tools in current and future products will become the measure of this course’s success. REFERENCES

1. Chafkin, M. Udacity’s Sebastian Thrun, Godfather of Free Online Education, Changes Course. Fast Company (December 2013). 2. Jordan, K. MOOC Completion Rates: The Data. http://www.katyjordan.com/MOOCproject.html.

Accessed: January 2, 2014. 3. Kizilcec, R. F., Piech, C., and Schneider, E. Deconstructing disengagement: Analyzing learner subpopulations in massive open online courses. In Proceedings of Third Conference on Learning Analytics and Knowledge (2013). Figure 1. Comparing pre- and post-class surveys. Students responded to the question, “What is your current level of experience with machine learning?”

4. Liyanagunawardena, T., Adams, A., and Williams, S. Moocs: A systematic study of the published literature 2008-2012. The International Review of Research in Open and Distance Learning 14, 3 (2013).

Future Research and Conclusion

5. Perna, L., Ruby, A., Boruch, R., Wang, N., Scull, J., Evans, C., and Ahmad, S. The Life Cycle of a Million MOOC Users. Presentation at the MOOC Research Initiative Conference, December 5, 2013.

Course designers will continue to collect and analyze longitudinal data over the coming year to assess course impact. Codebase references to machine learning files, individual satisfaction ratings and course participation data can be combined with employee-specific attributes (tenure, product area,

L@S 2014 Work-in-Progress Format - Research at Google

Mar 5, 2014 - The 10-week course was a hybrid of theory-based lectures ... Viewing Google as a social network, what is the density, reachability, and ...

156KB Sizes 2 Downloads 238 Views

Recommend Documents

SIGCHI Conference Proceedings Format - Research at Google
based dialpad input found on traditional phones, which dates ..... such as the Android Open Source Project (AOSP) keyboard. ...... Japan, 2010), 2242–2245. 6.

SIGCHI Conference Paper Format - Research at Google
for gesture typing is approximately 5–10% higher than for touch typing. This problem ..... dimensions of the Nexus 5 [16] Android keyboard. Since most of today's ...

SIGCHI Conference Proceedings Format - Research at Google
May 12, 2016 - ... the three most popular online contexts [39]: search engines, social networks ... number), online browsing history (the list of websites you vis- ited), online .... to rank, from the most to the least personally identifying, 10 type

SIGCHI Conference Paper Format - Research at Google
Murphy and Priebe [10] provide an ... adopted social media despite infrastructural challenges. .... social networking sites and rich communication systems such.

SIGCHI Conference Paper Format - Research at Google
the Google keyboard on Android corrects “thaml” to. “thank”, and completes ... A large amount of research [16, 8, 10] has been conducted to improve the qualities ...

SIGCHI Conference Proceedings Format - Research at Google
spectral illumination to allow a mobile device to identify the surfaces on which it is ..... relies on texture patterns from the image sensor, which add computational and .... each transparent surface naturally encodes an ID in the mate- rial's optic

SIGCHI Conference Paper Format - Research at Google
Exploring Video Streaming in Public Settings: Shared .... The closest known activity is the online documented use of. Google+ .... recurring themes in our data.

SIGCHI Conference Proceedings Format - Research at Google
Apr 23, 2015 - Author Keywords. Online communities; Online social networks; Google+ ..... controversial; ten people commented that they felt that dis- cussion topics ..... Sites like. Digg have demonstrated this diffuse attention can effectively.

SIGCHI Conference Paper Format - Research at Google
awkward to hold for long video calls. Video Chat ... O'Brien and Mueller [24] explored social jogging and found that a key ... moved from forests and parks to everyday urban centers. .... geocaches each within an hour time period. The area.

Syllabus - LS 534 Health Scienc - Research at Google
other good sources for finding library and/or medical blogs, login into your ..... April 24. UA SLIS. Class Period 13. Library Management Issues II: Hospitals.

linux format oct 2014.pdf
Sign in. Loading… Whoops! There was a problem loading more pages. Whoops! There was a problem previewing this document. Retrying... Download. Connect ...

arXiv:1405.0631v2 [cs.NI] 6 May 2014 - Research at Google
May 6, 2014 - For example, Figure 1 shows an example of a static sharing policy, in ... ages datacenter network bandwidth among services, in a flexible, and ...

arXiv:1412.1454v1 [cs.LG] 3 Dec 2014 - Research at Google
Dec 3, 2014 - {noam,jpeleman,ciprianchelba}@google.com. Abstract. We present a novel ... yet should scale to very large amounts of data as gracefully as ...

arXiv:1412.1820v1 [cs.CL] 3 Dec 2014 - Research at Google
Dec 3, 2014 - Most tagging systems only consider a small set of. 3-18 type labels .... The top level types were chosen to align with the most common type set used in traditional entity tagging ..... We split the gold data into a development set.

MobileHCI Conference Paper Format - Research Explorer
Jul 2, 2012 - numerous consumer mobile phone applications such as. Augmented Reality .... smartphone running Android 2.3. ..... CHI 2010, ACM Press,.

Mathematics at - Research at Google
Index. 1. How Google started. 2. PageRank. 3. Gallery of Mathematics. 4. Questions ... http://www.google.es/intl/es/about/corporate/company/history.html. ○.

LS High School Parental Consent and Zika Research 2016.pdf ...
concerns, before or after the blood drive, please call our Medical Help Desk. at 800.310.9556. A trained staff member will be happy to assist you. Sixteen.

Full text in pdf format - Inter Research
Jun 4, 2010 - World Bank (WB) identified the country as a primary source country for ..... of lorises by rangers, or reports in the community of recent use in ...

Faucet - Research at Google
infrastructure, allowing new network services and bug fixes to be rapidly and safely .... as shown in figure 1, realizing the benefits of SDN in that network without ...

BeyondCorp - Research at Google
41, NO. 1 www.usenix.org. BeyondCorp. Design to Deployment at Google ... internal networks and external networks to be completely untrusted, and ... the Trust Inferer, Device Inventory Service, Access Control Engine, Access Policy, Gate-.

VP8 - Research at Google
coding and parallel processing friendly data partitioning; section 8 .... 4. REFERENCE FRAMES. VP8 uses three types of reference frames for inter prediction: ...

JSWhiz - Research at Google
Feb 27, 2013 - and delete memory allocation API requiring matching calls. This situation is further ... process to find memory leaks in Section 3. In this section we ... bile devices, such as Chromebooks or mobile tablets, which typically have less .

Yiddish - Research at Google
translation system for these language pairs, although online dictionaries exist. ..... http://www.unesco.org/culture/ich/index.php?pg=00206. Haifeng Wang, Hua ...