“Best Dinner Ever!!”: Automatic Generation of Restaurant Reviews with LSTM-RNN Alberto Bartoli Andrea De Lorenzo Eric Medvet Dennis Morello Fabiano Tarlao Department of Engineering and Architecture University of Trieste Italy

October 16th, 2016 http://machinelearning.inginf.units.it

Motivation

Table of Contents

1

Motivation

2

The tool

3

Experimental evaluation

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

2 / 15

Motivation

Product/service commerce People buy products/services online/offline When choosing seller, they trust other people’s opinion (reviews)

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

3 / 15

Motivation

Product/service commerce People buy products/services online/offline When choosing seller, they trust other people’s opinion (reviews)

A malicious seller might want to manipulate the choice (opinion spamming) fabricating positive reviews for its products fabricating negative reviews for competitors products Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

3 / 15

Motivation

Review fabrication Can be done “manually”:

“$100–$400” to “write and post a total or 10 reviews”, among which “5 good reviews about our hotel” and “5 very bad reviews about another hotel”! Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

4 / 15

Motivation

Review fabrication: the next level

Can be done automatically by a tool? much cheaper (≈free) for the single malicious seller much larger problem for the online retailer (Amazon, TripAdvisor, . . . ) (maybe) harder problem for opinion spamming researchers

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

5 / 15

Motivation

Review fabrication: the next level

Can be done automatically by a tool? much cheaper (≈free) for the single malicious seller much larger problem for the online retailer (Amazon, TripAdvisor, . . . ) (maybe) harder problem for opinion spamming researchers

Is that tool feasible?

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

5 / 15

The tool

Table of Contents

1

Motivation

2

The tool

3

Experimental evaluation

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

6 / 15

The tool

Problem statement

Restaurant Rating s

Review generator

Review r

r should: appear as generated by humans appear specific for restaurant express an overall rating s (from F to FFFFF)

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

7 / 15

The tool

Input: what’s a restaurant? A set C of categories: e.g., Italian, Cafe, International, Mediterranean

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

8 / 15

The tool

Method overview C s r 1

2

3

4

Given an input C , s:

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

9 / 15

The tool

Method overview C s r 1

2

3

4

Given an input C , s: 1

generate many “human-like” reviews (NLG w/ LSTM-RNN)

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

9 / 15

The tool

Method overview C s r 1

2

3

4

Given an input C , s: 1

generate many “human-like” reviews (NLG w/ LSTM-RNN)

2

discard those not consistent with categories C (many binary classifiers)

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

9 / 15

The tool

Method overview C s r 1

2

3

4

Given an input C , s: 1

generate many “human-like” reviews (NLG w/ LSTM-RNN)

2

discard those not consistent with categories C (many binary classifiers)

3

discard those not consistent with rating s (one multiclass classifiers)

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

9 / 15

The tool

Method overview C s r 1

2

3

4

Given an input C , s: 1

generate many “human-like” reviews (NLG w/ LSTM-RNN)

2

discard those not consistent with categories C (many binary classifiers)

3

discard those not consistent with rating s (one multiclass classifiers)

4

select randomly one review r among remaining reviews Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

9 / 15

The tool

Generating human-like reviews Long Short-Term Memory Recurrent Neural Networks (LSTM-RNN) works at the character level char-rnn library with default settings (3 layers of 700 neurons) trained on a corpus of 500000 reviews (≈ 1 month) when generating, seed is a random sentence of a real review first generated review is discarded (influence of the seed)

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

10 / 15

Experimental evaluation

Table of Contents

1

Motivation

2

The tool

3

Experimental evaluation

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

11 / 15

Experimental evaluation

Aims

Is an artificial review considered genuine? (intrinsic evaluation) Can an artificial review influence the human subject? (extrinsic evaluation) Extrinsic performed first; 39 subjects involved, 3–4 forms each

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

12 / 15

Experimental evaluation

Extrinsic evaluation Simulates the restaurant choice by a user: each form with 3 reviews at least 1 artificial and 1 genuine

Uncle Sam’s Meat & Wine american, steakhouse Review with FFFFF The atmosphere was very cozy. With small seating areas the noise is minimized. The service was good. [. . . ] Useful? Y  N  Review with FF This place is dimly lit and reminded me of a bad prom decorating. The waitress was nice, but a little over [. . . ] Useful? Y  N  Review with FFFF Great food and even better atmosphere. It is a quiet darker setting with no windows. The service [. . . ] Useful? Y  N  Would you go to this restaurant? Y  N 

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

13 / 15

Experimental evaluation

Extrinsic evaluation Simulates the restaurant choice by a user: each form with 3 reviews at least 1 artificial and 1 genuine Genuine

Artificial

Going [%]

Not going [%]

≥FFF ≥FFF ≤FF ≤FF

≥FFF ≤FF ≥FFF ≤FF

47 71 24 23

53 29 76 77

Uncle Sam’s Meat & Wine american, steakhouse Review with FFFFF The atmosphere was very cozy. With small seating areas the noise is minimized. The service was good. [. . . ] Useful? Y  N  Review with FF This place is dimly lit and reminded me of a bad prom decorating. The waitress was nice, but a little over [. . . ] Useful? Y  N  Review with FFFF Great food and even better atmosphere. It is a quiet darker setting with no windows. The service [. . . ] Useful? Y  N  Would you go to this restaurant? Y  N 

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

13 / 15

Experimental evaluation

Extrinsic evaluation Simulates the restaurant choice by a user: each form with 3 reviews at least 1 artificial and 1 genuine Genuine

Artificial

Going [%]

Not going [%]

≥FFF ≥FFF ≤FF ≤FF

≥FFF ≤FF ≥FFF ≤FF

47 71 24 23

53 29 76 77

Uncle Sam’s Meat & Wine american, steakhouse Review with FFFFF The atmosphere was very cozy. With small seating areas the noise is minimized. The service was good. [. . . ] Useful? Y  N  Review with FF This place is dimly lit and reminded me of a bad prom decorating. The waitress was nice, but a little over [. . . ] Useful? Y  N  Review with FFFF Great food and even better atmosphere. It is a quiet darker setting with no windows. The service [. . . ] Useful? Y  N  Would you go to this restaurant? Y  N 

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

13 / 15

Experimental evaluation

Extrinsic evaluation Simulates the restaurant choice by a user: each form with 3 reviews at least 1 artificial and 1 genuine Genuine

Artificial

Going [%]

Not going [%]

≥FFF ≥FFF ≤FF ≤FF

≥FFF ≤FF ≥FFF ≤FF

47 71 24 23

53 29 76 77

Useful [%]

Not useful [%]

80 29

20 71

Genuine Artificial

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

Uncle Sam’s Meat & Wine american, steakhouse Review with FFFFF The atmosphere was very cozy. With small seating areas the noise is minimized. The service was good. [. . . ] Useful? Y  N  Review with FF This place is dimly lit and reminded me of a bad prom decorating. The waitress was nice, but a little over [. . . ] Useful? Y  N  Review with FFFF Great food and even better atmosphere. It is a quiet darker setting with no windows. The service [. . . ] Useful? Y  N  Would you go to this restaurant? Y  N 

October 16th, 2016

13 / 15

Experimental evaluation

Intrinsic evaluation Has this review been written by a human for this restaurant? forms with 5 reviews for each restaurant, 4 forms per user 4 kinds of reviews Rgs Rgd Rad Rad

genuine for specific restaurant genuine for different restaurant artificial for specific restaurant artificial for different restaurant (no step 2)

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

14 / 15

Experimental evaluation

Intrinsic evaluation Has this review been written by a human for this restaurant? forms with 5 reviews for each restaurant, 4 forms per user 4 kinds of reviews Rgs Rgd Rad Rad

genuine for specific restaurant genuine for different restaurant artificial for specific restaurant artificial for different restaurant (no step 2)

Rgs Rgd Ras Rad

Bartoli et al. (UniTs)

Yes [%]

No [%]

81 52 24 24

19 48 76 76

Generation of Restaurant Reviews

October 16th, 2016

14 / 15

Conclusions

Conclusions

Automatic Generation of Restaurant Review: “Is that tool feasible?” Yes! (we did it!)

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

15 / 15

Conclusions

Conclusions

Automatic Generation of Restaurant Review: “Is that tool feasible?” Yes! (we did it!) “Is an artificial review considered genuine?” ≈ 1 on 4 “Can an artificial review influence the human subject?” Unclear, deeper experiments needed

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

15 / 15

Conclusions

Conclusions

Automatic Generation of Restaurant Review: “Is that tool feasible?” Yes! (we did it!) “Is an artificial review considered genuine?” ≈ 1 on 4 “Can an artificial review influence the human subject?” Unclear, deeper experiments needed Machine generated reviews might become a real threat for (e-)commerce!

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

15 / 15

Conclusions

Thanks!

Bartoli et al. (UniTs)

Generation of Restaurant Reviews

October 16th, 2016

15 / 15

Best Dinner Ever!!'': Automatic Generation of Restaurant ...

Oct 16, 2016 - People buy products/services online/offline. When choosing seller, they trust other people's opinion (reviews). Bartoli et al. (UniTs). Generation ...

776KB Sizes 2 Downloads 223 Views

Recommend Documents

“Best Dinner Ever!!!”: Automatic Generation of ...
Although the services hosting product reviews do apply filters and procedures aimed at limiting the proliferation of false reviews, an attacker able to generate ...

Sovereign Dinner Menu Restaurant Week 8.19.16
Aug 19, 2016 - 20 / 25. CURRY white wine, madras curry, toasted garlic, apple, fennel ... follow us @thesovereigndc. GENERAL MANAGER zackary sHelton.

Automatic generation of synthetic sequential ...
M. D. Hutton is with the Department of Computer Science, University of. Toronto, Ontario M5S ... terization and generation efforts of [1] and [2] to the more dif- ficult problem of ..... for bounds on the fanin (in-degree) and fanout (out-degree) of

Automatic Generation of Scientific Paper Reviews
maximizing the expected reward using reinforcement learning. ..... Oh, A.H., Rudnicky, A.I.: Stochastic natural language generation for spoken dialog systems.

Automatic Generation of Scientific Paper Reviews
whose incentives may or may not actually drive the overall process toward those ideal goals. ... (c) conveys a recommendation specified as input. A tool that is ..... Toutanova, K., Klein, D., Manning, C.D., Singer, Y.: Feature-rich part-of-speech.

Automatic Generation of Release Notes
mining approaches together to address the problem of re- lease note generation, for ... ing data sets of the three evaluation studies. Paper structure. Section 2 ... To design ARENA, we performed an exploratory study aimed at understanding ...

Automatic Generation of Scientific Paper Reviews
paper_overly_honest_citation_slips_into_peer_reviewed_journal.html ... Oh, A.H., Rudnicky, A.I.: Stochastic natural language generation for spoken dialog.

Automatic generation of synthetic sequential ...
M. D. Hutton is with the Department of Computer Science, University of. Toronto ..... an interface to other forms of circuits (e.g., memory [20]) or to deal with ...

Automatic Navmesh Generation via Watershed ...
we do not necessarily need a convex area, just simpler area .... A Navigation Graph for Real-time Crowd Animation on Multilayered and Uneven Terrain.

Automatic Generation of Provably Correct Embedded ...
Scheduling. Model. Checking ... Model. Non-functional. Information. Counterexample. Software. C/C++ Code. Implementation ... e = queue.get() dispatch(e) e.

Towards Automatic Generation of Security-Centric ... - Semantic Scholar
Oct 16, 2015 - ically generate security-centric app descriptions, based on program analysis. We implement a prototype ... Unlike traditional desktop systems, Android provides end users with an opportunity to proactively ... perceive such differences

Automatic Generation of Efficient Codes from Mathematical ... - GitHub
Sep 22, 2016 - Programming language Formura. Domain specific language for stencil computaion. T. Muranushi et al. (RIKEN AICS). Formura. Sep 22, 2016.

Automatic Generation of Test-Cases Using Model ...
The methods and tools addressing model based test generation can be classified based on .... test generation meeting various structural and behavioral criteria.

Automatic generation of instruction sequences targeting ...
Testing a processor in native mode by executing instruc- tions from cache has been shown to be very effective in dis- covering defective chips. In previous work, we showed an efficient technique for generating instruction sequences tar- geting specif

Comparing SMT Methods for Automatic Generation of ...
In this paper, two methods based on statistical machine trans- lation (SMT) are ... Lecture Notes in Computer Science: Authors' Instructions pronunciations for ...

Automatic generation of research trails in web ... - Research at Google
Feb 10, 2010 - thematic exploration, though the theme may change slightly during the research ... add or rank results (e.g., [2, 10, 13]). Research trails are.

G4LTL-ST: Automatic Generation of PLC Programs
G4LTL-ST generates code in IEC 61131-3-compatible Structured. Text, which is ... Linear temporal logic specification with arithmetic constraints and a timer.

Automatic Generation of Regular Expressions from ... - Semantic Scholar
Jul 11, 2012 - ABSTRACT. We explore the practical feasibility of a system based on genetic programming (GP) for the automatic generation of regular expressions. The user describes the desired task by providing a set of labeled examples, in the form o

Automatic generation of instructions to robustly test ...
This led to broadside testing [5], where. ATPG spreads to two ... good small delay fault model, are actually paths. We use ... instruction based testing might be the best option in detecting .... The DATPG procedure deals with the circuit as a com-.

Robust Semi-Automatic Depth Map Generation in Unconstrained ...
Robust Semi-Automatic Depth Map Generation in ... ideo Sequences for 2D to Stereoscopic 3D Conversion.pdf. Robust Semi-Automatic Depth Map Generation ...