
Building Natural Language Generation Systems - Ehud Reiter and Robert Dale

Monday 25 April 2011 10:41

I want to write an agent that I can talk to and that can talk back to me. The talking-back part consists of turning some semantic representation (in predicate logic?) into a sentence that a human can read. Somewhere in the middle there should be a syntax tree, a hierarchical representation of the sentence. That was all I knew. So there was a large gap between the stack of propositions and the surface-level representation, and I had no clue where to start.

So I bought this book because I got the impression from Amazon reviews that it is the best book on NLG (natural language generation) available, even though it is ten years old. And I am glad I did, because the book takes you through the jungle that is called Natural Language Processing and even tells you how to build your house in it. I am calling it a jungle because there is way too much information available in this field, and the information does not seem to form a coherent body. There are many conceptual views, and each covers only part of the field. I have neither the intention nor the time to understand everything that has been produced.

The authors of the book (BNLGS) understand this, and they do a great job of keeping things as simple as possible. The book is about document generation, not discourse planning (having a conversation with someone), and the main flow of document generation is as follows:

Document Planning -> Microplanning -> Surface Realization

Document Planning takes as input a communicative goal and delivers as output a document plan. A communicative goal is a simple statement of what the document is trying to achieve. A document plan is a tree whose branches are rhetorical relations (or discourse relations) and whose leaves are messages. The document planner creates the structure of the text as a whole. There is a predefined set of messages that can be produced. What these messages look like is completely up to the application; there are no standards.
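To make this concrete, here is a minimal Python sketch of what such a document plan could look like. The message types (MonthlyTemperature and so on), the relation names, and the helper leaves() are all made up for illustration; they are not the book's notation, just one way to represent a tree with relations on the branches and messages on the leaves.

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass
class Message:
    """A leaf of the document plan; its fields are application-specific."""
    kind: str
    args: dict


@dataclass
class DocPlanNode:
    """An internal node: a rhetorical relation holding between its children."""
    relation: str
    children: List[Union["DocPlanNode", Message]] = field(default_factory=list)


def leaves(node):
    """Return the messages of a plan in left-to-right order."""
    if isinstance(node, Message):
        return [node]
    result = []
    for child in node.children:
        result.extend(leaves(child))
    return result


# Hypothetical weather-summary plan: one fact, followed by a contrast.
plan = DocPlanNode("Sequence", [
    Message("MonthlyTemperature", {"month": "April", "mean_c": 11}),
    DocPlanNode("Contrast", [
        Message("MonthlyRainfall", {"month": "April", "mm": 20}),
        Message("AverageRainfall", {"month": "April", "mm": 45}),
    ]),
])

for message in leaves(plan):
    print(message.kind, message.args)
```

The point of the tree is that the later stages can walk it: the relations say how the messages hang together, and the leaves say what has to be expressed.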

Microplanning takes this document plan as input and produces proto-phrase specifications. It does this by applying templates to the messages and then applying lexicalization, aggregation, and referring expression generation. A proto-phrase specification is not just a syntactic structure; it contains semantic information as well.
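As a rough illustration of those three steps, here is a toy microplanner in Python. Everything in it is an assumption of mine: the message types, the field names, and the very naive rules (one verb per message type, merging neighbours that share an agent, "a/an" on first mention and "the" afterwards). A real microplanner makes far more subtle choices.

```python
# Toy microplanner. All message types, field names, and rules are hypothetical.

LEXICON = {"Departure": "leave", "Arrival": "arrive"}  # message type -> verb


def lexicalize(message):
    """Lexicalization: pick a verb for the message, keeping its semantics."""
    kind, args = message
    return {"head": LEXICON[kind], "agent": args["train"],
            "time": args["time"], "semantics": kind}


def aggregate(specs):
    """Aggregation: adjacent specs with the same agent share one sentence."""
    groups = []
    for spec in specs:
        if groups and groups[-1][-1]["agent"] == spec["agent"]:
            groups[-1].append(spec)
        else:
            groups.append([spec])
    return groups


def refer(groups):
    """Referring expressions: indefinite on first mention, definite later."""
    seen = set()
    sentences = []
    for group in groups:
        entity = group[0]["agent"]
        if entity in seen:
            subject = "the " + entity
        else:
            article = "an" if entity[0] in "aeiou" else "a"
            subject = article + " " + entity
        seen.add(entity)
        sentences.append({"subject": subject, "clauses": group})
    return sentences


def microplan(messages):
    """Run the three steps in order on a flat list of messages."""
    return refer(aggregate([lexicalize(m) for m in messages]))


messages = [
    ("Departure", {"train": "express train", "time": "10:00"}),
    ("Arrival", {"train": "express train", "time": "12:30"}),
    ("Departure", {"train": "sleeper train", "time": "23:15"}),
    ("Departure", {"train": "express train", "time": "14:00"}),
]
sentence_specs = microplan(messages)
for spec in sentence_specs:
    print(spec["subject"], [clause["head"] for clause in spec["clauses"]])
```

Note that each specification still carries a "semantics" field next to the chosen verb, which is the sense in which it is more than a syntactic structure.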

Surface Realization takes these proto-phrase specifications as input and produces a sentence as output. Existing surface realizers accept several types of proto-phrase specifications as input, but mainly two: lexicalized case frames and abstract syntactic structures.
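To give an idea of what "realizing" a lexicalized case frame involves, here is a deliberately tiny Python sketch. It is not the interface of KPML, SURGE, or RealPro; the frame keys (head, agent, patient, time) and both functions are my own assumptions. It only handles present-tense sentences with a singular third-person agent, and it covers just the three jobs a realizer cannot avoid: inflecting the verb, putting the words in order, and fixing capitalization and punctuation.

```python
def third_person_singular(verb):
    """Inflect a regular English verb for third person singular present."""
    if verb.endswith(("s", "sh", "ch", "x", "z", "o")):
        return verb + "es"
    if len(verb) > 1 and verb.endswith("y") and verb[-2] not in "aeiou":
        return verb[:-1] + "ies"
    return verb + "s"


def realize(frame):
    """Turn a (hypothetical) lexicalized case frame into a sentence string.

    Assumes present tense and a singular third-person agent.
    """
    if frame.get("tense", "present") != "present":
        raise ValueError("this sketch only handles the present tense")
    words = [frame["agent"], third_person_singular(frame["head"])]
    if "patient" in frame:
        words.append(frame["patient"])
    if "time" in frame:
        words.extend(["at", frame["time"]])
    sentence = " ".join(words)
    return sentence[0].upper() + sentence[1:] + "."


frame = {"head": "leave", "agent": "the express train", "time": "10:00"}
print(realize(frame))
```

Even this toy shows why the existing realizers are large systems: irregular verbs, agreement, questions, passives, and subordinate clauses all need rules of their own.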

The only drawback of the book is that it completely skips the implementation of the surface realizer. The authors' reason is that it is not smart to build one from scratch, because several advanced ones exist, notably KPML, SURGE, and RealPro. I have to disagree with them, because these realizers have a "non-trivial learning curve" and require you to buy into a particular conceptual framework.

Natural Language Generation applications often use very specific types of grammars. The important ones are Systemic Functional Grammar (KPML), Functional Unification Grammar (SURGE), and Meaning-Text Theory (RealPro). These are different from the ones often used for Natural Language Understanding, because NLG is about choice management and NLU is about hypothesis management. There are many ways to express the same meaning, and these grammars deal with the choices to be made better than, say, HPSG.

So what exactly was missing in my idea of language generation before I read this book? Mainly that you can't go for a system that tries to generate just any sentence. Choose your domain and create some domain-specific rules and structures. This is what keeps it manageable. And keep semantics involved as long as possible. After all, you are trying to get your meaning across.

In conclusion: I love this book. It gave me exactly what I needed, some structure in this complicated field, and it deals with both theory and its practical application. It is very accessible too.

Labels
books
nlp
