Dr ADVAITH SIDDHARTHAN


Dr ADVAITH SIDDHARTHAN The University of Aberdeen Natural & Computing Sciences Dr ADVAITH SIDDHARTHAN Senior Lecturer work +44 (0)1224 272282 http://homepages.abdn.ac.uk/advaith/pages pref 218 Meston

Senior Lecturer

Dr ADVAITH SIDDHARTHAN

Personal Details

Telephone: +44 (0)1224 272282
Email:

advaith@abdn.ac.uk

Personal website: http://homepages.abdn.ac.uk/advaith/pages
Address: 218 Meston
hCard

Jump to:

Research Interests

 My research area is Computational Linguistics, and my research focus is on making information more accessible by exploiting results in Linguistics and Computer Science, including Information Retrieval and Machine Learning. My main interests are Natural Language Regeneration, where existing texts are automatically simplified and summarised to make the content more accessible, and Natural Language Generation, particularly generating engaging narratives from data (check out the  Blogging Birds). My recent research also covers information extraction from scientific literature, citation analysis, multi-document and multilingual news summarization, text simplification, open-domain referring expression generation, anaphora resolution and the semantic annotation of multilingual corpora.


^ top

Research Grants

2012-2018. ``The Northern Temperament'' (with Isobel Cameron, Peter Davidson, Arnar Arnason, John Crawford, Thomas McKean, Ian Reid, Ian Russell and Justin Williams). 72 month project funded through the University of Aberdeen's research theme "The North".

2013-2014. ``Lexico-syntactic text simplification for improving information access'' 15 month EPSRC funded project (Grant Number EP/J018805/1).

2012-2015. ``An investigation of how patients, public and stakeholders perceive and interpret information about anti-depressants in UK newspapers.'' CSO Doctoral Fellowship awarded to Nooreen Akhtar (supervised by Isobel Cameron, Barbara Fennel, Margaret Maxwell and Advaith Siddharthan)

2013-2016. ``Developing Digital Tools for Citizen Science: Towards New Ways of Learning about the Natural Environment'' (Advaith Siddharthan, Rene van der Wal and Laura Colucci-Gray). Funded through the University of Aberdeen's research theme "Environment and Food Security".

2010-2013. ``Digital Conservation'' ( with Rene Van Der Wal and Chris Mellish). 36 month Digital Economies Research Hub project funded by RCUK.

2009-2010. ``Studying the appropriateness of different formulations of a discourse relation in context.'' (with Napoleon Katsos). 12 month ESRC funded project (Grant Number RES-000-22-3272).   

 

 

 


^ top

Teaching Responsibilities

  1. CS5050 / CS3017 Adaptive Interactive Systems(100%)
  2. CS4025 / CS5057 Natural Language Processing (50%)

^ top

External Responsibilities

Editorial Board of Computational Linguistics (MIT Press)


^ top

Admin Responsibilities

  1. Programme Coordinator for M.Sc. in Information Technology
  2. Advisor for Levels 1-4
  3. Tutor for Levels 1-4

^ top

Publications 

 

Contributions to Journals

Articles

  • Siddharthan, A., Nenkova, A. & McKeown, K. (2011). 'Information Status Distinctions and Referring Expressions: An Empirical Study of References to People in News Summaries'. Computational Linguistics, vol 37, no. 4, pp. 811-842., http://dx.doi.org/10.1162/COLI_a_00077
    [Online] DOI: 10.1162/COLI_a_00077
    [Link] http://www.mitpressjournals.org/doi/pdf/10.1162/COLI_a_00077
  • Dorr, BJ., Passonneau, RJ., Farwell, D., Green, R., Habash, N., Helmreich, S., Hovy, E., Levin, L., Miller, KJ., Mitamura, T., Rambow, O. & Siddharthan, A. (2010). 'Interlingual Annotation of Parallel Text Corpora: a new framework for annotation and evaluation'. Natural Language Engineering, vol 16, no. 3, pp. 197-243., http://dx.doi.org/10.1017/S1351324910000070
    [Online] DOI: 10.1017/S1351324910000070
  • Siddharthan, A. (2006). 'Syntactic Simplification and Text Cohesion'. Research on Language and Computation, vol 4, no. 1, pp. 77-109., http://dx.doi.org/10.1007/s11168-006-9011-1
    [Online] DOI: 10.1007/s11168-006-9011-1

Chapters in Books, Reports and Conference Proceedings

Chapters

  • Briscoe, T., Harrison, K., Naish-Guzman, A., Parker, A., Rei, M., Siddharthan, A., Sinclair, D., Slater, M. & Watson, R. (2011). 'Intelligent Information Access from Scientific Papers'. in K Mayer & J Tait (eds), Current Challenges in Patent Information Retrieval. Springer, pp. 329-342., http://dx.doi.org/10.1007/978-3-642-19231-9_16
    [Online] DOI: 10.1007/978-3-642-19231-9_16
    [Link] http://www.springer.com/computer/database+management+%26+information+retrieval/book/978-3-642-19230-2
  • Farwell, D., Dorr, B., Green, R., Habash, N., Helmreich, S., Hovy, E., Levin, L., Miller, K., Mitamura, T., Rambow, O., Reeder, F. & Siddharthan, A. (2009). 'Interlingual annotation of multilingual text corpora and FrameNet'. in HC Boas (ed.), Multilingual FrameNets in Computational Lexicography: Methods and Applications. Moutin de Gruyter, Berlin, pp. 287-318.

Conference Proceedings

  • Ponnamperuma, K., Siddharthan, A., Zeng, C., Mellish, CS. & van der Wal, R. (2013). 'Tag2Blog: Narrative Generation from Satellite Tag Data'. in Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics: System Demonstrations. Association for Computational Linguistics, Sofia, Bulgaria, pp. 169-174.
    [Link] http://www.aclweb.org/anthology/P13-4029
  • Blake, S., Siddharthan, A., Nguyen, H., Sharma, N., Robinson, A-M, O'Mahony, E., Darvill, B., Mellish, CS. & Van Der Wal, R. (2012). 'Natural Language Generation for Nature Conservation: Automating Feedback to help Volunteers identify Bumblebee Species'. in M Kay & C Boitet (eds), COLING 2012 :24th International Conference on Computational Linguistics : Proceedings of COLING 2012 : Technical Papers. The COLING 2012 Organizing Committee, Mumbai, pp. 311-324.
    [Online] AURA: C12_1020.pdf
    [Link] http://aclweb.org/anthology/C/C12/
  • Siddharthan, A. & Katsos, N. (2012). 'Offline Sentence Processing Measures for testing Readability with Users'. in Proceedings of the NAACL 2012 Workshop on Predicting and Improving Text Readability (PITR 2012). ACL, pp. 17-24.
    [Link] http://www.aclweb.org/anthology/W/W12/W12-2203.pdf
  • Siddharthan, A., Green, MJ., van Deemter, K., Mellish, CS. & Van Der Wal, R. (2012). 'Blogging birds: Generating narratives about reintroduced species to promote public engagement'. in Proceedings of the 7th International Natural Language Generation Conference (INLG 2012). ACL Anthology.
    [Online] AURA: inlgKitesShort.pdf
    [Link] http://homepages.abdn.ac.uk/advaith/pages/inlgKitesShort.pdf
  • Siddharthan, A. (2011). 'Text Simplification using Typed Dependencies: A Comparision of the Robustness of Different Generation Strategies'. in Proceedings of the 13th European Workshop on Natural Language Generation. 13th European Workshop on Natural Language Generation, Nancy, France, 28-30 September.
  • Walker, A., Siddharthan, A. & Starkey, A. (2011). 'Investigation into Human Preference between Common and Unambiguous Lexical Substitutions'. in Proceedings of the 13th European Workshop on Natural Language Generation. 13th European Workshop on Natural Language Generation, Nancy, France, 28-30 September.
  • Briscoe, T., Harrison, K., Naish-Guzman, A., Parker, A., Siddharthan, A., Sinclair, D., Slater, M. & Watson, R. (2010). 'Camtology: Intelligent Information Access for Science'. in Proceedings of the NAACL HLT 2010: Demonstration Session. ACL, Los Angeles, CA, Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics, Los Angeles, California, United States, 1-6 June.
    [Link] http://www.aclweb.org/anthology/N/N10/N10-2001.pdf
  • Siddharthan, A. & Katsos, N. (2010). 'Reformulating Discourse Connectives for Non-Expert Readers'. in Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the ACL, (NAACL-HLT 2010). ACL, Los Angeles, CA, USA, pp. 1002–1010, Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics, Los Angeles, California, United States, 1-6 June.
    [Link] http://www.aclweb.org/anthology/N/N10/N10-1144.pdf
  • Liakata, M., Teufel, S., Siddharthan, A. & Batchelor, C. (2010). 'Corpora for the conceptualisation and zoning of scientific papers'. in Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC'2010). ELDA, Paris, France, pp. 2054-2061, 7th International Conference on Language Resources and Evaluation (LREC'2010), Valletta, Malta, 17-23 May.
    [Link] http://www.lrec-conf.org/proceedings/lrec2010/pdf/644_Paper.pdf
  • Siddharthan, A. (2010). 'Complex lexico-syntactic reformulation of sentences using typed dependency representations'. in Proceedings of the 6th International Natural Language Generation Conference (INLG 2010). ACL, Dublin, Ireland, pp. 125-134, 6th International Natural Language Generation Conference (INLG 2010), Trim, County Meath, Ireland, 7-9 July.
    [Link] http://www.aclweb.org/anthology/W/W10/W10-4213.pdf
  • Teufel, S., Siddharthan, A. & Batchelor, C. (2009). 'Towards Domain-Independent Argumentative Zoning: Evidence from Chemistry and Computational Linguistics'. in Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing (EMNLP'09). ACL, Suntec, Singapore., pp. 1493–1502, 2009 Conference on Empirical Methods in Natural Language Processing (EMNLP'09), Singapore, 6-7 August.
    [Link] http://www.aclweb.org/anthology/D/D09/D09-1155.pdf
  • Siddharthan, A. & Copestake, A. (2008). 'Generating research websites using summarisation techniques'. in Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies : Demo Session. ACL, Morristown, NJ, USA, pp. 5-8, The 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL HLT 2008), Colombus, Ohio, United States, 15-20 June.
    [Link] http://delivery.acm.org/10.1145/1570000/1564146/p5-siddharthan.pdf?key1=1564146&key2=0317728821&coll=GUIDE&dl=GUIDE&CFID=107639502&CFTOKEN=61055457
  • Rupp, CJ., Copestake, A., Corbett, P., Murray-Rust, P., Siddharthan, A., Teufel, S. & Waldron, B. (2008). 'Language Resources and Chemical Informatics'. in Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008). ELDA, Paris, France, pp. 2196-2200, 6th International Conference on Language Resources and Evaluation (LREC'2008), Marrakesh, Morocco, 28-30 May.
    [Link] http://www.lrec-conf.org/proceedings/lrec2008/pdf/556_paper.pdf
  • Siddharthan, A. & Teufel, S. (2007). 'Whose idea was this, and why does it matter?: Attributing scientific work to citations'. in Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007). ACL, Morristown, NJ, USA, pp. 316-323, Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007), Rochester, New York, United States, 22-27 April.
    [Link] http://www.aclweb.org/anthology/N/N07/N07-1040.pdf
  • Siddharthan, A. & Teufel, S. (2007). 'Whose idea was this? Deciding attribution in scientific literature'. in Proceedings of the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'07). 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'07), Logos, Portugal, 29-30 March.
  • Rambow, O., Dorr, B., Farwell, D., Green, R., Habash, N., Helmreich, S., Hovy, E., Levin, L., Miller, KJ., Mitamura, T., Reeder, F. & Siddharthan, A. (2006). 'Parallel Syntactic Annotation of Multiple Languages'. in Proceedings of the International conference on Language Resources and Evaluation (LREC 2006). ELDA, Paris, France, 5th International Conference on Language Resources and Evaluation (LREC'06), Genoa, Italy, 22-28 May.
  • Teufel, S., Siddharthan, A. & Tidhar, D. (2006). 'An annotation scheme for citation function'. in Proceedings of the 7th SIGdial Workshop on Discourse and Dialogue. ACL, Sydney, Australia, pp. 80-87, 7th SIGdial Workshop on Discourse and Dialogue, Sydney, Australia, 1 July.
    [Link] http://www.aclweb.org/anthology-new/W/W06/W06-1312.pdf
  • Teufel, S., Siddharthan, A. & Tidhar, D. (2006). 'Automatic classification of citation function'. in Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP'06): Discourse. ACL, Morristown, NJ, USA, pp. 103-110, 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP 2006), Sydney, Australia, 22-23 July.
    [Link] http://delivery.acm.org/10.1145/1620000/1610091/p103-teufel.pdf?key1=1610091&key2=2405538821&coll=GUIDE&dl=GUIDE&CFID=110846368&CFTOKEN=12757432
  • Copestake, A., Corbett, P., Murray-Rust, P., Rupp, CJ., Siddharthan, A., Teufel, S. & Waldron, B. (2006). 'An Architecture for Language Processing for Scientific Texts'. in Proceedings of the UK e-Science Programme All Hands Meeting 2006 (AHM2006). National e-Science Centre, UK e-Science All Hands Meeting 2006, Nottingham, United Kingdom, 18-21 September.
    [Link] http://www.allhands.org.uk/2006/proceedings/papers/689.pdf
  • Nenkova, A., Siddharthan, A. & McKeown, K. (2005). 'Automatically learning cognitive status for multi-document summarization of newswire'. in Proceedings of Conference on Human Language Technology/Empirical Methods in Natural Language Processing(HLT/EMNLP). Vancouver, Canada, Human Language Technology Conference(HLT), Conference on Empirical Methods in Natural Language Processing(EMNLP), Vancouver, Canada, 6-8 October.
    [Link] http://userweb.cs.utexas.edu/~ml/HLT-EMNLP05/
  • Siddharthan, A. & McKeown, K. (2005). 'Improving Multilingual Summarization: Using Redundancy in the Input to Correct MT errors'. in Conference on Human Language Technology Conference / Empirical Methods in Natural Language Processing(HLT-EMNLP). Vancouver, Canada, Human Language Technology Conference(HLT), Conference on Empirical Methods in Natural Language Processing(EMNLP), Vancouver, Canada, 6-8 October.
    [Link] http://userweb.cs.utexas.edu/~ml/HLT-EMNLP05/
  • Siddharthan, A. & Copestake, A. (2004). 'Generating Referring Expressions in Open Domains'. in Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics (ACL). Barcelona, Spain, 42nd Annual Meeting on Association for Computational Linguistics, Barcelona, Spain, 21-26 July.
    [Link] http://aclweb.org/anthology/P/P04/P04-1052.pdf
  • Farwell, D., Helmreich, S., Reed, F., Dorr, B., Habash, N., Hovy, E., Levin, L., Miller, K., Mitamura, T., Rambow, O. & Siddharthan, A. (2004). 'Interlingual Annotation of Multilingual Text Corpora'. in Workshop on Frontiers in Corpus Annotation (NAACL/HLT). Boston, MA, Workshop on Frontiers in Corpus Annotation, NAACL/HLT 2004, Boston, United States, 2-7 May.
    [Link] http://www.aclweb.org/anthology/W/W04/W04-2709.pdf
  • Siddharthan, A., Nenkova, A. & McKeown, K. (2004). 'Syntactic Simplification for Improving Content Selection in Multi-Document Summarization'. in Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004). Geneva, Switzerland, 20th International Conference on Computational Linguistics (COLING 2004), Geneva, Switzerland, 23-27 August.
    [Link] http://aclweb.org/anthology/C/C04/C04-1129.pdf
  • Siddharthan, A. (2003). 'Resolving Pronouns Robustly: Plumbing the Depths of Shallowness'. in Proceedings of the Workshop on Computational Treatments of Anaphora, 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL'03). Budapest, Hungary, 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL'03), Budapest, Hungary, 12-17 April.
    [Link] http://www.csd.abdn.ac.uk/~kvdeemte//siddharthan.pdf
  • Siddharthan, A. (2003). 'Preserving Discourse Structure when Simplifying Text'. in Proceedings of the European Natural Language Generation Workshop (ENLG), 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL'03). Budapest, Hungary, 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL'03), Budapest, Hungary, 12-17 April.
    [Link] http://www.csd.abdn.ac.uk/~advaith/discourse_EACL03.pdf
  • Siddharthan, A. (2002). 'An Architecture for a Text Simplification System'. in Proceedings of the Language Engineering Conference (LEC'02): Hyderabad, India, December 13-December 15, 2002. IEEE Computer Society, London, United Kingdom, pp. 64, Language Engineering Conference (LEC'02), Hyderabad, India, 13-15 December., http://dx.doi.org/http://doi.ieeecomputersociety.org/10.1109/LEC.2002.1182292
    [Online] DOI: http://doi.ieeecomputersociety.org/10.1109/LEC.2002.1182292
  • Siddharthan, A. & Copestake, A. (2002). 'Generating Anaphora for Simplifying Text'. in Proceedings of the 4th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'02). Lisbon, Portugal, 4th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'02), Lisbon, Portugal, 18-20 September.

Contributions to Conferences

Papers

  • Siddharthan, A., Mellish, CS., Nguyen, H., Zeng, C., Pearce, I. & Van Der Wal, R. (2011). 'Tell me a story about the birds and the bees: Using NLG to foster public engagement in nature conservation projects'. Paper presented at Digital Engagement 2011, Newcastle, United Kingdom, 15/11/11,.
  • Siddharthan, A. & Evans, D. (2005). 'Columbia University at MSE 2005'. Paper presented at Multilingual Summarization Evaluation Workshop, Michigan, United States, 29/06/05 - 1/07/05,.
    [Link] http://www.cs.columbia.edu/nlp/papers/2005/siddharthan_evans_05.pdf
  • Blair-Goldensohn, S., Evans, D., Hatzivassiloglou, V., McKeown, K., Nenkova, A., Passonneau, R., Schiffman, B., Schlaikjer, A., Siddharthan, A. & Siegelman, S. (2004). 'Columbia University at DUC 2004'.
    [Link] http://www.cs.columbia.edu/nlp/papers/2004/blair-goldensohn_al_04a.pdf
  • Siddharthan, A. (2002). 'Resolving Attachment and Clause Boundary Ambiguities for Simplifying Relative Clause Constructs'. Paper presented at 40th Meeting of the Association for Computational Linguistics (ACL'02), Philadelphia, PA, United States, 6/07/02 - 12/07/02,.

Books and Reports

Other Reports

^ top

update | about Staff Pages

back