Annotated Bibliography: Corpus Linguistics

  1. Corpus Lings: Introductions and Overviews
    1. Biber, Douglas, Conrad, Susan, and Reppen, Randi. Corpus Linguistics: Investigating Language Structure and Use.CUP, 1998. [Summary]
    2. McEnery, Tony, and Wilson, Andrew. Corpus Linguistics, 2nd ed. Edinburgh University Press, 2001.

      Excellent intro on CL in relation to Chomskyan rev. Good on POS tagging and parsing, and explanation of ENGCG. No interest in literary topics that I can see.

    3. Kennedy, Graeme. An Introduction to Corpus Linguistics. Longman, 1998.

      Comprehensive and detailed about major corpora, their makeup and use; many lists and tables, many studies cited. Also covers POS tagging and parsing; some on tools. Impressed with ENGCG.

    4. Aston, Guy and Burnard, Lou. The BNC handbook: exploring the British National Corpus with SARA. Edinburgh University Press, 1998.

      Although tied to a specific corpus query tool and corpus, this is a wonderfully detailed and clear walk-through of how to raise and answer questions with a corpus. (Copy also available from me.)

    5. Hoey, Michael. Lexical Priming: a new theory of words and language. Routledge, 2005.

      Argues that collocation is a psychological relation between words

    6. Partington, Alan. Patterns and Meanings, 1998. Amsterdam: Benjamins, 1998.

      Fine little introduction; works with corpus of business English

    7. Sinclair, John. Corpus, Concordance, Collocation Oxford UP, 1991.

      Excellent, lucid, non-technical introduction to the key ideas. Good examples. Often cited in Stubbs, which is in many ways an update and expansion.

    8. _______________. "The lexical item." In E. Weigand, editor, Contrastive Lexical Semantics. Amsterdam: Benjamins, 1998, 1-24.
    9. _______________. Trust the text: Language, corpus, and discourse. Routledge, 2004.
    10. Stubbs, Michael. Words and phrases: Corpus studies of lexical semantics. Blackwell, 2001.
    11. Wynne, Martin, ed. Developing Linguistic Corpora: a Guide to Good Practice. Oxford: Oxbow Books.[Here]

      Useful practical guidance; pieces by Sinclair, Leech, Burnard, and others

  2. Collections
    1. Ädel, Annelie and Randi Reppen, eds. Corpora and Discourse: The challenges of different settings. John Benjamins, 2008.
    2. Baker, Mona, Gill Francis, and Elena Tognini-Bonelli, eds., Text and Technology: In honour of John Sinclair John Benjamins, 1993
    3. Connor, Ulla and Thomas Upton, Discourse in the Professions. John Benjamins, 2004.
    4. Hundt, Marianne, Nadja Nesselhauf, and Carolin Biewer, eds., Corpus Linguisics on the Web. Amsterdam: Rodopi, 2007.
    5. Jucker, Andreas H., Daniel Schreier, and Marianne Hundt, eds. Corpora: Pragmatics and Discourse Rodopi (2009)--ebook. [Abstract]
    6. Peters, Pam, Collins, Peter, and Smith, Adam. New Frontiers in Corpus Research. Conference papers from 21st ICELRCC, Sydney, 2000. Rodopi, 2002. [Summary]
    7. Fairon, Cédrick, Hubert Naets, Adam Kilgarriff, Gilles-Maurice de Schryver, eds., Building and Exploring Web Corpora. UCL Presses Universitaires de Louvain, 2007.

      Papers from WebAsCorpus (WAC3) and CLEANEVAL on removing boilerplate from Web HTML pages. Pieces by Wm. Fletcher and Serge Sharoff.

    8. Renouf, Antoinette and Andrew Kehoe, eds. The Changing Face of Corpus Linguistics. Rodopi, 2006.
    9. Reppen, Randi, Susan M. Fitzmaurice, and Douglas Biber, eds. Using Corpora to Explore Linguistic Variation. Benjamins, 2002.

  3. About particular corpora
    1. Nelson, Gerald, Wallis, Sean, and Aarts, Bas. Exploring Natural Language: Working with the British Component of the International Corpus of English John Benjamins Publishing, 2002.

      All you would ever want to know about ICECUP and the grammatical markup.

    2. Grant, Lynn E. "Frequency of 'core idioms' in the British National Corpus (BNC)." IJCL 10:4 (2005): 429-451. here. [Abstract]
    3. Greenbaum, Sidney, ed. Comparing English Worldwide: The International Corpus of English Oxford: Clarendon Press, 1996

      More programmatic, less technical than Nelson et al. 19 articles

    4. Hoffmann, Sebastian, Stefan Ever, Hicholas Smith, David Lee, and Ylva Berglund Prytz, Corpus Linguistics with BNCweb--a Practical Guide. Peter Lange:2008.

      An excellent guide to this very rich interface to BNC (which is available on line), including its special 'Simple' query languge.

    5. Lee, David. Genres, Registers, Text Types, Domains, and Styles: Clarifying the Concepts and Navigating a Path Through The BNC Jungle.Language Learning and Technology, 5:3 pp. 37-76.

      Fine explanation of why he built his indexer. Applies Gerard Steen's notion of genre as a basic level term. Not just about BNC.

    6. Steen, Gerard. "Genres of discourse and the definition of literature." Discourse Processes 28 (1999): 109-120.

      annotate me

    7. Tribble, Christopher. Writing Difficult Texts c. 6 "Lexical Dimensions". Ph.D diss. Lancaster University, 1998. On line at his site (prints 43 pages)

      Uses "PP", a field and genre-specific corpus of business writing (proposals). Chapter runs gamut of lexical issues using WordSmith, and esp. likes Keywords. A model of analysis.

  4. Statistics
    1. Gries, Stefan Th. "Exploring variability within and between corpora: some methodological considerations," Corpora 1:2 (2006): 109-151. [Abstract]
    2. Rayson, Paul and Roger Garside. "Comparing Corpora using frequency profiling," [Here] [Abstract]
  5. Particular terms and constructions in discourse
    1. Tognini-Bonelli, Elena. "Interpretative Nodes in Discourse: Actual and Actually. in Baker et al. 193-212.
    2. Kennedy, Graeme. "An under-exploited resource: using the BNC for exploring the nature of language learning." in Hundt et al. (2007): 151-166.

      on lose/find, amplifiers/downtowners in BNC

    3. (And see the articles in
      Rohdenburg, Günter and Julia Schlüter, eds. One Language, Two Grammars? CUP (2009) PE1072 O54 2009.
      and in
      Rohdenburg, Günter and Britta Mondorf, eds. Determinants of Grammatical Variation in English. Mouton/de Gruyter, 2003.
    4. Mautner, Gerlinde. "Mining large corpora for social information: The case of elderly," in Language in Society 36:51-72. [Abstract]
  6. The Web as Corpus
    1. Biber, Douglas and Jerry Kurjian. Towards a taxonomy of web registers and text types: a multi-dimensional analysis. In Hundt et al. 109-32.

      Discusses 3 ways to identify genres of web documents: external description, text language traits, and a third, best way, which involves hand culling of the texts. Concern is with top-level categories. Google's subject categories ot based on coherent linguistic features; cluster analysis of features pops out four 'text-types':

      • Personal, involved narrative (1988--narratives not personal in 88 corpus)
      • Persuasive/argumentative discourse (1988: Dim 4)
      • Addressee-focused discourse (no direct Dim--often advice-giving function)
      • Abstract/technical discourse (1988--Dim 1 (neg pole)
    2. Claridge, Claudia. "Constructing a corpus from the web: message boards," in Hundt et al (2007): 87-108.

      Need for a corpus of on-line forums. Procedures for setting one up. Issues of locality and identity. Pilot study on interaction and stance markers and differences in dif. localities.

    3. Fletcher, William. "Making the Web More Useful as a Source for Linguistic Corpora." (to appear). At
    4. ______________."Implementing a BNC-Compare-able Web Corpus." in Fairon et al. 43-56.

      Much about his design and plans for WebAsCorpus

    5. ______________. "Concordancing the web: promise and problems, tools and techniques," in Hundt et al. (2007): 25-45.

      about using web as corpus; types of search options, explanation of KWIKFinder program

    6. Hundt, Marianne and Carolin Biewer. "The dynamics of inner and outer circle varieties in the South Pacific and East Asia," in Hundt et al. (2007):249-69.

      Using on-line regional newspapers to monitor ongoing changes and dynamics of standardizing.

    7. Hoffman, Sebastian. "From webpage to mega-corpus: the CNN transcripts," in Hundt et al (2007)69-86.
    8. Ide, Nancy, Reppen, Randi, and Suderman, Keith. (2002) "The American National Corpus: More Than the Web Can Provide." Proceedings of the Third Language Resources and Evaluation Conference (LREC), Las Palmas, Canary Islands, Spain, 839-44. Available on line at
    9. Jones, Steven, Carita Paradis, M. Lynne Murphy, and Caroline Willners. "Googling for 'opposites': a web-based study of antonym canonicity," Corpora 2:2 (2007): 129-155. [Abstract]
    10. Kilgarriff, Adam. "Introduction to the Special Issue on the Web as Corpus." Computational Linguistics 29:3 (2003), 333-47.
    11. Leech, Geoffrey. "New resources, or just better old ones? the Holy Grail of representativeness." in Hundt et al.(2007): 133-150.

      Reminds us of what is not available in a ready-made web corpus.

    12. Mehler, Alexander, Serge Sharoff, Marina Santini, eds. Genres on the Web: Computational Models and Empirical Studies (Text, Speech and Language Technology). Springer Verlag (2010).[Abstract]
    13. Mondorf, Britta. "Recalcitrant problems of comparative alternation and new insights emerging from internet data." in Hundt et al., 211-32
    14. Renouf, A., A. Kehoe & J. Banerjee. "WebCorp: an integrated system for web text search" in Hundt et al. (eds.)2007:47-67

      This is somewhat later version of The WebCorp Search Engine: a holistic approach to Web text Search (2005), but has some other examples of use.

    15. Rohdenburg, Gunter. "Determinants of grammatical variation in English and the formation / confirmation of linguistic hypotheses by means of internet data," in Hundt et al. (2007): 191-209.

      offers 4 universal, functionally motivated tendencies affecting choices among varying alternatives. (See also collections edited by Rohdenburg and Schlüter and by Rohdenburg and Mondorf.)

    16. Rosenbach, Anette. "Exploring constructions on the web: a case study." in Hundt et al., 167-190.

      NP-Poss N vs. N N (driver's license/ driver license) hard to detect animacy on web

    17. Santini, Marina. Work on Web genres.
    18. Sharoff, Serge. "Creating general-purpose corpora using automated search engine queries,"in wackybook
    19. Thelwall, Mike (2005) "Creating and using Web corpora" IJCL 10:4:

      Better at discussing the pitfalls/complexities of Web as corpus than in practical solutions. Compared university websites in OZ,NZ, and UK and found some differences between them (as a group) and BNC in top 50 most frequent words. Parturient montes...

  7. Change in Recent English
    1. Hickey, Raymond. "Productive lexical processes in present-day English". In: Mair, Christian, Reinhard Heuberger and Josef Wallmannsberger 2006. Corpora and the History of English. A Festschrift for Manfred Markus. Heidelberg: Winter, 2006, pp. 153-68. [Here]
    2. Hundt, Marianne. "The mediopassive in American 20th-century advertising language," in Renouf and Kehoe, 163-83.
    3. Kehoe, Andrew. "Diachronic linguistic analysis on the web with WebCorp." in Renouf and Kehoe, pp.

      The URL for RDUES is now at Birmingham City University:

    4. Leech, Geoffrey and Nicholas Smith. "Recent grammatical change in written English 1961-1992: some preliminary findings of a comparison of American with British English," in Renouf and Kehoe, 185-204.
    5. Meurman-Solin, Anneli and Paivi Pahta. "Circumstantial adverbials in discourse: a synchronic and a diachronic perspective" in Renouf and Kehoe, 117-142
    6. Leech, Geoffrey, Marianne Hundt, Christian Mair, and Nicholas Smith, Change in Contemporary English: A Grammatical Study. CUP, 2009. P128,C68 L43 2009
    7. Mair, Christian. (2003). "Tracking ongoing grammatical change and recent diversification in present-day standard English: the complementary role of small and large corpora," in Renouf and Kehoe, 355-376.

      Interesting positions on many points besides the get-passive.

    8. ______________. "Change and variation in present-day English: integrating the analysis of closed corpora and web-based monitoring." in Hundt et al., 231-47

      long list of points of change; different from/than/to and progressive; natl. difs. by country extension

    9. Partington, Alan, "Corpus evidence of language change: the case of the intensifier," in Baker, et al. 177-92.

  8. Academic Discourse
    1. Ädel, Annelie and Randi Reppen. Corpora and Discourse: The challenges of different settings. Benjamins, 2008.
    2. Charles, Maggie, Diane Pecorari and Susan Hunston, eds. Academic Writing: At the interface of corpus and discourse. Continuum, 2009. P 302.18 A33 2009
    3. Flowerdew, John, ed. Academic Discourse. Pearson Education, 2002.
    4. Hyland, Ken and Marina Bondi, eds. Academic Discourse Across Disciplines. Peter Lang, 2006.
    5. Nash, Walter, ed. The Writing Scholar: Studies in Academic Discourse. Sage Publications, 1990.

      Old and non-corpus, but quite linguistic. several on modality (Paul Simpson, Joanna Channell, Christopher Butler)

    6. Tognini-Bonelli, Elena and Gabriella Del Lungo Camiciotti, eds. Strategies in Academic Discourse. John Benjamins, 2005.
    7. Articles and monographs
    8. Aijmer, Karin. "Evaluation and pragmatic markers, " in Tognini-Bonelli and Camiciotti (2005) 82-96.

      uses parallel translated Swedish/English texts to tease out three functions of prag markers: a 'reality' use, a hedging use, and a conclusion from a number of alternatives.

    9. Bell, D. "Sentence-initial AND and BUT in academic writing." Pragmatics 17 (2007):183-201
    10. Biber, Douglas. University Language: A corpus-based study of spoken and written registers. John Benjamins (2006).

      Based on his own T2K-SWAL Corpus: chapters on vocaculary, grammatical variation, expression of stance, lex bundles, and multidimensional analysis

    11. Bondi, Marina. "Emphatics in academic discourse: Integrating corpus and discourse tools in the study of cross-disciplinary variation," in Ädel et al. (2008); 31-56.

      About "boosters" as used in two "soft" disciplines: history and economics. Corpora of all articles in 10 journals in each discipline for year 1999-2000. Mostly initial adverbials, esp. "the use significantly, invariably, undoubtedly. shows that the use of emphatics in history is much more varied and graded than in economics (52_

    12. ____________. "'A case in point': Signals of Narrative Development in Business and Economics," in Hyland and Bondi, eds. (2006):49-74
    13. ____________. "In the Wake of the Terror: Phraseological tools of time setting in the narrative of history," in Charles et al. (2009): 73-90.

      Extracts keywords for time setting in 2.5 M word history corpus (reference list a similar sized one in econ and business. Key words and phrases that emerge are frequent in history (COCA), sometimes most frequent, but never more than 2x more. Except of course those including 19th and 20th century, middle ages, history

    14. Charles, M. "In search of representativity in specialised corpora: Categorisation through collocation". International Journal of Corpus Linguistics, 7 (1) (2009): 43-64.
    15. _________. "Revealing and obscuring the writer's identity: Evidence from a corpus of theses". In R. Kiely, P. Rea-Dickins, H. Woodfield & G. Clibbon (Eds.), Language, Culture and Identity in Applied Linguistics. London: BAAL/Equinox, 1471
    16. Cortez, Viviana. "Lexical bundles in Freshman composition," in Reppen et al. (2002):132-145.

      Comparison of most frequent 4-grams in freshman essays and in acad/convers subcorpora of Longman's corpus (Biber et al.). Shows clearly Freshmen not influenced by conversational models, but by desire to replicate academic formulae in their readings. Suggests looking for variation across disciplines and academic levels.

    17. ______________. "Lexical Bundles in Published and Student Disciplinary Writing: Examples from history and biology," English for Specific Purposes 23 (2004): 397-423
    18. Dahl, Trine. "Contributing to the academic conversation: A study of new knowledge claims in economics and linguistics," Journal of Pragmatics 40 (2008): 1184-1201. [Here]
    19. ____________. "The linguistic representation of rhetorical function: A study of how economists present their knowledge claims," Written Communication 26:4 (2009):370-391.[Abstract]
    20. Flowerdew, Lynne. "Corpus-Based Analyses in EAP," in Flowerdew (2002) 95-114

      Review of literature

    21. Freddi, Maria. "From corpus to register: The construction of evaluation and argumentation in linguistics textbooks.

      Compares introductory chapters in 10 intro textbooks on point of overlapping of evaluation and argumentation. Style emerges as a parameter.

    22. _____________, "Arguing linguistics: Corpus investigation of one functional variety of academic discourse. Journal of English for Academic Purposes 4(1) (2005):5-26.
    23. Flowerdew, John and Richard W. Forrest. "Schematic Structure and Lexico-Grammatical Realization in Corpus-based Genre Analysis: The case of research in the PhD Literature Review," in Charles et al., eds., (2009): 15-36.h

      Traces 3 key keywords in lit. review chapters of diss.s in applied lings. Finds difs. in frequency of research, study, and studies by Moves. Finds study almost always self-referential (this, the present, the current).

    24. Gardner, Dee and Mark Davies. "A New Academic Vocabulary List". Applied Linguistics 2013: 11-24.
    25. Gardezi, S. Amina, and Hilary Nesi. "Variation in the writing of economics students in Britain and Pakistan: the case of conjunctive ties." in Charles et al. eds. (2009)

      Compares Pakistani and British essays by economics students (Br. from from BAWE) in use of conjunctive ties; finds and and but used initially more by the Pakistanis and in general more conjunctive ties used by them. Illustrates use of Rayson-Garside LL calculator to measure similarity or corpora.

    26. Giannoni, Davide Simone. "Book Acknowledgements across Disciplines and Texts," in Hyland and Bondi, eds. (2006): 151-176.

      Sharp rise in Acknowledgment sections of books in last 50 years creates a place where subjectivity can be indulged, esp. via hyperbole, irony, and emotivity. More of this across the board in the 'soft' sciences (lings, econ, soc). Size of networks to be acknowledged is greater in hards.

    27. Groom, Nicholas. "Pattern and meaning across genres and disciplines: An exploratory study". Journal of English for Academic Purposes, 4 (3) (2005), 257-277. [Abstract]
    28. Hewings, M. & Hewings, A. "Anticipatory "it" in academic writing: An indicator of disciplinary difference and developing disciplinary knowledge". In M. Hewings (Ed.), Academic Writing in Context: Implications and Applications. Birmingham: University of Birmingham Press, (2001): 199-214.
    29. ________________________, "It is interesting to note that"": A comparative study of anticipatory "it" in student and published writing". English for Specific Purposes, 21 (4)(2002): 367-383.
    30. _________________________. "Impersonalizing stance: A study of anticipatory "it" in student and published academic writing". In C. Coffin, A. Hewings & K. O'Halloran (Eds.), Applying English Grammar: Functional and Corpus Approaches. London: Arnold, (2004) 101-116.
    31. Holmes, Jasper, and Hilary Nesi. "Verbal and Mental Processes in Academic Disciplines," in Charles et al. (2009): 58-72. [Abstract]
    32. Hunston, Susan. "Evaluation and ideology in scientific writing," in Register Analysis: Theory and Practice, ed. Mohsen Ghadessy. Pinter Pub Ltd. (1993), pp. 57-75.

    33. ______________. "Professional Conflict: Disagreement in academic discourse," in Baker et al, eds. (1993): 115-134. [Abstract] [text]
    34. ______________. "Conflict and Consensus: Construing opposition in Applied Linguistics," in Tognini-Bonelli and Camiciotti (2005) 1-16.
    35. Hyland, Ken. "Writing Without Conviction? Hedging in Science Research Articles," Applied Linguistics 17: 4(1996): 433-454. [Abstract]. [Here]
    36. ___________. "Bringing in the reader: Addressee features in academic artices." Written Communication 18 (4):549-574.
    37. ___________. "Directives: Argument and engagement in academic writing," Applied Linguistics 23:2 (2002: 215-239 [Here]
    38. ___________. "Activity and Evaluation: Reporting practices in academic writing," in Flowerdew (2002), 115-130

      variation across disciplines

    39. ___________. "As can be seen: Lexical bundles and disciplinary variation," in English for Specific Purposes 27 (2008): 4-21. [Abstract] [Here]
    40. ___________. "Authority and invisibility: Authorial identity in academic writing," Journal of Pragmatics 34 (8)
    41. ___________. "Humble Servants of the Discipline? Self-mention in research articles." English for Specific Purposes 20 (3):207-226.
    42. ___________ . "Self-citation and self-reference: Credibility and promotion in academic publication." Jounral of American Society for Information Science and Technology 54 (3): 251-259.
    43. ___________. "A convincing argument: Corpus analysis and academic persuasion," in Connor and Upton, eds., 87-112. [here]

      Contrasts research articles in 4 'hard' disciplines (App Lings, EE ME, Physics) with a matching set in 4 'soft' disciplines(Soc, Marketing, Philosophy, Biology) along three parameters of contrast: citation practices, Reader-oriented features, and self-mention/acad promotion.

    44. ___________. Academic Discourse. Continuum, 2009.

      Very introductory—not really corpus lings. Sketches research, instructional, student, and popular discourses.

    45. __________ and Polly Tse. "'The leading journal in its field': evaluation in journal descriptions," Discourse Studies 11:6 (2009): 703-20. [Abstract]
    46. Kaltenbock, G. "On the syntactic and semantic status of anticipatory it".English Language and Linguistics , 7 (2) (2003): 235-255.
    47. Lewin, Beverly A. "Contentiousness in science: The discourse of critique in two sociology journals," Text 25:6 (2005): 723-744. [Abstract] [Here]
    48. McEnery, Tony and Nazareth Amselom Kifle, "Epistemic modality in argumentative essays of second language writers, in Flowerdew (2002)182-194

      use of lexical verbs expressing e. modality in writing of NS/NNS

    49. Oakey, David. "Formulaic language in English academic writing: A corpus-based study of the formal and functional variation of a lexical phrase in different academic disciplines" in Reppen et al. (2002): 111-129.

      Compares uses of 'sentence-builder' "it is/has been (often) asserted/believed/noted that X" in 3 subcorpora of BNC (socsci, medicine, tech/engineering) and three major functions (topic priming, support cited, straw man, self-reference) .

    50. ____________. "Academic vocabulary in academic discourse: The phraseological behaviour of EVALUATION in Economics research articles," in Tognini-Bonelli and Camiciotti (2005): 169-184

      NNS learners' difficulties are more phraseological than word based.

    51. Okamura, Akiko. "pragmatic force in biology papers written by British and Japanese scientists," in Tognini-Bonelli and Camiciotti (2005: 69-82)

      Japanese scientists use past tense rather than present perfecdt in introduction, making their 'we' less authoritative.

    52. Peacock, Matthew. "A cross-disciplinary comparison of boosting in research articles," Corpora 1:1 (2006) 61-84. [Summary] [Here]
    53. _______________. "A comparative study of introductory it in research articles across eight disciplines," International Journal of Corpus Linguistics 16:1 (2011): 72-100. [Abstract]. [Here]
    54. Pecorari, Diane. "Formulaic Language in Biology: A Topic-specific Investigation," in Charles et al (2009):91-104

      Found high density of 4+ ngrams in 500K word corpus of artitcles on candida albicans. Strongest in 'external' reference to materials, procedures, and the liberature. Lots of 'reporting verb' ngrams ([have] been identified as

    55. Poos, Deanna and Rita Simpson. "Cross-disciplinary comparisons of hedging: Some findings from the Michigan Corpus of Academic Spoken English," in Reppen et al. (2002): 3-23.
    56. Poudat, Céline and Sylvain Loiseau, "Authorial presence in academic genres," in Tognini-Bonelli and Camiciotti(2005), 51-69.

      First person pronouns in French philosophical vs. linguistic writings

    57. Römer, Ute. "'This seems somewhat counterintuitive, though…': Negative evaluation in linguistic book reviews by male and female authors," in Tognini-Bonelli and Camiciotti (2005): 97-116.

      Focus on a corpus of book reviews in lings, on ways of expressing negative valuation. Considers gender. keys on words like surprising and disappointing.

    58. ________________. "The inseparability of lexis and grammar: Corpus linguistic perspectives". Annual Review of Cognitive Linguistics 7 (1) (2009), 140-162
    59. Sanderson, Tamsin. "Interaction, identity and culture in academic writing: The case of German, British and American academics in the humanities," in Ädel et al, eds. (2008): 57-92.

      Compares German and Eng Lang author's self-reference (and some 2nd person); finds 'I-taboo' much more robust for Germans (pref. for we) and German Philosophers very heavy users (77%) of we-humans. Some 3rdp ref. Status and gender also imp. parameters.

    60. Shaw, Philip."Linking adverbials in student and professional writing in literary studies: what makes writing mature," in Charles et al. (2009): 215-35. [Abstract]
    61. Silver, Marc. "Introducing abstract reasoning: World of reference and writer argument across disciplines," in Hyland and Bondi, eds. (2006): 75-101.

      4 subcorpora: Unified Physics, Molecular Biology, Economics, and Business Management. Comparative keyword lists of 'doing science' verbs. logic of hard/soft disciplines.

    62. Simpson-Vlach, Rita and Nick C. Ellis. "An Academic Formulas List: New Methods in Phraseology Research," Applied Linguistics 31:4 (2010): 487-512. [Abstract]
    63. Stotesbury, Hilkka. "Gaps and False Conclusions: Criticism in Research Article Abstracts," in Hyland and Bondi, eds. (2006):123-148. [Summary]
    64. Taylor, G. & Chen, T. "Linguistic, cultural, and subcultural issues in contrastive discourse analysis: Anglo-American and Chinese scientific texts". Applied Linguistics, 12 (3) (1991):31-36.
    65. Tse, Polly and Ken Hyland. "Gender and discipline: Exploring metadiscourse variation in academic book reviews," in Hyland and Bondi, eds. (2006): [Summary]
    66. Thompson, Paul. "Aspects of identification and position in intertextual reference in PhD theses," in Tognini-Bonelli and Camiciotti, eds. (2005) 31-50.

      averral vs. attribution in theses in Agricultural Botany, Agricultural Economics, and Psychology,

    67. Vassileva, Irena. "Who am I/who are we in academic writing? A contrastive analysis of authorial presence in English, German, French, Russian and Bulgarian," International journal of applied linguistics 8:2(1998): 163-190. [Here]
    68. Vihla, Minna. Medical Writing: Modality in Focus. Amsterdam: Rodopi, 1999.

      Analyzes distribution of modals and modality markers over four genres of professisonal American medical writing (Research articles, Editorial articles, Textbook samples, and Handbook samples) as well as two groups of popular texts (Newspaper/magazine articles and Guidebook samples). Finds main contrast in med writing between research and the clinic (advisory). Chapter on argumentation is good. [ [Here]]

    69. Vold, Eva Thue. "Epistemic modality markers in research articles: a cross-linguistic and cross-disciplinary study," International journal of applied linguistics 16:1 (2006): 611-87. [Here]
    70. ______________, "The choice and use of epistemic modality markers in linguistics and medical research articles," in Hyland and Bondi, eds. (2006): 225-252.
    71. Walsh, Steve, Anne O;Keefe and Michahel McCarthy. "', multi-culturalism, structuralism, feminism, post-modernism and so on and so forth: A comparative analysis of vague category markers in academic discourse," in Ädel et al (2008):9-29. [spoken academic discourse]
    72. Williams, G. "In search of representativity in specialised corpora: Categorisation through collocation". International Journal of Corpus Linguistics, 7 (1) (2002), 43-64.
    73. Wulff, Stefanie and Ute Römer. "Becoming a proficient academic writer: shifting lexical preferences in the use of the progressive," Corpora 4:2(2009): 115-133. [Here]

  9. Issues and Applications in Corpus Lings
    1. Deignan, Alice. "Corpus-based Research into Metaphor," ln Lynne Cameron and Graham Low, Eds., Researching and Applying Metaphor, CUP (1999):177-202.
    2. Dillon, George. The genres speak: Using large corpora to profile generic registers. Journal of Literary Semantics, 36(2):159-88, 2007.

      Finds key words and phrases that characterize journalism and popular romance, using BNC and Lexis Nexis.

    3. Moon, Rosamund. Words, frequencies, and texts (particularly Conrad): A stratified approach. Journal of Literary Semantics, 36(1):1-34, 2007.
    4. Semino, Elena and Short, Mick. Corpus Stylistics: speech, writing, and thought presentation in a corpus of English writing. Routledge, 2004.

      Based on a 258,348 word corpus of late 20th century British narrative prose.

    5. Hori, Masahiro. Investigating Dickens' Style : A Collocational Analysis. Palgrave, Macmillan, 2004. [Summary]
    6. Stubbs, Michael. "Conrad in the Computer: examples of quantitative stylistic methods." Language and Literature 14 (2005) 5-24.

      A model paper for tackling a big issue (The Fish fork) and arguing for the whole application of machines in the humanities

Last modified: Thurs Feb 26th, 2014