- •Contents
- •1 Russian
- •1.1 The Russian language
- •1.1.1 Russian then and now
- •1.1.2 Levels of language
- •1.2 Describing Russian grammar
- •1.2.1 Conventions of notation
- •1.2.2 Abbreviations
- •1.2.3 Dictionaries and grammars
- •1.2.4 Statistics and corpora
- •1.2.5 Strategies of describing Russian grammar
- •1.2.6 Two fundamental concepts of (Russian) grammar
- •1.3 Writing Russian
- •1.3.1 The Russian Cyrillic alphabet
- •1.3.2 A brief history of the Cyrillic alphabet
- •1.3.3 Etymology of letters
- •1.3.4 How the Cyrillic alphabet works (basics)
- •1.3.5 How the Cyrillic alphabet works (refinements)
- •1.3.6 How the Cyrillic alphabet works (lexical idiosyncrasies)
- •1.3.7 Transliteration
- •2 Sounds
- •2.1 Sounds
- •2.2 Vowels
- •2.2.1 Stressed vowels
- •2.2.3 Vowel duration
- •2.2.4 Unstressed vowels
- •2.2.5 Unpaired consonants [ˇs ˇz c] and unstressed vocalism
- •2.2.6 Post-tonic soft vocalism
- •2.2.7 Unstressed vowels in sequence
- •2.2.8 Unstressed vowels in borrowings
- •2.3 Consonants
- •2.3.1 Classification of consonants
- •2.3.2 Palatalization of consonants
- •2.3.3 The distribution of palatalized consonants
- •2.3.4 Palatalization assimilation
- •2.3.5 The glide [j]
- •2.3.6 Affricates
- •2.3.7 Soft palatal fricatives
- •2.3.8 Geminate consonants
- •2.3.9 Voicing of consonants
- •2.4 Phonological variation
- •2.4.1 General
- •2.4.2 Phonological variation: idiomaticity
- •2.4.3 Phonological variation: systemic factors
- •2.4.4 Phonological variation: phonostylistics and Old Muscovite pronunciation
- •2.5 Morpholexical alternations
- •2.5.1 Preliminaries
- •2.5.2 Consonant grades
- •2.5.3 Types of softness
- •2.5.4 Vowel grades
- •2.5.5 Morphophonemic {o}
- •3 Inflectional morphology
- •3.1 Introduction
- •3.2 Conjugation of verbs
- •3.2.1 Verbal categories
- •3.2.2 Conjugation classes
- •3.2.3 Stress patterns
- •3.2.4 Conjugation classes: I-Conjugation
- •3.2.5 Conjugation classes: suffixed E-Conjugation
- •3.2.6 Conjugation classes: quasisuffixed E-Conjugation
- •3.2.7 Stress in verbs: retrospective
- •3.2.8 Irregularities in conjugation
- •3.2.9 Secondary imperfectivization
- •3.3 Declension of pronouns
- •3.3.1 Personal pronouns
- •3.3.2 Third-person pronouns
- •3.3.3 Determiners (demonstrative, possessive, adjectival pronouns)
- •3.4 Quantifiers
- •3.5 Adjectives
- •3.5.1 Adjectives
- •3.5.2 Predicative (‘‘short”) adjectives
- •3.5.3 Mixed adjectives and surnames
- •3.5.4 Comparatives and superlatives
- •3.6 Declension of nouns
- •3.6.1 Categories and declension classes of nouns
- •3.6.2 Hard, soft, and unpaired declensions
- •3.6.3 Accentual patterns
- •3.6.8 Declension and gender of gradation
- •3.6.9 Accentual paradigms
- •3.7 Complications in declension
- •3.7.1 Indeclinable common nouns
- •3.7.2 Acronyms
- •3.7.3 Compounds
- •3.7.4 Appositives
- •3.7.5 Names
- •4 Arguments
- •4.1 Argument phrases
- •4.1.1 Basics
- •4.1.2 Reference of arguments
- •4.1.3 Morphological categories of nouns: gender
- •4.1.4 Gender: unpaired ‘‘masculine” nouns
- •4.1.5 Gender: common gender
- •4.1.6 Morphological categories of nouns: animacy
- •4.1.7 Morphological categories of nouns: number
- •4.1.8 Number: pluralia tantum, singularia tantum
- •4.1.9 Number: figurative uses of number
- •4.1.10 Morphological categories of nouns: case
- •4.2 Prepositions
- •4.2.1 Preliminaries
- •4.2.2 Ligature {o}
- •4.2.3 Case government
- •4.3 Quantifiers
- •4.3.1 Preliminaries
- •4.3.2 General numerals
- •4.3.3 Paucal numerals
- •4.3.5 Preposed quantified noun
- •4.3.6 Complex numerals
- •4.3.7 Fractions
- •4.3.8 Collectives
- •4.3.9 Approximates
- •4.3.10 Numerative (counting) forms of selected nouns
- •4.3.12 Quantifier (numeral) cline
- •4.4 Internal arguments and modifiers
- •4.4.1 General
- •4.4.2 Possessors
- •4.4.3 Possessive adjectives of unique nouns
- •4.4.4 Agreement of adjectives and participles
- •4.4.5 Relative clauses
- •4.4.6 Participles
- •4.4.7 Comparatives
- •4.4.8 Event nouns: introduction
- •4.4.9 Semantics of event nouns
- •4.4.10 Arguments of event nouns
- •4.5 Reference in text: nouns, pronouns, and ellipsis
- •4.5.1 Basics
- •4.5.2 Common nouns in text
- •4.5.3 Third-person pronouns
- •4.5.4 Ellipsis (‘‘zero” pronouns)
- •4.5.5 Second-person pronouns and address
- •4.5.6 Names
- •4.6 Demonstrative pronouns
- •4.7 Reflexive pronouns
- •4.7.1 Basics
- •4.7.2 Autonomous arguments
- •4.7.3 Non-immediate sites
- •4.7.4 Special predicate--argument relations: existential, quantifying, modal, experiential predicates
- •4.7.5 Unattached reflexives
- •4.7.6 Special predicate--argument relations: direct objects
- •4.7.7 Special predicate--argument relations: passives
- •4.7.8 Autonomous domains: event argument phrases
- •4.7.9 Autonomous domains: non-finite verbs
- •4.7.12 Retrospective on reflexives
- •4.8 Quantifying pronouns and adjectives
- •4.8.1 Preliminaries: interrogatives as indefinite pronouns
- •4.8.7 Summary
- •4.8.9 Universal adjectives
- •5 Predicates and arguments
- •5.1 Predicates and arguments
- •5.1.1 Predicates and arguments, in general
- •5.1.2 Predicate aspectuality and modality
- •5.1.3 Aspectuality and modality in context
- •5.1.4 Predicate information structure
- •5.1.5 Information structure in context
- •5.1.6 The concept of subject and the concept of object
- •5.1.7 Typology of predicates
- •5.2 Predicative adjectives and nouns
- •5.2.1 General
- •5.2.2 Modal co-predicates
- •5.2.3 Aspectual co-predicates
- •5.2.4 Aspectual and modal copular predicatives
- •5.2.5 Copular constructions: instrumental
- •5.2.6 Copular adjectives: predicative (short) form vs. nominative (long) form
- •5.2.9 Predicatives in non-finite clauses
- •5.2.10 Summary: case usage in predicatives
- •5.3 Quantifying predicates and genitive subjects
- •5.3.1 Basics
- •5.3.2 Clausal quantifiers and subject quantifying genitive
- •5.3.3 Subject quantifying genitive without quantifiers
- •5.3.4 Existential predication and the subject genitive of negation: basic paradigm
- •5.3.5 Existential predication and the subject genitive of negation: predicates
- •5.3.6 Existential predication and the subject genitive of negation: reference
- •5.3.8 Existential predication and the subject genitive of negation: predicates and reference
- •5.3.9 Existential predication and the subject genitive of negation: context
- •5.3.10 Existential predication and the subject genitive of negation: summary
- •5.4 Quantified (genitive) objects
- •5.4.1 Basics
- •5.4.2 Governed genitive
- •5.4.3 Partitive and metric genitive
- •5.4.4 Object genitive of negation
- •5.4.5 Genitive objects: summary
- •5.5 Secondary genitives and secondary locatives
- •5.5.1 Basics
- •5.5.2 Secondary genitive
- •5.5.3 Secondary locative
- •5.6 Instrumental case
- •5.6.1 Basics
- •5.6.2 Modal instrumentals
- •5.6.3 Aspectual instrumentals
- •5.6.4 Agentive instrumentals
- •5.6.5 Summary
- •5.7 Case: context and variants
- •5.7.1 Jakobson’s case system: general
- •5.7.2 Jakobson’s case system: the analysis
- •5.7.3 Syncretism
- •5.7.4 Secondary genitive and secondary locative as cases?
- •5.8 Voice: reflexive verbs, passive participles
- •5.8.1 Basics
- •5.8.2 Functional equivalents of passive
- •5.8.3 Reflexive verbs
- •5.8.4 Present passive participles
- •5.8.5 Past passive participles
- •5.8.6 Passives and near-passives
- •5.9 Agreement
- •5.9.1 Basics
- •5.9.2 Agreement with implicit arguments, complications
- •5.9.3 Agreement with overt arguments: special contexts
- •5.9.4 Agreement with conjoined nouns
- •5.9.5 Agreement with comitative phrases
- •5.9.6 Agreement with quantifier phrases
- •5.10 Subordinate clauses and infinitives
- •5.10.1 Basics
- •5.10.2 Finite clauses
- •5.10.4 The free infinitive construction (without overt modal)
- •5.10.5 The free infinitive construction (with negative existential pronouns)
- •5.10.6 The dative-with-infinitive construction (overt modal)
- •5.10.7 Infinitives with modal hosts (nominative subject)
- •5.10.8 Infinitives with hosts of intentional modality (nominative subject)
- •5.10.9 Infinitives with aspectual hosts (nominative subject)
- •5.10.10 Infinitives with hosts of imposed modality (accusative or dative object)
- •5.10.11 Final constructions
- •5.10.12 Summary of infinitive constructions
- •6 Mood, tense, and aspect
- •6.1 States and change, times, alternatives
- •6.2 Mood
- •6.2.1 Modality in general
- •6.2.2 Mands and the imperative
- •6.2.3 Conditional constructions
- •6.2.4 Dependent irrealis mood: possibility, volitive, optative
- •6.2.5 Dependent irrealis mood: epistemology
- •6.2.6 Dependent irrealis mood: reference
- •6.2.7 Independent irrealis moods
- •6.2.8 Syntax and semantics of modal predicates
- •6.3 Tense
- •6.3.1 Predicates and times, in general
- •6.3.2 Tense in finite adjectival and adverbial clauses
- •6.3.3 Tense in argument clauses
- •6.3.4 Shifts of perspective in tense: historical present
- •6.3.5 Shifts of perspective in tense: resultative
- •6.3.6 Tense in participles
- •6.3.7 Aspectual-temporal-modal particles
- •6.4 Aspect and lexicon
- •6.4.1 Aspect made simple
- •6.4.2 Tests for aspect membership
- •6.4.3 Aspect and morphology: the core strategy
- •6.4.4 Aspect and morphology: other strategies and groups
- •6.4.5 Aspect pairs
- •6.4.6 Intrinsic lexical aspect
- •6.4.7 Verbs of motion
- •6.5 Aspect and context
- •6.5.1 Preliminaries
- •6.5.2 Past ‘‘aoristic” narrative: perfective
- •6.5.3 Retrospective (‘‘perfect”) contexts: perfective and imperfective
- •6.5.4 The essentialist context: imperfective
- •6.5.5 Progressive context: imperfective
- •6.5.6 Durative context: imperfective
- •6.5.7 Iterative context: imperfective
- •6.5.8 The future context: perfective and imperfective
- •6.5.9 Exemplary potential context: perfective
- •6.5.10 Infinitive contexts: perfective and imperfective
- •6.5.11 Retrospective on aspect
- •6.6 Temporal adverbs
- •6.6.1 Temporal adverbs
- •6.6.2 Measured intervals
- •6.6.3 Time units
- •6.6.4 Time units: variations on the basic patterns
- •6.6.14 Frequency
- •6.6.15 Some lexical adverbs
- •6.6.16 Conjunctions
- •6.6.17 Summary
- •7 The presentation of information
- •7.1 Basics
- •7.2 Intonation
- •7.2.1 Basics
- •7.2.2 Intonation contours
- •7.3 Word order
- •7.3.1 General
- •7.3.6 Word order without subjects
- •7.3.7 Summary of word-order patterns of predicates and arguments
- •7.3.8 Emphatic stress and word order
- •7.3.9 Word order within argument phrases
- •7.3.10 Word order in speech
- •7.4 Negation
- •7.4.1 Preliminaries
- •7.4.2 Distribution and scope of negation
- •7.4.3 Negation and other phenomena
- •7.5 Questions
- •7.5.1 Preliminaries
- •7.5.2 Content questions
- •7.5.3 Polarity questions and answers
- •7.6 Lexical information operators
- •7.6.1 Conjunctions
- •7.6.2 Contrastive conjunctions
- •Bibliography
- •Index
Russian 3
Russian has undergone some change since the political and economic turmoil of the late eighties and early nineties, but it is difficult to assess how much. Most tangibly, there have been changes in vocabulary.5 Borrowing and native derivational processes have produced many new words and word combinations, leading to macaronic texts: ytqk-fhn ‘nail-art’, WEB-lbpfqy ‘WEB-design’,
Htrbq ,bhvbyutvcrbq lb-l;tq gj bvtyb Graham Mack lb-l;tbk ct,t, lb-l;tbk, lf nfr b ljlbl;tbkcz, xnj c hflbj eitk ‘A certain Birmingham DJ, named Graham Mack, DJ-ed, DJ-ed, and so DJ-ed out, that he had to leave the radio station’. This internationalized vocabulary now dominates the linguistic landscape, just as Soviet-speak used to dominate language a half century ago. Along with these changes in vocabulary has come a less quantifiable but still palpable change in the mores of language. Unedited, informal texts of written Russian of a type that would never have become public during the days of active Soviet censorship are now available in print and especially electronic form. And yet, despite political changes and a loosening of speech manners, contemporary Russian in its grammatical structure remains Russian.
1.2 Describing Russian grammar
1.2.1 Conventions of notation
The notational conventions employed here are those of Table 1.1.
In the body of the text, Cyrillic words and phrases will be given in italics, and English translations in single quotation marks. Stress is marked in citation forms of words or short phrases; stress is not marked on vowels in fragments of text cited in the text or in set-off numbered examples. In numbered examples, italics and quotations are not used.
1.2.2 Abbreviations
The abbreviations used in this study are listed in Table 1.2.
1.2.3 Dictionaries and grammars
The definitive dictionary of Russian in Russian is the Slovar sovremennogo russkogo literaturnogo iazyka, a seventeen-volume dictionary published over 1950--65. Selfevidently it does not include the numerous new words from the last several decades. Shorter Russian-language dictionaries are fully useful, notably Ozhegov’s one-volume classic, which conveniently lists grammatical forms with stress. More than adequate bilingual dictionaries are the Oxford dictionary (both directions) and now the Novyi Slovar (Russian to English), the most up-to-date
5 Zemskaia 2000.
4A Reference Grammar of Russian
Table 1.1 Conventions used
notation |
interpretation |
|
|
nom sg ntnhƒlm |
grammatical gloss and Russian word |
ntnhƒlm<nom sg> |
alternative grammatical gloss of Russian word |
lj<\gen> |
grammatical form conditioned by another word (preposition or |
|
verb) |
≤=≥ |
spelling of letter (or word) in Cyrillic, when spelling is at issue |
[ƒ] |
sound (from narrow phonetic through broad phonetic to |
{ƒ} or {ƒ : ø : ə} |
phonemic) |
vowel series, or set of stressed and unstressed vowels related |
|
{-ej} or -tq |
by etymology and/or synchronic alternation |
morphological unit |
|
{X : Y} |
any relation of elements, notably two stems of verbs, |
X Y |
{CVC-a-<pst/inf> : CVC-aj-|e|-}<prs>} |
two forms potentially available in the same context |
|
jcnƒnmcz/jcnfdƒnmcz |
aspect pair: perfective and secondary (derived) imperfective |
vf[ƒnm\vf[yénm |
aspect pair: simplex imperfective and semelfactive perfective |
(gj)ghjc∫nm or |
aspect pair: simplex imperfective and prefixed perfective |
ghjc∫nm\gjghjc∫nm |
|
√/ ± / ? / |
hierarchy of acceptability judgments: neutral, acceptable, |
|
frequent / less preferred option / restricted, marginal / |
|
dubious, ungrammatical |
|
|
|
|
dictionary available. A selection of dictionaries -- Russian only and bilingual -- is available on the web.
Russian dictionaries, unlike many dictionaries of English, do not give information about etymology, for which one should consult the dictionary of Max Vasmer (in its original German edition of 1953 or the Russian edition of 1986--87 revised by O. N. Trubachev), nor about earlier usage, for which one should use Srevnevskii’s “materials” for a dictionary of Old Russian from 1893--1912 (and later reprints), Slovar russkogo iazyka XI--XVII vv., or Slovar russkogo iazyka XVIII veka. Lubensky (1995) should be consulted for Russian idioms.
For grammatical information, the “grammatical dictionary” of A. A. Zalizniak (1977[a]), with 100,000 entries arranged in reverse alphabetical order, is definitive. Entries of the dictionary are indexed with paradigm numbers; exceptions are marked. The 142 introductory pages list paradigms with accentual contours.
A variety of grammars is available, including two compact grammars in English (Unbegaun 1957, Wade 1992), which, however, do not treat syntax extensively, as well as the multiple generations of “academy grammars” (for example,
Russian |
5 |
Table 1.2 Abbreviations used
abbreviation |
interpretation |
|
|
C / C/ / R / W |
set of consonants / obstruents / sonorants / {[v v˛]} |
C¸ / C 0 |
set of palatalized consonants / set of non-palatalized consonants |
V / V!/ V* |
set of vowels / stressed vowels / unstressed vowels |
ˇ |
consonant articulations: labial / dental / velar / alveo-palatal |
P / T / K / S |
|
C 0 / C j / C i / C i / C i / C i |
consonant grades (§2.5.2) |
[z˛] / [ür] / [3r] |
palatalized [z] / voiceless [r] / voiced [r] |
[´a5 ] / [´a5] / [´a55] |
[a] fronted in initial transition / final transition / both transitions |
ˇrür |
articulation in which one feature changes over duration of |
|
segment |
nom / acc / gen / dat / |
nominative / accusative / genitive / dative / locative / instrumental |
loc / ins |
|
gen1 / gen2 // loc1 / loc2 |
primary / secondary genitive // primary / secondary locative |
nom=acc / acc=gen |
syncretism of nominative and accusative (“inanimate accusative”) / |
|
syncretism of accusative and genitive (“animate accusative”) |
sg / pl / du |
singular / plural / dual |
msc / fem / nt |
masculine / feminine / neuter |
an / in |
animate / inanimate |
pv |
predicative (= “short”) adjective |
nn / qu / adj / pss |
noun / quantifier / adjective / possessive |
Declension<I> |
first declension: Declension<Ia> and Declension<Ib> |
Declension<Ia> |
first declension (masculine type with nom sg {- }: ,j´,) |
Declension<Ib> |
first declension (neuter type with nom sg {-o -e}: cnƒlj) |
Declension<II> |
second declension |
Declension<III> |
third declension |
Declension<IIIa> |
third declension (feminine with nom sg {- }: gkj´oflm) |
Declension<IIIb> |
third declension (neuter with nom sg -z: dh†vz) |
Declension<IIIc> |
third declension (masculine with nom sg {- }: génm ) |
R / E / A / F / T / M |
stress paradigms -- stress on: root / ending / classificatory suffix |
|
(verbs) / antethematic syllable / thematic syllable / mobile stress |
prs / pst / fut / inf / |
present / past / future / infinitive / imperative / irrealis / realis / |
imv / irr / rls / pcl / |
participle / adverbial participle (lttghbxfcnbt) / passive participle |
dee / psv |
|
if / pf // dt / id |
imperfective / perfective // determinate (imperfective) / |
|
indeterminate |
1sg / 1pl / 2sg / 2pl / 3sg / |
first-person singular / first-person plural / second-person singular / |
3pl |
second-person plural / third-person singular / third-person plural |
dim |
diminutive |
intg |
interrogative |
/ B / ↔ / |
address by ns / address by ds / mutual address by ns / mutual |
B ↔B / ↔B |
address by ds / asymmetric address, one speaker using ns, the |
|
other ds / |
У / И / О / Ф |
diminutive name / first name / patronymic / surname |
s jyf v dpzkf o vtyz d yf |
word order: subject verb object domain manner |
,fpfh m ,kfujgjkexyj |
|
|
|
|
|
6A Reference Grammar of Russian
RG 1980). The four-volume “functional grammar” is superb (Bondarko 1991--96). Good grammars exist in other European languages (for example, Garde 1980 in French, Isaˇcenko 1975 in German). The discussion below, though it is informed by this tradition of grammatical analysis, does not cite them in the interests of avoiding a clutter of references.
1.2.4 Statistics and corpora
To characterize how likely some construction is, it is often useful to cite statistics of usage. At the same time, it is important to acknowledge the limitations on statistical statements. The likelihood of using some or another morphological form or syntactic construction is really the likelihood of using the context in which the form or construction is appropriate; statistics ultimately measure how likely people are to say a whole context. For example, if we find that the combination e ytq is less frequent than e ytt, what we have really found is that the contexts in which e ytq is appropriate occur less frequently than those in which e ytt is appropriate. Any statistical statement, even one that appears to deal with morphological variants, is a measure of the frequency of the contexts in which these variants are appropriate. When the discussion below cites statistical observations, it is usually to say, informally and without pretense of scientific rigor, that a certain construction occurs surprisingly often or not particularly often, relative to what one might expect. The limitations on what statistical statements mean should always be kept in mind.
As a corpus for making statistical observations, I initially used the “Uppsala Corpus.” The corpus, assembled by the Slavic Institute of Uppsala University and mounted on the web by the University of Tubingen,¨6 offers a balanced selection of styles of texts through the 1980s; it has its own search. As time went on, I made use of the broader resources of the web. The address “http://www.lib.ru/” has a vastly larger number of (belletristic) texts. By using a powerful search engine (such as Google, Zndex, or Rambler), it is possible to search this site or the whole web for words or phrases, and produce quantities of Russian larger by orders of magnitude than the Uppsala Corpus. For example, in the Uppsala Corpus, the target ins sg nsczxtq produced no tokens, the target e ytq five tokens. In contrast, a search of http://www.lib.ru/ (with Google, <20.X.02>) produced 233 hits for nsczxtq and 796 for e ytq; and on the whole web (with Google, <20.X.02>), there were 8,790 hits for nsczxtq and 25,900 for e ytq. The new electronic resources, then, offer the possibility of vast quantities of Russian, most of it very contemporary.
6At: http://www.sfb441.uni-tuebingen.de/b1/korpora.html. The description (<http://www.slaviska.uu. se/korpdesc.htm>) states that the corpus is based on 600 Russian texts, one million running words, of informative (late 1980s) and literary texts (1960--88).
Russian 7
There are, however, some negatives, which grow in proportion to the size of the corpus and the frequency of the target word or phrase. Unlike the Uppsala Corpus, which was designed to serve as a corpus and has a balanced selection of genres of texts, the web was not designed to serve as a corpus for linguistic investigation. The web has properties that make it less than ideal as a corpus:
(a) the relative weight of genres -- www.libr.ru is heavy on literary texts and translations (if one has hesitations about translations), while the web as a whole has a random mix of commercial writing, personal travelogues, detailed histories of the repair records of automobiles, journalism, and religious texts; (b) the quality of Russian, which includes translations, sites from outside Russia, and informal personal writing and commercial writing that is no longer subjected to the same editing as was Russian printed in the Soviet era; (c) the fact that many of the texts show up on more than one site, undercutting the value of statistical observations; (d) instability -- the sites are not stable over time, impeding replication and verifiability; (e) the number of positive hits, which can be so large that the finite amount of time it takes to evaluate any token makes it difficult to examine all the data. The enormous volume of Russian available now is a mixed blessing.7
Allow me to cite cautionary tales. With respect to repetition: the phrase e;t jnrhsdfk<pst.if> jryj ‘[he] already opened the window’ -- a familiar phrase in aspectology -- gave a modest forty hits on the whole web (<20.XII.01>). But every one of them was the same sentence from a text by A. Tolstoy. With respect to stability, I searched the web for the expressions hfymit ytuj ‘earlier than him’ and d jnyjitybb ytuj ‘in relation to it’, and came up with 1,590 and 5,490 tokens, respectively (<20.XII.01>). The same search nine months later (<15.IX.02>) yielded 2,080 and 7,190 tokens -- an increase of 17 percent. With respect to quantity: I searched the web (<20.X.02>) for tokens of nsczxtq -- 8,790 hits -- and nsczxm/ -- 10,800 hits -- with the goal of finding out in crude terms the relative frequency of these two forms of the instrumental case of nsczxf. It would take perhaps eighty hours to evaluate all that data, if a modest fifteen seconds were devoted to each token. In short, the investigator has no control over the web and no way of determining what its properties as a corpus really are. The Uppsala Corpus, though smaller, offers a more balanced corpus.
In light of such difficulties, it is important to emphasize the limitations on citations from the web. All statistical statements made on the basis of the web should be taken for what they are: informal characterizations of frequency over unstable, often repetitive, collections of Russian assembled for other (commercial, etc.) purposes than to serve as a corpus for linguistic investigation. The corpus is not stable and one cannot control for repetition.
7 Browne 2001 explores the problems of using the web as a corpus.