TOEIC Link Grammar Tag Question Recognition and Formation: The Auxiliary-Polarity-Pronoun Triplet that High-Band Listeners and Speakers Must Process in Under One Second
A tag question is the short interrogative appended to a declarative statement that converts the statement into a confirmation request — "The shipment arrived yesterday, didn't it?" — and is built from three coordinated grammatical elements: the auxiliary (which must match the main clause's verbal structure), the polarity (which must reverse the main clause's polarity), and the pronoun (which must agree with the main clause's subject). The three elements have to coordinate correctly in approximately one second of speech, and the coordination is what makes tag questions one of the most failure-prone short structures in candidate-level English.
On TOEIC Link, tag questions appear in both directions. In Listening, they appear in short dialogues where the function of the tag (confirmation request, polite assertion, or rhetorical hedge) is part of what the candidate must recognize to answer the comprehension question. In Speaking, they appear whenever the candidate produces a hedged opinion or a softened request, and the production has to be grammatically clean enough that the rater does not mark the response down for a structural error.
This article is the recognition guide for the Listening direction and the formation guide for the Speaking direction. The two directions share the same underlying three-element coordination, so the recognition skill and the formation skill reinforce each other in training.
The structural anatomy of the tag question
The tag question is constructed by extracting an auxiliary verb from the main clause, reversing its polarity, and pairing it with a pronoun that agrees with the main clause's subject. The three steps are deterministic in standard English and produce a small set of canonical tag forms that high-band candidates have to recognize and produce automatically.
Step 1 — auxiliary extraction. The candidate identifies the auxiliary in the main clause. If the main clause has an auxiliary (is, are, was, were, has, have, had, will, would, can, could, should, may, might, must, shall), the tag uses the same auxiliary. If the main clause has no auxiliary and uses a simple present or simple past finite verb, the tag uses the dummy auxiliary "do" (do, does, did) inflected to agree with the main clause's tense and subject.
Step 2 — polarity reversal. The candidate reverses the polarity of the auxiliary. If the main clause is positive ("The shipment arrived yesterday"), the tag is negative ("didn't it"). If the main clause is negative ("The shipment didn't arrive yesterday"), the tag is positive ("did it"). The polarity rule has narrow exceptions for the same-polarity tag used in sarcastic or challenging contexts ("So you think you can win, do you?"), which is uncommon in TOEIC Link contexts.
Step 3 — pronoun agreement. The candidate selects a pronoun that agrees with the main clause's subject. The pronoun is "it" for inanimate subjects ("the shipment, didn't it"), "they" for plural human and non-human subjects ("the shipments, didn't they"; "the employees, didn't they"), "he" or "she" for singular human subjects ("Maria, didn't she"; "the manager, didn't he" — gender selected from context), and "you," "I," "we" for pronoun subjects (carried over directly).
The three steps coordinate deterministically, and the candidate who has internalized the deterministic procedure can produce the correct tag in approximately one second of speech.
The auxiliary-extraction matrix
The auxiliary extraction step is the source of most candidate-level errors because the candidate has to match the tag auxiliary to the main clause's verbal structure under real-time pressure. The matrix has four cases.
Case 1 — main clause with a modal or auxiliary already present. The tag uses the same modal or auxiliary. "He will join the meeting, won't he?" The auxiliary "will" is extracted directly.
Case 2 — main clause with simple present. The tag uses "do" or "does." "She runs the team, doesn't she?" The third-person singular "does" is selected because "she" requires it.
Case 3 — main clause with simple past. The tag uses "did." "They submitted the report, didn't they?" The simple past "did" is selected, and no tense agreement adjustment is needed for the plural pronoun.
Case 4 — main clause with "be" as the main verb (not auxiliary). The tag uses the same form of "be." "The deadline is Friday, isn't it?" The "is" functioning as the main verb is treated as auxiliary-equivalent for the purposes of tag formation.
The candidate who memorizes the four-case matrix can extract the correct auxiliary in under a quarter-second, leaving the remaining three-quarters of the second for the polarity and pronoun steps.
The polarity reversal rules and their exceptions
The polarity rule is straightforward in the canonical case: positive main clause produces negative tag, and negative main clause produces positive tag. Three exceptions or edge cases produce predictable confusion for candidates.
Exception 1 — negative adverbs without explicit negation. The main clause uses a negative adverb ("rarely," "seldom," "hardly," "scarcely," "barely") to express negation rather than the explicit "not." The tag is treated as if the main clause were explicitly negative. "She rarely misses a deadline, does she?" The negative adverb "rarely" is semantically negative, so the tag is positive.
Exception 2 — "I am" with the irregular "aren't I" tag. First-person singular "I am" takes the tag "aren't I" rather than the regular "am I not" or "amn't I" (which is non-standard). "I'm on the team, aren't I?" The irregular tag is the only common irregular form in standard tag formation.
Exception 3 — same-polarity tags in challenging or sarcastic contexts. As noted above, same-polarity tags are used in challenging contexts and are uncommon in TOEIC Link. The candidate should recognize them in listening but should not produce them in speaking unless the speaking task specifically calls for that register.
The high-band candidate has the three exceptions memorized and applies the canonical polarity reversal to all other cases without further thought.
The pronoun agreement rules and their edge cases
The pronoun agreement step is the third coordinated element and produces a small set of edge cases that high-band candidates must handle without hesitation.
Edge case 1 — indefinite pronoun subjects. Subjects like "everyone," "someone," "nobody," and "everybody" take the plural pronoun "they" in the tag in modern usage. "Everyone has signed the contract, haven't they?" The traditional singular "he" or "she" tag is now non-standard in this context.
Edge case 2 — "there" as the dummy subject. Subjects with "there" as the dummy subject take "there" again in the tag. "There is a meeting at three, isn't there?" The dummy "there" is repeated rather than replaced with a pronoun.
Edge case 3 — collective nouns. Subjects like "the team," "the committee," "the board" can take singular ("it") or plural ("they") tags depending on whether the speaker is conceptualizing the collective as a unit or as a group of individuals. "The team has won, hasn't it?" (collective as unit) or "The team have argued among themselves, haven't they?" (collective as individuals). TOEIC Link generally uses the singular conceptualization for business contexts.
Edge case 4 — questions that themselves contain tags. Recursive tag formation does not occur in TOEIC Link Listening or Speaking contexts. The candidate does not have to handle the recursive case.
Recognition in TOEIC Link Listening
The Listening recognition skill turns on three elements: hearing the tag against the prosodic background, processing the three-element coordination in real time, and inferring the speaker's intent from the tag-prosody combination.
The prosodic background matters because the tag is produced with one of two intonation contours, and the contour signals the speaker's intent. The rising tag ("...didn't it?↗") signals genuine information-seeking — the speaker does not know the answer and is asking for confirmation. The falling tag ("...didn't it.↘") signals assertion or rhetorical hedging — the speaker knows the answer and is inviting agreement rather than information. TOEIC Link Listening comprehension questions often turn on the rising-versus-falling distinction, because the speaker's intent changes the appropriate response.
The real-time processing skill requires the candidate to parse the main clause's verbal structure and the tag's three elements within the natural speech rate of one to two seconds per tag. Candidates who slow down to parse the tag often miss the next utterance in the dialogue.
The inference skill requires the candidate to combine the tag prosody, the speaker's prior turns, and the conversational context to identify the function of the tag. Functions in TOEIC Link contexts include confirmation requests in business meetings, polite assertions in customer service interactions, rhetorical hedging in feedback conversations, and softened requests in scheduling exchanges.
Formation in TOEIC Link Speaking
The Speaking formation skill turns on producing tags fluently in candidate responses without introducing the three-element coordination errors that consume score points.
The candidate should use tag questions in three Speaking response contexts. The first is hedged opinion ("Productivity tools have improved a great deal in the past few years, haven't they?"), where the tag softens the assertion and invites the rater to engage with the opinion. The second is collaborative reasoning ("The argument depends on the assumption that the market remains stable, doesn't it?"), where the tag signals reasoning rather than assertion. The third is rhetorical framing ("That's the question the policy has to answer, isn't it?"), where the tag emphasizes the rhetorical move rather than asking for information.
The candidate should avoid tag questions in three contexts. The first is direct response to a comprehension question, where the tag would signal evasion. The second is when the candidate is not confident of the main clause's verbal structure, because the structural uncertainty propagates into the tag and produces a coordination error. The third is in formal essay-style Speaking responses, where the tag is too colloquial for the register.
Three failure patterns and their repair
Three failure patterns recur in candidate-level tag formation, and each pattern has a specific repair.
Failure 1 — auxiliary mismatch. The candidate uses the wrong auxiliary in the tag — typically using "do" when the main clause has an auxiliary, or using the wrong tense of "do" when the main clause is in a tense that requires a specific form. Repair: Apply the four-case auxiliary-extraction matrix explicitly during practice until the matrix is automatic.
Failure 2 — polarity preservation. The candidate produces a same-polarity tag in a context that requires polarity reversal. Repair: Practice with positive-main-clause and negative-main-clause examples in alternation, focusing on the polarity reversal as a distinct step from the auxiliary selection.
Failure 3 — pronoun mismatch. The candidate uses "it" when the subject is plural, or "they" when the subject is singular, or fails to handle the indefinite-pronoun edge case correctly. Repair: Memorize the indefinite-pronoun rule (plural "they") and the "there"-as-dummy-subject rule, and practice with mixed-subject examples.
How tag question fluency fits into TOEIC Link prep
Tag question fluency is one of the high-leverage skills in TOEIC Link prep because the tag itself is a short structure (typically two or three syllables) but the cognitive coordination required to produce it correctly is disproportionate to its length. The candidate who can produce tags fluently signals to the rater that the candidate's grammatical processing is automatic at the clause level, not just at the word level.
The recognition skill in Listening is one of the diagnostic skills that distinguishes high-band from mid-band listeners. The candidate who can identify the function of a falling tag in a business meeting dialogue is the candidate whose listening processing has reached the inference layer, beyond the lexical and structural layers that mid-band candidates operate at.
Related EnglishBlitz resources
For more on TOEIC Link grammar at the clause and sentence level, see:
- TOEIC Link Grammar Modal Verbs — the modal-auxiliary inventory that tag questions draw from in Case 1.
- TOEIC Link Grammar Subject-Verb Agreement — the agreement rules that govern the tag's auxiliary and pronoun.
- TOEIC Link Listening Functional Language and Speech Act Recognition — the speech-act recognition framework that the tag-question function fits inside.
The tag question is one of the short structures where high-band candidates and mid-band candidates diverge cleanly. The mid-band candidate produces tags slowly, with hesitation, and with frequent coordination errors. The high-band candidate produces tags automatically, with the three elements coordinated in under a second, and with the prosody calibrated to the intended function. The deterministic three-step procedure is what closes the gap.