Number and Grammatical Gender Attraction in Spanish Pronouns: Evidence for a Syntactic Route to Their Features

Margaret Kandel; Claudia Pañeda; Nasimeh Bahmanian; Mercedes Martinez Bruera; Colin Phillips; Sol Lago

Introduction

An essential part of language production is transforming preverbal concepts into words. Production models propose that concepts are grammatically encoded as lemmas, which are then phonologically encoded as lexemes and finally articulated (Bock & Levelt 1994; Levelt 1989). For example, the concept of multiple exemplars of a certain type of fruit may be grammatically encoded using the lemma apple with a plural number feature, phonologically encoded as the lexeme /æplz/, and articulated as [ˈæplz].

However, concepts that can be grammatically encoded as nouns may at times appear as pronouns instead. For example, if the concept ‘multiple apples’ is contextually salient, in focus in the discourse (Schmitt, Meyer & Levelt 1999), or has recently been mentioned within a sentence, a speaker may decide to refer to it as they rather than as apples, as in Those apples are riper than the pears, aren’t they?.

Although much research has tried to determine when a speaker decides to refer to a concept using a pronoun (see Arnold & Zerkle 2019 for review), it is an open question how speakers determine the appropriate pronoun form to produce. For instance, do speakers consult conceptual and/or linguistic representations when determining pronoun form? How do speakers access the features of the relevant representations? Can non-antecedent noun phrases influence this process? Our study uses agreement attraction to investigate how speakers determine pronoun form in sentences where a pronoun is coreferential with a linguistic antecedent (e.g., the determiner phrase those apples in the example above). We consider two potential routes: a conceptual-lexical route, whereby pronoun form is chosen independently of a linguistic antecedent, and a syntactic route, whereby pronoun form is established through a matching operation with an antecedent, similar to subject–verb agreement. These routes make different predictions about whether or not other non-antecedent noun phrases can influence pronominalization and thus whether one should expect agreement attraction effects.

Two routes to determine pronoun form

In the conceptual-lexical route (referred to by Meyer & Bock 1999 as the “conceptual hypothesis”) the speaker determines pronoun form in a similar way to noun form: by activating a concept like ‘multiple apples’, which is then grammatically encoded as a lemma and phonologically encoded as a lexeme. The lemma that serves as the input for the phonological encoding stage would just be a pronoun rather than a noun. Within the conceptual-lexical route, two paths are potentially available to determine pronoun form (Figure 1). On the direct path, the pronoun’s lemma is accessed directly from the concept. This may be possible when all the features that determine pronoun form have conceptual correlates. For example, a speaker may refer to some apples with the plural pronoun they because the concept has a multiplicity feature. Alternatively, on the mediated path, the pronoun form is accessed from the concept via an associated noun in the mental lexicon (Jescheniak, Schriefers & Hantsch 2001; Meyer & Bock 1999; Schmitt, Meyer & Levelt 1999). In this case, the concept ‘multiple apples’ activates the lemma apple with a plural number feature, which in turn activates the lemma for the pronoun. This form of mediation may be required when pronoun form is influenced by grammatical features without conceptual correlates, such as grammatical gender. For example, in Spanish, the pronoun referring to a set of apples is not only plural, but also feminine, as in ¿Te las vas a comer? (‘Are you going to eat them._FEM?’). The pronoun las reflects the feminine gender of the noun manzana (‘apple’), which cannot be attributed to apples being conceptually female.

Figure 1

Summary of the two routes to determine pronoun form in production.

Note. The figure demonstrates the two routes for selecting the pronoun they in the sentence Those apples are ripe, aren’t they?. Abbreviations: “MULT” denotes the concept of multiplicity, and “pl” denotes a grammatical plural feature. In the conceptual-lexical route, the concepts APPLE and MULT provide the features for the pronoun “they”, either directly (Direct Path) or through activating the corresponding representations at the lemma level (Mediated Path). In the syntactic route, the features for the pronoun “they” are accessed from the lemma-level representations corresponding to the linguistic antecedent “apples”.

The conceptual-lexical route may be the only one available when pronouns are used deictically, without a linguistic antecedent, as when a speaker points to some apples and says, They are ripe. However, it may also apply when there is a linguistic antecedent, as in the sentence Those apples are ripe, aren’t they?, where the pronoun they co-refers with the antecedent those apples. In this case, the pronoun and antecedent would be encoded fully separately: the concept of ‘multiple apples’ would first be grammaticality encoded as apples — and phonologically encoded as /æplz/ — and later in the sentence, the concept would be grammatically encoded as they — and phonologically encoded as /ðeɪ/. In this case, the antecedent and the pronoun end up sharing grammatical and/or morpho-phonological features as a collateral effect of encoding the same concept.

In contrast, in the syntactic route (similar to Meyer & Bock’s 1999 “lexical hypothesis”), a pronoun receives its features from the antecedent rather than the concept, similar to how verbs receive their features from a sentence’s subject phrase (Figure 1). Under this route, speakers access what has been previously said in the sentence or discourse and perform a matching operation between the antecedent and the pronoun’s features. The syntactic route may be important in cases when a concept has more than one possible grammatical encoding. For instance, the concept ‘scissors’ may be referred to using the plural noun scissors or the singular noun phrase pair of scissors. In this case, how a concept has previously been referred to in a discourse is important for selecting the correct pronoun form — they or it. This route to pronoun form selection can also capture grammatical gender agreement, since the linguistic antecedent has grammatical gender. The syntactic route may be readily applied in cases of intra-sentential pronominalization, when speakers could already be monitoring the content of the sentence in order to abide by constraints on sentence formulation, such as binding constraints and avoidance of noun phrase repetition.

To summarize, a key difference between the two routes is whether pronoun form is influenced by other parts of the sentence or previous discourse (syntactic route) or not (conceptual-lexical route). Our study assesses whether the syntactic route is used during the production of pronouns with intra-sentential antecedents in Spanish, a language in which pronouns mark the number and grammatical gender of their referent. We address these questions by investigating whether pronouns are subject to agreement attraction.

Agreement attraction as a tool to arbitrate between the two routes

Observing what speakers say is usually not enough to arbitrate between the conceptual-lexical route and the syntactic route: If everything goes well during production, a pronoun’s form will be the same regardless of the route taken. However, speakers sometimes make errors, and these errors can help reveal which route was taken. Here we focus on agreement attraction errors, as in (1), where the pronoun they does not agree with the agreement controller actor but rather with the local attractor noun soap operas. In production studies, attraction effects are indexed by a higher number of agreement errors when the agreement controller and attractor have different features (e.g., the actor in the soap operas) than when they have the same features (e.g., the actor in the soap opera) (Bock, Nicol & Cutting 1999; Bock, Eberhard & Cutting 2004; Bock et al. 2006; Kandel & Phillips 2022; Kandel, Wyatt & Phillips 2024).

(1) *The actor in the soap operas rehearsed, didn’t they?

The finding that pronouns are subject to agreement attraction demonstrates that pronoun form can be influenced by elements that are neither the concept referred to by the pronoun nor its associated lemma. This finding is unexpected under the conceptual-lexical route but could arise naturally from the syntactic route, as retrieving an antecedent for feature-matching opens up an opportunity for other nouns in the sentence to interfere.

However, the interpretation that the syntactic route is supported by attraction phenomena crucially depends on the assumption that errors are driven by the presence of an attractor noun within the sentence. This is not obviously the case in most previous studies on pronoun production, which focus on number attraction (Bock, Nicol & Cutting 1999; Bock, Eberhard & Cutting 2004; Bock et al., 2006; Kandel & Phillips 2022; Kandel, Wyatt & Phillips 2024). Since grammatical number is usually associated with conceptual number (also referred to as notional number), in a sentence like (1), it is not necessary for attraction to come from the grammatical features of the attractor noun soap operas itself; rather, attraction may derive from the message-level representation of the plural concept encoded by this noun. For instance, soap operas should activate a conceptual representation of ‘multiplicity’ that could be erroneously combined with the concept ‘actor’, explaining why the pronoun appears as they. Indeed, pronouns are sensitive to conceptual number, leading speakers to produce plural pronouns to reference antecedents that are conceptually plural but grammatically singular (e.g., collectives such as fleet; Bock et al. 1999). Importantly, the conceptual properties of attractors, such as natural gender, have also been found to influence pronoun form (Slevc, Lane & Ferreira 2007). Thus, while number attraction indicates that pronoun form can be influenced by something other than the underlying concept or lemma, ruling out the conceptual-lexical route, it does not unequivocally support that the influence comes from a linguistic element, as required by the syntactic route.

One way to assess whether pronoun attraction can indeed derive from a linguistic element is to test whether pronouns show attraction from the grammatical gender of inanimate nouns, e.g., manzana._FEMININE in Spanish. Since grammatical gender doesn’t have systematic conceptual correlates (though see Boroditsky, Schmidt & Phillips 2003 and references therein for a different view), grammatical gender attraction cannot be attributed to the influence of the natural gender of other concepts in the message. Rather, grammatical gender attraction would demonstrate that pronoun form can be influenced by other parts of the sentence or previous discourse, as predicted by the syntactic route.

In support of this hypothesis, Meyer and Bock (1999) observed evidence of grammatical gender attraction with Dutch pronouns using a preamble completion paradigm. In this paradigm, participants heard a sentence containing two nouns (e.g., aardappel._COMMON ‘potato’; badpak._NEUTER ‘swimsuit’) and were shown a predicate that only matched one of them (e.g., gaar ‘cooked’). Participants were instructed to repeat the preamble sentence and add a continuation sentence with the structure [pronoun] is [predicate], which required using a gender-marked pronoun (e.g., Die._COMMONis gaar). The results showed more pronoun form errors when the two nouns in the preamble mismatched in gender than when they matched, consistent with gender attraction.

While the presence of gender attraction in Meyer and Bock’s (1999) study provides some evidence in favor of a syntactic route, the observed effects may have in part arisen as a consequence of the elicitation paradigm used. This paradigm differs from natural production in ways that could inflate errors and/or favor the use of the syntactic route over the conceptual-lexical route. First, participants must interpret the grammatically-encoded representations of both nouns in the preamble sentence (the antecedent and the attractor) in order to determine which option should be pronominalized. Requiring speakers to consult the attractor as part of the pronominalization process may have artificially raised error rates relative to natural speech when the attractor would be irrelevant to pronoun form. In addition, the paradigm involves processes related to both parsing and working memory, as participants must remember, interpret, and correctly recall the provided preamble in order to repeat it and produce a continuation. This process might lead participants to plan constituents at a different time than they normally would if they are initially focused on recalling and repeating the preamble (Kandel & Phillips 2022). Also, elicited errors could reflect misinterpretations of the preamble structure rather than an influence of the attractor noun (Ryskin et al. 2021). Crucially, in Meyer and Bock’s (1999) task, speakers did not generate a prelinguistic message that they later transformed into linguistic output. Rather, the content of the message was predetermined by the provided preamble sentence and predicate, and the majority of the linguistic structure was already provided to participants — the only part of the utterance that speakers needed to plan for themselves was the pronoun. This may have led speakers to rely less on the message-level representations of the utterances to be produced, making them more likely to use a syntactic route than a conceptual-lexical route to determine pronoun form.

Concerns about the preamble paradigm have led some researchers to adopt description tasks to elicit agreement attraction effects (see Kandel & Phillips 2022; Kandel, Wyatt & Phillips 2024 for pronoun attraction; see Kandel & Phillips 2022; Kandel, Wyatt & Phillips 2022; Nozari & Omaki 2022; Veenstra, Acheson & Meyer 2014 for verb attraction). Scene description tasks alleviate some of the concerns of the preamble paradigm that might affect pronoun production: These tasks do not provide participants with any pre-packaged linguistic material to interpret or recall, speakers generate a message themselves based on the events of the scene, and they do not need to reference the attractor noun when deciding to pronominalize. Because of this, the salience of the message-level and linguistic-level representations of the antecedent and attractor noun are likely more comparable to natural speech production than in a preamble completion task, and speakers may be less biased to use the syntactic route. In line with this, two previous studies that used a description task to test number attraction in pronouns (Kandel & Phillips 2022; Kandel, Wyatt & Phillips 2024) obtained a smaller attraction effect than observed in previous preamble studies (Bock et al. 2006; Bock, Nicol & Cutting 1999; Bock, Eberhard & Cutting 2004), in which errors more closely resembled the attraction rates observed for verbs. This is consistent with the possibility that description tasks are less likely to lead to elevated errors and/or artificially favor using the syntactic route, particularly to the extent that this involves a matching process similar to subject–verb agreement.

However, prior scene description studies only elicited pronouns in one language (English) and only tested number attraction. Therefore, more research is needed to understand how pronoun form is determined cross-linguistically and from what source pronoun attraction errors arise, which provides insight into the pronoun planning route used. First, it is unclear whether the number attraction effect would replicate in other languages, particularly in languages like Spanish or Italian, whose speakers are known to be more sensitive to conceptual number during subject–verb agreement compared to speakers of languages with more limited inflectional morphology like English (Vigliocco 1996; Vigliocco, Butterworth & Garrett 1996; Vigliocco, Butterworth & Semenza 1995). Second, as discussed above, the presence of number attraction on its own cannot provide conclusive insight into the route used to determine pronoun form, as number has conceptual correlates. Grammatical gender attraction would be a better test, as it would show that pronoun attraction errors arise due to the influence of a linguistic element, which can be parsimoniously explained by the syntactic route but not the conceptual-lexical route. It is unclear whether the grammatical gender attraction effects that have so far only been elicited with a preamble task (Meyer & Bock 1999) would persist in a scene description task that is less likely to artificially favor the syntactic route to pronoun form planning.

The present study

The present study investigates how speakers determine pronoun form, using agreement attraction effects to arbitrate between the conceptual-lexical and the syntactic routes. We elicited pronouns using a scene description task inspired by that of Kandel, Wyatt & Phillips (2024), allowing us to investigate how pronoun form is determined when the challenges for the speaker are more similar to those in natural speech than in prior preamble completion experiments (e.g., Bock, Nicol & Cutting 1999; Meyer & Bock 1999). The study comprises two experiments that tested pronoun agreement attraction in Spanish, a language with rich inflectional morphology whose pronouns bear both number and grammatical gender features.

Experiment 1 tested whether the number attraction findings observed in prior studies replicate in Spanish, allowing us to assess the reliability of the effect cross-linguistically and whether there are cross-linguistic differences in pronoun form planning. As discussed above, findings from subject–verb agreement studies suggest that speakers of languages with richer inflectional morphology than English rely more on conceptual number. If this extends to antecedent–pronoun agreement, Spanish speakers may use the conceptual-lexical route more and potentially avoid attraction. Alternatively, the findings with subject–verb agreement might not extend to pronouns — these findings have been attributed to the availability of tacit subjects in Spanish/Italian, but this may not be relevant for antecedent–pronoun dependencies. If so, other outcomes are conceivable. For instance, Spanish speakers may be more likely than English speakers to use a syntactic route to pronoun form, potentially showing more attraction: This is because Spanish pronouns not only contain features with conceptual correlates (number, natural gender) but must also agree with the grammatical gender of their antecedent (a linguistic feature).

Experiment 2 tested whether Spanish pronouns show grammatical gender attraction, providing additional insight into the source of pronoun attraction effects. If we observe both number and gender attraction effects, that would provide evidence of an influence from linguistic representations, suggesting that pronouns are planned using the syntactic route. By contrast, if we observe number attraction but no grammatical gender attraction, it is possible that pronoun attraction errors only arise due to interference at the conceptual level, since number (but not grammatical gender) has conceptual correlates. By eliciting both forms of attraction using the same paradigm within the same language, we can be confident that any differences we observe between Experiments 1 and 2 are due to the pronoun feature tested (number vs. gender) as opposed to differences between studies or languages.

Following Kandel, Wyatt and Phillips (2024), we diagnosed attraction using two dependent measures. The first was the presence of agreement errors (the typical measure used to assess attraction). The second was the duration of the post-attractor region in error-free sentences. Previous work with the scene description paradigm has shown that the time taken to produce an agreement target can provide insight into how often attraction pressures are active, even in cases when no error is made (Kandel & Phillips 2022; Kandel, Wyatt & Phillips 2022). Consequently, by assessing attraction with two different measures, we may be able to derive greater insight into how often and in what contexts the processes and pressures that underlie pronoun number and gender attraction effects are active.

Experiment 1: Number Attraction

Methods

Participants

Experiment 1 had a sample of 47 native speakers of Spanish who were born in Spain and located there at the time of testing (23 female, 23 male, 1 other). Participants had a mean age of 29 years (SD = 8.2 years) and reported no language, vision or auditory impairments. They were recruited through the online platform Prolific (www.prolific.com) and received monetary compensation for their participation. An additional nine participants completed the study but were excluded from the analysis due to poor sound quality in their recordings, technical difficulties preventing completion of the experiment, or producing pronouns in less than 30% of their responses.

Materials

Participants described videos of inanimate objects touching other objects and turning them black. This action was referred to with the made-up verb pipear (‘pipping’). This word was based on the nonce word mimmed used by Kandel et al. (2024), except that the initial phoneme was replaced by a plosive in order to facilitate onset detection in the latency analysis. We used a non-word to discourage participants from preceding the direct objects of their sentences with the differential object marking preposition a (see Von Heusinger & Kaiser 2007 for the relationship between differential object marking and the lexical semantics of verbs in Spanish). The use of differential object marking may contribute to participants’ perception of objects as animate and, relatedly, female or male (see von Heusinger & Kaiser 2003 for the connection between differential object marking and animacy, and Fábregas 2013 for an overview of differential object marking in Spanish). This was important for Experiment 2, which aimed to test whether grammatical (rather than natural) gender elicits agreement attraction.¹

The elicited sentences had the target structure: NP1 (antecedent) + verb pipear + NP2 (attractor) + above/below + pronoun. We manipulated the number of the antecedent noun (singular/plural) and whether the antecedent and attractor nouns matched in number (match/mismatch), resulting in four experimental conditions (2 match conditions and 2 mismatch conditions; Table 1).

Table 1

Example target sentences in Experiment 1.


CONDITION	TARGET SENTENCE

SS – match	El chaleco ha pipeado el candado (de) debajo de él The vest has pipped the lock below it

SP – mismatch	El chaleco ha pipeado los candados (de) debajo de él The vest has pipped the locks below it

PP – match	Los chalecos han pipeado los candados (de) debajo de ellos The vests have pipped the locks below them

PS – mismatch	Los chalecos han pipeado el candado (de) debajo de ellos The vests have pipped the lock below them

Note. The antecedent and coreferential pronoun are bolded, while the attractor is underlined. The preposition “de” before the adverb “debajo” is shown between parentheses to reflect its optional status: In pilot testing, some Spanish speakers prefered to produce it, while others didn’t (Supplemental file 1). Therefore, both utterances with and without the preposition were accepted as target responses in Experiment 1. Abbreviations: SS = singular antecedent, singular attractor, SP = singular antecedent, plural attractor, PP = plural antecedent, plural attractor, PS = plural antecedent, singular attractor.

Sixteen nouns were used as antecedents and attractors (8 masculine and 8 feminine), leading to 112 target sentences (28 per participant per condition; half with masculine nouns and half with feminine nouns; within a sentence, all nouns were the same gender). The nouns were all three syllables with stereotypical gender suffixes (-o for masculine and -a for feminine). Because the same nouns were used in Experiment 2 (which manipulated gender), care was taken to ensure that masculine and feminine nouns had similar frequency values. Based on the Subtlex-ESP database (Cuetos et al. 2011), masculine nouns had an average raw frequency per million words of 17.47 (range: 3.03–41.54) and feminine nouns of 17.01 (range: 4.52–43.07) (t(14) = –0.07, p = 0.95).²

To create videos corresponding to each target sentence, we used images corresponding to each noun from the MultiPic database (Duñabeitia et al. 2018). The images had intermediate visual complexity on a 1–5 scale (average: 1.92, range: 1.4–2.29), and there was no evidence that complexity differed across masculine and feminine nouns (masculine nouns average: 1.89, range: 1.5–2.29; feminine nouns average: 1.98, range: 1.4–2.29; t(14) = 0.62, p = 0.54). The images all had high naming consistency (average: 99%, range: 94–100%), though naming consistency was slightly higher for feminine nouns (masculine nouns average: 98%; range: 94–100%; feminine nouns average: 100%, range: 100–100%; t(14) = 2.17, p = 0.048). This difference is unlikely to have affected the study results, as participants were introduced to the target names for all images at the start of the experiment (see Procedure below).

Each video was divided into a target and an alternative display. This was done to prevent participants from planning their responses before the pipping action occurred. In each display, a central object set (consisting of one or two objects of the same type) was positioned with two identical sets of objects above and below it (Figure 2). In the target display, this central object set (corresponding to N1; the antecedent) pipped the object set above or below it (corresponding to N2; the attractor). The position of the target display (left/right) was counterbalanced across items. If the target display corresponded to a match condition, the alternative corresponded to a mismatch condition, and vice versa. This was done to avoid some trials being perceived as more difficult than others. We made sure that the position of the attractor in the target display (above or below the antecedent) was evenly distributed across trials. The nouns representing the on-screen objects always had the same gender (across both the target and alternative displays), to avoid mixing number and gender interference.

Figure 2

Example of a visual display in Experiment 1.

Note. The display corresponds to the target sentence “El chaleco ha pipeado los candados debajo de él” (‘The vest has pipped the locks below it’). A trial consisted of a 1 second preview, the pipping action, and then a 5 second response window in which participants had to describe the action.

Procedure

The experiment was conducted using PCIbex (Zehr & Schwarz 2018). Responses were recorded using the participants’ microphones. Recording started and ended automatically for each trial. Participants were told that they would see scenes from a spinoff game of Tetris called The Haunted Attic, where there were sixteen different objects (see the OSF repository for a transcript of the instructions). They were introduced to the objects and their target names, as well as to the pipping action that the objects performed when they interacted with each other, which was both described and shown in video examples. Participants were then told that their task was to describe ‘what pipped what’, and that there would be several objects in the scenes, meaning that they would need to use encima de (‘above’) and debajo de (‘below’) to make their descriptions more precise. Participants were not asked to integrate these adverbial phrases in their utterances in a specific way, and they were also not asked to use pronouns, but they were given examples that followed the structure of the sentences in Table 1. Subsequently, they completed seven practice trials. Each trial started with a one second preview, followed by the pipping action, which was accompanied by a sound. Afterwards, participants articulated their responses. The first two practice trials were not timed, and participants pressed a button after giving their response to end the trial; at the end of each trial, participants were presented with the suggested target sentence. The final five practice trials followed the same procedure as the experimental items: Participants had five seconds to articulate their response before the trial (and recording) ended automatically, and no suggested target sentences were presented. The experimental trials were pseudorandomized for each participant to prevent showing two consecutive trials with the same condition or pronoun. An experimental session lasted on average 30 minutes.

Analysis

Error analysis

The error analysis assessed whether the likelihood of pronoun number errors differed across conditions. Responses were manually transcribed and coded for whether they matched the target response and whether they contained a pronoun number error. Pronoun number errors included both unrevised and revised errors. Responses were considered non-target and excluded from analysis if they were incomplete, if the pronoun was unintelligible, or if they contained incorrect noun or verb number, lexical substitutions (shield instead of hat, or above instead of below), a pronoun gender error, or late corrections at the end of the utterance (e.g., “The hat pipped the sword below him… the shield”), with the exception of pronoun revisions. Responses were considered target if the target nouns were replaced by other nouns with similar meaning and the same gender and number of syllables (e.g., los cerrojos for los candados), if they contained a disfluency (e.g., false starts, word repetitions, etc.), or if the differential object marking preposition a preceded the attractor (e.g., a los chalecos instead of los chalecos).

All statistical analyses in the present study were performed using the package lme4 (v.1.1.35.1; Bates et al. 2015) in R (version 4.3.3, R Core Team 2024). We examined the probability of pronoun number errors with binomial (logistic) mixed effects regression. Errors were coded as 1, and accurate responses were coded as 0. The predictors were Antecedent Number (sum-coded, –0.5 singular/0.5 plural), Match (sum-coded, –0.5 match/0.5 mismatch), and their interaction, as well as a centered numeric predictor: Trial Order. The random effects structure included intercepts by subject and item and slopes for Antecedent Number, Match, and their interaction, which were the predictors of theoretical interest. The simultaneous inclusion of by-participant and by-item random effects allowed us to account for potential differences in the properties of the images and/or nouns across the experimental items. If necessary, the random effect structure was simplified to achieve convergence. We report the final structures in the tables showing the models’ output.

Latency analysis

The latency analysis assessed whether there were articulation slowdowns in utterances with correct pronoun forms across conditions. The latency analysis excluded all non-target utterances as well as utterances with disfluencies and number errors — i.e., only complete and correct responses were included in the analysis. These responses were aligned to their transcriptions at the word boundary level using the Montreal Forced Aligner (v.2.0.0; McAuliffe et al. 2017). We used the word onsets and offsets identified by the forced aligner to compute the duration of the post-attractor segment.³ The post-attractor segment duration was computed by subtracting the offset of the pronoun from the offset of the attractor noun (Figure 3). This segment was chosen because the attractor offset was the earliest point in time in which the pronoun form could be planned without interference from the planning of the two preceding noun phrases. The performance of the forced-aligner was evaluated by comparing its alignments to those of two humans and computing inter-rater agreements for word boundaries (Supplemental file 2). The interquartile range criterion was used to detect outliers (in ms), such that observations below the first quartile or above the third quartile by 1.5 times the interquartile range were excluded (Hawkins 1980).

Figure 3

Example of a target response with its division into segments.

The statistical analysis examined post-attractor segment latencies with linear mixed effects regression, using the same predictors as in the error analysis, with the addition of Syllable Count (the number of syllables of the post-attractor segment). Syllable Count was included in the model to account for the fact that plural pronouns were one syllable longer than singular pronouns. The dependent variable was the log-transformed duration of the post-attractor segment.

In both the error and latency analyses, if there was a significant interaction, the model was re-fit such that the interaction term was replaced by the nested effects of the critical factor (e.g., the effect of Match for singular and plural antecedents separately).

Results

Out of 5,264 utterances, 8.03% were incomplete (range across conditions: 4.94–10.3%). Of the remaining 4,841 complete utterances, 5.41% were excluded from the error analysis due to containing non-target responses (range across conditions: 4.41–6.02%), and 12.95% were excluded from the latency analysis due to containing non-target responses and/or number errors (range across conditions: 10.2–15.3%). No observations were excluded from the latency analysis as outliers after the application of the interquartile range criterion. Thus, 4,579 trials were entered into the error analysis, and 4,214 trials were entered into the duration analysis.

Error analysis

The distribution of number error rates across conditions is shown in Figure 4. The statistical analysis showed an effect of match (Table 2): Number errors were more likely in mismatch than in match conditions, consistent with agreement attraction. Further, there was an interaction between antecedent number and match. Nested comparisons showed that the effect of match was larger with singular antecedents (estimate = 2.230, z = 5.384, p < 0.001) than with plural antecedents (estimate = 1.074, z = 3.585, p < 0.001). This suggests that plural attractors (as in the SP condition) caused more attraction than singular attractors (as in the PS condition), consistent with a number markedness asymmetry.

Figure 4

Descriptive summary of error rates and durations of the post-attractor segment in Experiment 1.

Note. Diamonds show averages across participants in match (SS, PP) and mismatch (SP, PS) conditions. Points show by-participant averages. Abbreviations: SS = singular antecedent, singular attractor, SP = singular antecedent, plural attractor, PP = plural antecedent, plural attractor, PS = plural antecedent, singular attractor.

Table 2

Output of the Experiment 1 error analysis model.


COEFFICIENT	ESTIMATE	STANDARD ERROR	z-value	p-value

Intercept (grand mean)	–4.430	0.234	–18.939	< 0.001

Trial Order	–0.004	0.003	–1.368	0.171

Antecedent Number	0.487	0.309	1.577	0.115

Match	1.651	0.260	6.350	< 0.001

Antecedent Number × Match	–1.158	0.512	–2.259	0.024

Note. Model formula: Number Error ~ Trial Order + Antecedent Number * Match + (1 + Antecedent Number * Match || Participant) + (1 + Match || Item). A positive coefficient for Antecedent Number reflects more number errors with plural than singular antecedents. A positive coefficient for Match reflects a greater likelihood of number errors for mismatch than match conditions. The double bars in the model formula represent the removal of the correlation between random slopes and intercepts.

Latency analysis

The distribution of durations in the post-attractor segment is shown in Figure 4. The statistical analysis showed a main effect of Match (Table 3): The duration of the post-attractor segment was longer in the mismatch than in the match conditions, consistent with agreement attraction. There was also a main effect of antecedent number, with longer durations for trials with plural vs. singular antecedents. Finally, durations increased with increasing number of syllables.

Table 3

Output of the Experiment 1 latency analysis model.


COEFFICIENT	ESTIMATE	STANDARD ERROR	t-value	p-value

Intercept (grand mean)	6.684	0.013	527.406	< 0.001

Trial Order	–0.000	0.000	–2.654	0.008

Syllable Count	0.096	0.004	25.513	< 0.001

Antecedent Number	0.158	0.006	25.98	< 0.001

Match	0.015	0.004	3.941	< 0.001

Antecedent Number × Match	–0.020	0.013	–1.489	0.143

Note. Model formula: log(Duration) ~ Trial Order + Syllable Count + Antecedent Number * Match + (1 + Antecedent Number * Match | Participant) + (1 + Antecedent Number * Match | Item). A positive coefficient for Antecedent Number reflects longer durations for post-attractor segments with plural than singular antecedents. A positive coefficient for Match reflects longer durations for mismatch than match conditions.

Discussion

The results of Experiment 1 show that the number agreement attraction effect observed in prior experiments in English replicates in Spanish. We observed evidence for agreement attraction in both error and timing measures: Number agreement errors were more likely in the mismatch conditions, and speakers were slower to articulate the post-attractor region containing the pronoun in these conditions even when the correct pronoun was produced, suggesting that the process leading to errors is active on more than just the trials in which errors occur (see Kandel, Wyatt & Phillips 2022 for discussion of how to interpret attraction timing effects for verbs). The error effect showed a similar markedness asymmetry to that observed in verb attraction studies, with stronger attraction from plural attractors than singular ones (e.g., Bock et al. 2006; Bock & Miller 1991; Bock, Nicol & Cutting 1999; Eberhard 1997; Thornton & MacDonald 2003; Vigliocco & Nicol 1998).

The results illustrate that representations other than the pronoun antecedent can interfere when determining pronoun number. As discussed in the Introduction, this influence is suggestive of the use of a syntactic route to determine pronoun features. However, since number has conceptual correlates, we cannot conclusively infer that it was the representation of the attractor at the sentence-level rather than the message-level that caused attraction. Grammatical gender, which doesn’t have systematic conceptual correlates, thus serves as a stronger test of whether attraction can arise from linguistic representations other than that of the antecedent. Experiment 2 assessed the presence of grammatical gender attraction.