The Paradigm
As mentioned in the introductory section, several points of divergence have been reported in the reading literature. In this regard, we would like to emphasize the need for more flexibility in experimental paradigms, not only in the reading context but also in different subjects that occupy the attention of cognitive sciences (e.g., episodic vs. semantic memory [
35], multistable perception [
36], memory vs. perception [
37,
38,
39]). Indeed, the current paradigms have contributed significantly to a reasonably good understanding of many cognitive tasks. However, it should be mentioned that the crisis does not lie in the paradigms but in the stimuli and percepts. What we mean by this is that the question ’how do we create knowledge’ can only be answered if we use certain stimuli that are ’known and unknown at the same time’. In the present study, our paradigm proposes to put everything in the same bath. We have focused on the visual span area (conceived as a sensory limitation) [
1], which is, in turn, embodied in the perceptual span [
6,
7,
8] where high-level linguistic and contextual processing occurs. The exclusive character of our paradigm lies in the Arabic percepts used in the sentences with scrambled targets (Css). These are multistable percepts belonging to the original Arabic language and are not scrambled words, as introduced in the methodology section. As mentioned by a body of work, bi or multistable perceptual phenomena occur when an observer perceives the same stimulus differently [
40]. Indeed, the alphabet of the original Arabic language was not dotted and contained only 18 letters, and the words in the texts were neither dotted nor vowelled [
41]. By relying on the context, Arabic readers can remove the ambiguity of words [
42]. In light of this, we emphasize that using such percepts will contribute to studying context-related effects, not only in the task of reading but also in other issues that confront cognitive sciences. For example, the isolated word /
/ could be expressed in the Schrödinger sense as:
In good consistency with previous work, the first analysis on the probability correct responses for the three types of stimuli (correct, incorrect, and non-word) points to a word superiority effect [
31]. As shown in
Figure 2, words were better recognized than non-words. However, reaction time analyses show that non-words were recognized faster than words at the median (P2) and final (P3) positions (see
Figure 2 (bottom)) and contradict the proposals of previous work [
43]. Although this result represents a discrepancy, it indicate the initiation of additional processing. In addition to lexical processing, it is possible that the parallel presentation of words contributed to the initiation of higher-order processing (i.e., syntax and semantics) for correct (Cs) and incorrect (Ins) sentences and that the lack of lexical information in the case of non-word stimuli did not necessitate this kind of processing. Controversially to our postulates and work supporting the sentence superiority effect (SSE) [
23], our analyses revealed no differences in the probability of correct response and reaction times between correct (Cs) and incorrect (Ins) sentences (see
Table 2). These preliminary results run counter to primitive extraction of semantic representations [
26] or extraction of the syntactic category [
24] of words in the sentence. In this respect, we hypothesized (HP) that responses at the level of correct (Cs) and incorrect (Ins) sentences were stereotyped, and word recognition relied solely on visual and lexical information and occurred without any contribution from higher-order processing (at the sentence level). To test our hypothesis’s (HP) validity, we analyzed our data by including the sentences with the scrambled targets. This analysis suggests two results that require particular attention. The first one underlines similar performances in correct sentences(Cs) and incorrect sentences (Ins) (Estimate=0.088, SE=0.258, z=-0.343). The second indicates similar performances in the sentences with scrambled targets (Css) and non-word stimuli (Nw/baseline condition)(Estimate=- 0.2288, SE=0.1199, z=-1.908). Moreover, reaction time analyses showed no significant differences in normalized reaction(RTnorm) times between correct (Cs), incorrect (Ins) sentences and sentences with scrambled target (Css) (see
Table 3). These results thus propose the higher-order processing contribution in scrambled word recognition and underline our hypothesis’s (HP) validity.
Table 2.
Proportion correct and mean normalized reaction time analyses.
Table 2.
Proportion correct and mean normalized reaction time analyses.
Proportion Correct |
Normalized RT |
Condition |
Estimate |
SE |
z |
Estimate |
SE |
t |
(Intercept) |
1.372 |
0.110 |
12.413 |
0.6493 |
0.0340 |
21.019 |
Median |
0.710 |
0.122 |
5.799 |
-0.0503 |
0.0186 |
-3.014 |
Final |
-0.452 |
0.106 |
-4.227 |
-0.0232 |
0.0205 |
-0.623 |
Incorrect |
-0.232 |
0.120 |
-1.922 |
0.0307 |
0.0185 |
-1.047 |
Non-word |
-1.011 |
0.114 |
-8.843 |
-0.0744 |
0.0197 |
-4.439 |
Table 3.
Parameters of pairwise comparisons between stimulus types in the three positions.
Table 3.
Parameters of pairwise comparisons between stimulus types in the three positions.
Proportion Correct |
Normalized RT |
Condition |
Estimate |
SE |
z |
Estimate |
SE |
t |
Initial(P1) |
|
Nw vs Cs |
-1.098 |
0.197 |
-5.576 |
-0.0296 |
0.0609 |
-0.485 |
Nw vs Ins |
-0.726 |
0.185 |
-3.923 |
-0.0600 |
0.0706 |
-0.8491 |
Cs vs Ins |
0.372 |
0.207 |
1.801 |
-0.0304 |
0.0645 |
-0.471 |
Median(P2) |
|
Nw vs Cs |
-0.841 |
0.226 |
-3.712 |
-0.217 |
0.053 |
-4.050 |
Nw vs Ins |
-0.930 |
0.232 |
-4.005 |
-0.123 |
0.054 |
-2.255 |
Nw vs Css |
0.359 |
0.206 |
1.742 |
-0.185 |
0.062 |
-2.975 |
Cs vs Css |
-0.482 |
0.235 |
-2.051 |
0.031 |
0.062 |
0.505 |
Ins vs Css |
-0.571 |
0.240 |
-2.374 |
-0.062 |
0.064 |
-0.969 |
Cs vs Ins |
0.088 |
0.258 |
0.343 |
0.093 |
0.055 |
1.675 |
Final (P3) |
|
Nw vs Cs |
-1.052 |
0.179 |
-5.870 |
-0.1036 |
0.0566 |
-1.829 |
Nw vs Ins |
-0.767 |
0.174 |
-4.407 |
-0.1219 |
0.0566 |
-2.154 |
Cs vs Ins |
0.285 |
0.184 |
1.553 |
-0.0183 |
0.0638 |
-0.287 |
Figure 2.
Probability correct responses and mean normalized reaction time (bottom) as a function of position (P1, P2 and P3) and stimuli type (Cs, Ins and Nw). Cs, Ins and Nw correspond to correct (red line), incorrect sentence (green line), and non-word stimuli (blue line). Error bars indicate 95% confidence intervals.
Figure 2.
Probability correct responses and mean normalized reaction time (bottom) as a function of position (P1, P2 and P3) and stimuli type (Cs, Ins and Nw). Cs, Ins and Nw correspond to correct (red line), incorrect sentence (green line), and non-word stimuli (blue line). Error bars indicate 95% confidence intervals.
Figure 3.
Probability correct response (left) and mean normalized reaction time (right) at the median position (P2) for sentences with scrambled word (green bar), non-word stimuli (purple bar), correct sentences (red bar), and incorrect sentences (aqua green bar). Error bars indicate 95% confidence intervals.
Figure 3.
Probability correct response (left) and mean normalized reaction time (right) at the median position (P2) for sentences with scrambled word (green bar), non-word stimuli (purple bar), correct sentences (red bar), and incorrect sentences (aqua green bar). Error bars indicate 95% confidence intervals.
As the results show, analyses of the probabilities of correct responses suggest an effect of position on word recognition. The targets presented in the initial position were better recognized than those presented in the final position (see
Figure 2-top) [
26]. This observation can be interpreted, in the first place, by effects related to reading habits (the right-left direction for languages read from right to left) [
26]. It is possible that our participants looked at the initial position (P1) or a location between the initial position (P1) and median position (P2) (HP2). Controversially, our analyses suggest a significant difference between the initial (P1) and median (P2) positions on the probability of correct responses (see,
Table 2). If our hypothesis (HP2) were true, our participants would have a high chance of predicting the targets presented in the final position (P3) (the predictability effect) in the correct sentences (Cs). This observation support our hypothesis (HP) suggesting that our participants relied on visual and lexical information in the recognition of words in the correct sentences (Cs). Moreover, If our hypothesis (HP2) were true, we should observe, at least, a decrease in reaction time at the initial position (P1). Analysis of reaction time showed no significant difference between initial (P1) and final (P3) positions (Estimate=0.023, SE=0.038, t=0.62), which means that words were processed simultaneously. Two possible explanation for the reduced performance in the final position (P3). The first one points to poor visual attention skills in Arab readers [
3]. The second point to a deployment of visual attention extend to the beginning of sequence. Controversially to the sequential sentence word processing, our results showed no difference in the probabilities of correct responses at the median position (P2) between sentences with scrambled target stimuli (Css) and non-word stimuli (Nw) (see
Table 3). In good agreement, our finding corroborates studies’ proposals suggesting a parafoveal-on-foveal effect [
29,
30]. To identify the foveal scrambled word (P2), participants had to extract information from words in the final (P3) and initial (P1) positions. In light of this, we support the hypotheses of parallel processing and the extraction of semantic and syntactic information of multiple words at once [
26,
44]. Based on the propositions of Asano and Yokasawa [
26] suggesting primitive extraction of a semantic representation and those of Snell et al. [
44] suggesting parallel extraction of semantic information from the words of the sentence, the present results, therefore, attribute an advantage to semantic processing in the emergence of the sentence superiority effect. If the syntax has directly contributed to the emergence of the sentence superiority effect (SSE) in sentences with scrambled targets (Css), we must first reveal its effects in the correct sentences (Cs). After all, all three words in the correct sentences (Cs) were within the visual span area. In other words, although the syntactic word information (word location) was accessible [
45], no significant difference was revealed between correct and incorrect sentences. For more visibility, let us take the example of the following sentence "
". In this example, it is clear that multistable percepts cannot be read without interaction between semantic and syntactic processing. First, we need elementary units of meaning (words) to perform computations (i.e., syntax) to generate the sentence’s overall meaning. Similarly, when presenting words with dots and vowels (
/ It was said that an elephant was killed), the same interaction between the two levels (syntactic and semantic) will occur. The only difference is that the semantic information of the words in the sentence is available, which makes it easier to extract the primitive semantic representation. Paradigms using isolated words have yet to provide concrete answers to questions concerning the access code to the mental lexicon and the primacy of phonology or semantics in recognition of written words. Three streams have discussed the place of phonology in the reading task [
46]. The first one suggest direct access to the mental lexicon without recourse to phonological coding [
47]. The second proposes a phonological mediation to access the mental lexicon [
17,
18]. In contrast, dual root models [
48,
49]suggest that reading relies on two roots. The sub-lexical root calls on the grapheme-phoneme conversion system while presenting pseudowords and new words. The lexical root provides faster access to the mental lexicon, when a known word is presented. It should be noted that grapheme-phoneme conversion is costly in processing time. We put three hypotheses to the test based on our paradigm and findings. The first hypothesis (HC1) assumes the non-contribution of phonological coding in written word recognition, particularly in silent reading. Given Abu-Rabiaa’s proposal [
42], suggesting that Arabic readers rely on context to deduce words’ phonological form when reading texts, we reject the hypothesis (HC1). A second hypothesis (HC2) assumes that recognition of scrambled words relied on early activation of the semantic representation. Kintsch and Mangalath [
50] propose that word meaning results from the interaction between the various decontextualized semantic representations of words in long-term memory and the sentence’s overall meaning (semantic context). The problem with this proposal is that the orthographic and semantic representations of the percepts used in our paradigm do not exist in long-term memory. Our percepts are new stimuli for the reading system. Note that our results cannot be explained by the multi-trace memory model proposals [
37]. What do exist are orthographic, phonological, and semantic representations of percept’s states (see equation 1). Since our percepts are qualified as new stimuli (i.e., new words), a third hypothesis (HC3) inspired by dual root model proposals [
48,
49] assumes an early phonological coding. We also reject this hypothesis (HC3) in two respects: first, because our results suggest a rapid parallel processing of the words sentence. Second, the grapheme-phoneme conversion process is a slow serial process. Given the multistable nature of scrambled words and the letters that form them, any attempt at early phonological encoding would be even more costly. Rejecting hypothesis (HC3) in no way implies the validity of hypothesis (HC2). We suggest that the recognition of multi-stable Arabic percepts may reflect the co-occurrence of learning (creation of the memory trace) and retrieval. Although the present paradigm supports the interaction between perception and memory [
39], we should keep in mind that the multi-stable Arabic percepts used in the present study did not basically exist in episodic memory.