1. Introduction
In Applied Behavior Analysis (ABA), methodologies and technologies of teaching are developed to solve human issues, including the treatment of children diagnosed with autism spectrum disorder (ASD). Regarding this public in general, ABA interventions are frequently addressed to establish socially significant repertoires, which are commonly impaired in many learners [
1,
2,
3,
4,
5]. Among the variables which bolster development, visual scripts may be used to establish behavioral sequences related to daily functional activities such as getting dressed, brushing teeth, preparing a meal etc. Visual scripts may consist of pictures or phrases depicting functional actions while performing a task (e.g., grabbing a toothbrush and toothpaste, adding toothpaste on the toothbrush etc.). In a study conducted by Pierce and Schreibman [
6], visual scripts (pictures depicting sequences of actions) were used to teach three children with ASD between 6 and 9 years old to perform functionally daily skills independently. Three tasks were defined for each learner (e.g., setting the table; making lunch; washing clothes). After a baseline condition showed that the children were unable to perform the tasks, an intervention condition was implemented in three phases. First, everyone learned to discriminate as listeners pictures representing the actions of each behavioral sequence defined.
After this, the learners were taught to choose a preferred reinforcer, turn pages of a book with pictures, perform motor actions portrayed by the pictures and reinforce themselves. In the end, the presence of an interventionist faded out so that the tasks could be performed during his absence. As a result, all children were able to finish the defined tasks. Besides, there was evidence of reduction in the frequencies of disruptive behaviors (stereotypes). Generalization measures were obtained through probes involving new tasks and different environments (clinical and residential). Post-training probes also showed that the pictures were exerting control over the behaviors of two of the children (there were changes in the order of the pictures in the book during these probes) [
6]. Later, other studies involving visual scripts were also conducted to establish sequences of actions concerning functional play. Phillips and Vollmer [
7] used visual scripts in three 4-year-old children with atypical development (one with ASD and two with language delays).
Four sets of toys were organized (e.g., a barn with a horse, cow, tractor etc.). Five actions were defined for each set and were represented by pictures in a binder/book. After the participants learned prerequisites (e.g., receptively identifying pictures and turning binder pages), a baseline showed that the children could not perform the sequences of actions related to each toy set. An intervention was then implemented, and it consisted of a least-to-most hierarchy of prompts provided by an experimenter. This consisted of the following steps: 1) general instruction to perform the action depicting a given picture; 2) specific instruction to perform an action; 3) pointing to relevant objects during the provision of a specific instruction; 4) modeling a response during a specific instruction; 5) provision of physical guidance. Independent responses were differentially reinforced with praise and sometimes access to preferred items. The intervention was effective for all participants. There was evidence of generalization across new sets of toys not directly trained, and it was suggested that the pictures exerted control over the actions (through probes with changes in the order of the pictures) [
7].
Phillips et al. [
8] conducted two systematic replication experiments (involving three children with ASD between 4 and 9 years old in the first experiment and three other young learners between 7 and 19 years old, two diagnosed with ASD and one with Down Syndrome, in the second experiment) of the previous investigation by Phillips and Vollmer [
7]. However, textual instructions (phrases for oral reading) were used instead of pictures. In each experiment, four sets of toys and five different actions for each set were programmed. Each action to be emitted was represented by a textual instruction (printed phrase) that the learners had to follow. During a baseline condition, a board with numbered printed instructions were shown for carrying out the actions for each set of toys. During intervention, correct responses (performing the actions represented by the instructions) resulted in praise or access to an edible reinforcer, depending on the learner. For each action not demonstrated between 5 and 10s, an experimenter provided up to six levels of prompts following a least-to-most prompting hierarchy [
8].
The prompt levels were as follows: 1) prompt absence between 5 and 10s after the presentation of the initial instruction to perform a given behavioral sequence; 2) general vocal prompt during which the experimenter asked the learner to read the number of a given instruction; 3) specific vocal prompt through which the learner was verbally guided to perform the action; 4) gestural prompt through which the experimenter pointed to the necessary items to perform a given action, while a specific vocal instruction was provided as well; 5) proving a model, that is, the experimenter modeled a given action to be emitted at the same time as he vocally provided the specific vocal instruction; 6) physical guidance to perform the action represented by a given textual instruction [
8].
After an arbitrary learning criterion (two consecutive sessions without errors) was achieved for a given set of toys, probes with the textual instructions under a new order/sequence were conducted. If a given learner was not able to perform the actions, a new teaching condition was defined and, in this case, the textual instructions would be presented one at a time on the board (instead of all at the same time). When needed, the six level prompt hierarchy was used. As a learning criterion was achieved, new probes to assess responding with the textual instructions under a new order/sequence would be conducted. During the second experiment, more specifically, the criterion for each toy set consisted of three errorless consecutive sessions. Orienting responses to the instruction board were evoked (e.g., looking at the board, following the texts with the hand or reading the texts out loud). A test condition was defined after baseline to determine if the learners would be able to perform actions solely under control of a general vocal instruction (e.g., “do what is being said here”). Besides, for one learner, there was a teaching condition during which the textual instructions were presented twice or three times at once [
8].
As a result, regarding both experiments [
8], all learners needed to reach some training level to be able to emit all actions from each set of toys. Everyone also was able to perform the non-trained second sequence of actions after training the first sequence for each of the toy sets. The authors discussed the data as a possible indicator of generalization regarding following instructions for the same set (new order/sequence of instructions), and that the textual instructions were in fact exerting control over the actions performed by the leaners. Nevertheless, nobody showed a generalized repertoire of following textual instructions across different sets of toys. The data in this sense is different from the case by Phillips and Vollmer [
7] involving visual scripts represented by pictures, and which showed evidence of generalization across different sets of toys. It is possible that generalization data from Phillips et a. [
8] would be more robust if the participants had better reading skills. This was pointed out as a limitation and, as recommendation for future studies, it was suggested that pictorial and textual prompts should be compared in individuals with more advanced reading skills.
Using visual scripts and other variables has also been important to teach behaviors to prevent contagion from infectious diseases such as Covid-19. This may be particularly relevant for children diagnosed with ASD who commonly have difficulty performing behavioral sequences. Jezz and Dozier [
9], for example, conducted a review study on behavioral strategies to teach repertoire of washing hands. Some strategies were based on antecedent events such as, for example, instructions; modeling washing hands properly; providing vocal and visual prompts/scripts. Other strategies addressed the manipulation of consequence events such as visual feedback, error correction and differential reinforcement. Another study concerned with the prevention from diseases was conducted by Libardi et al. [
10]. They remotely (through internet) trained parents to use visual (pictures) and textual (phrases representing actions) scripts to teach their children to put on surgical masks and sustain their use for an extended period. Four children diagnosed with ASD between 4 and 10 years old participated. Initially, two baseline conditions were conducted. The first one consisted of assessing the behavior of putting the mask on.
The expected behavior chain concerned ten steps to be performed: 1) Taking the mask holding it by the elastic straps; 2) positioning the open mask in front of the face in the nose region; 3) placing the mask over the face, in the nose and mouth area; 4) Holding the right elastic strap, positioning and placing it behind the right ear; 5) holding the mask to adjust it by pulling it under the chin; 6) holding the left elastic strap, positioning and placing it behind the left ear; 7) keeping the mask on; 8) removing the mask, holding it by the elastic on the left strap; 9) storing the mask in an appropriate place or, if necessary, clean it; 10) washing hands appropriately. The second baseline consisted of assessing the time each participant kept the mask on the face and, when someone showed discomfort, the interventionist removed it from the child’s face without providing feedback [
10].
During training, visual scripts (pictures representing the steps to put on a mask and keep using it) were used to guide the children’s actions. When needed, an interventionist provided physical guidance until the learners were able to perform the behavioral sequence solely in the presence of the visual scripts (pictures). Thereafter, the interventionist gradually faded out the pictures so that the children became able to emit the behavioral sequence in the absence of any visual cue. When this was achieved in three consecutive sessions, a new intervention was applied to increase the time the children kept the masks on their faces. At this time, three types of prompts were used: 1) verbal prompt (the children counted out loud the time they kept the mask on their faces; 2) gestural prompt (an adult showed his fingers while counting out loud the time with the mask on the children’s faces; 3) differential reinforcement of other behaviors (reinforcing any appropriate behavior except a problem behavior such as removing the mask from the face). In this sense, access to preferred playful activities was made contingent to wearing the masks in an appropriate manner. Overall, the interventions conducted were effective for all children. The authors discussed that future studies should evaluate the efficacy of the procedures when children with ASD are on demand (e.g., keeping the masks on the faces during academic tasks) [
10].
Research involving scripts has also been concerned with the development of communication or, in other words, verbal behavior [
11]. The current investigation was particularly consistent with the literature that focused on the use of visual scripts consisting of pictures and/or sentences. A seminal study was conducted by Krantz and McClannahan [
12]. These authors assessed four children with ASD, and who were fluent readers, in the context of activities with groups of learners. The children remained in the same room, each accompanied by an interventionist, and had opportunities to play and verbally interact with each other. All children also understood actions related to games and fun. To help the learners in the process of communicating with each other, textual scripts were organized with statements and questions about actions in the present, past and future tenses (e.g., “did you enjoy playing on the swing yesterday?”). Baseline showed that the children were unable to vocally interact using sentences independently.
In Baseline, they rarely initiated communication with each other, although they all had some functional expressive language and, sometimes, sought out important adults spontaneously. Still during baseline, painting, drawing and coloring activities were also performed by the children. It was at this moment that each one of them was told to play and talk. During the intervention condition at first, the children had to read the scripts in the process of interacting with each other. When necessary, they were physically guided by an interventionist to pay attention to the scripts. After reading the scripts accurately, five fading steps were established by gradually removing/omitting the words from sentences from the end to the beginning. The purpose of this measure was that, at some point, the learners would be able to talk free from any script control and this was demonstrated for all children. There was also evidence of response generalization in the presence of a new communication partner, environment and materials. Performance maintenance was also demonstrated two weeks later [
12].
In a later study, Krantz and McClannahan [
13] used script fading involving pictures and short phrases to establish communication in three children diagnosed with ASD with more limited reading skills at school. Before the research, all learners only spoke in the context of answering questions or demanding access to edibles and preferred toys. In baseline, the children did not verbally communicate with a teacher in a functional manner. During intervention, textual scripts (“look” and “watch me”) to evoke the attention of a communication partner (a known teacher) were incorporated into the children’s photography activity schedules with toys. All children were fluent in performing actions depicted by pictures. After the textual scripts were faded out, the unscripted interactions with the teacher continued and generalized to different activities.
Ledbetter-Cho et al. [
14] systematically replicated the research by Krantz and McClannahan [
12]. Script fading was used in three children with ASD with the purpose of improving communication and interaction among them and with adults during a social activity (e.g., group games with toys). Sessions occurred in a university clinic. In baseline, the children obtained up to four sets of toys. Each set comprised three similar toys. When each child was provided a given set, she was told to share it with the others. After 10 s, the children were instructed to return the toys from the set and the interventionist provided a new set, repeating the procedure. In baseline, the learners hardly ever talked among them. During intervention, besides the access to toy sets for sharing as in baseline, an interventionist provided textual scripts for each child to communicate. The scripts included phrases that would be suitable for starting a game or communicating with a peer while sharing toys. When needed, the interventionist provided help to read the scripts. After the reading performance was fluent, the words from the scripts gradually faded out. As a result, all children communicated better and without the need for reading scripts.
More recently, a study on script fading involved four children with ASD between 5 and 10 years old [
15]. They were unable to describe non-verbal stimuli such as pictures through vocal verbal sentences comprising four or more words (e.g., saying “the car has four wheels” under the picture of car and instruction “tell me something about the car”) as it was shown in a baseline condition. According to a Skinnerian [
11] approach to language, the type of verbal repertoire contemplated in the research is called tact and comprises the emission of a vocal verbal response under the control of a non-verbal discriminative antecedent stimulus. Responding is maintained by an established form of attention (e.g., verbal praise), that is, a generalized conditioned reinforcer. Two of the children exhibited a wide repertoire of sentence reading, and the other two children did not. These, however, demonstrated echoic of sentences at a generalized level. During the intervention condition, for the learners who read fluently, pictures and written sentences were used so they could read. For the remaining learners, the same pictures and written sentences were used as well, but an experimenter read them out loud, so the children could repeat/echo.
When the children consistently read the sentences (or echoed them under the experimenter’s vocal model), script fading was defined in five steps. Along the steps, the words were gradually removed/omitted so, with time, all children were able to vocally emit the sentences solely under the control of pictures and the verbal instructions to describe them. This was systematically demonstrated by each learner across three different groups of five pictures. Besides, a follow up probe session was conducted one month later (after the intervention was discontinued) and all children demonstrated errorless performance. A limitation pointed out by the authors was that varied scripts were not used, that is, the sentence assigned to each picture was always the same. It was discussed that future replication studies should use varied sentences and assess how this impact learners’ performance [
15].
The literature on the use of scripts (and their fading) to teach skills to children diagnosed with ASD shows that this procedure is effective in the acquisition of important non-verbal and verbal repertoires. However, new investigations should be carried out to assess if teaching one type of repertoire using script fading may result in the emergence of another related. The current study aimed to evaluate if script fading would effectively establish a repertoire of emitting sentences with tact function in children with ASD. The emission of sentences was directly taught under the control of scenes/pictures depicting behavioral sequences such as health care routines (putting on a surgical mask and keeping using it; washing hands), and other behavioral chains (daily living skills; functional play). Also, the effects of the intervention were measured on the emission of non-verbal repertoires to which the scenes and corresponding sentences were related. If these repertoires did not emerge as a function of teaching the emission of tacts, they would also be directly taught through a least-to-most prompting hierarchy and differential reinforcement of independent performance, as it was the case of previous studies [
7,
8].
2. Materials and Methods
Participants
This study involved three children diagnosed with ASD, aged between 4 and 6 years old, as participants (P1, P2 and P3). They all had a history of receiving ABA based interventions in a clinical or residential environment. As inclusion criterion, the participants should not be able to emit descriptions of the research materials (scenes depicting behavioral sequences) consisting of sentences with four or more words. However, they already demonstrated a reading or vocal imitation (echoic) repertoire of sentences at a generalized level, as well as generalized instruction following. They were also able to select many non-verbal stimuli as listeners (e.g., by touching pictures in arrays under the control of their spoken names), and to pair many visual stimuli (by relating equal and similar objects and pictures among themselves). Children who did not follow all the requirements described so far would be excluded from the study. It is important to mention that, among the selected participants, only P2 showed a repertoire of reading words and sentences at a generalized level. P2 was also able to pair printed words and sentences to corresponding pictures/scenes.
Environment
Data collection was conducted in a structured room, with a table and three chairs, from a private clinic (under the consent of the director of the institution) and the participants’ own residences under the due authorization of the parents or other guardians. In other words, the environments were those where the children used to receive specialized interventions on ABA regardless of the research. However, data collection only took place at different times and as authorized by those responsible for the children.
Interobserver Agreement and Fidelity of Procedures
During data collection, an experimenter and a given participant sat at the table facing each other. On several occasions, the experimenter administered assessment and teaching trials of repertoires to the participant. Data was recorded on personalized record sheets with pencil and entered into an Excel spreadsheet for tabulation and analysis of the degree of agreement between the experimenter and a second observer who, occasionally, also took data. This second person, blind to the research objectives, collected data in 30% of the study regarding different stages of baseline, probes, teaching and follow up through videos recorded by the experimenter. Along several sessions, interobserver agreement (IOA) was calculated by dividing the number of trials with agreement by the total number of trials. Then the result was multiplied by 100 to obtain a percentage. The average agreement, considering the three participants, was 99% for P1, 98% for P2 and 100% for P3.
To obtain fidelity measures on the implementation of research procedures, a checklist was used by an independent observer, and it described behavioral components whose correct implementation by the experimenter represented high fidelity of the procedures (e.g., correct stimuli manipulation during assessment and intervention sessions; waiting at least 5s for the participant to emit a response in a trial; reinforcing correct participants’ responses during intervention trials; implementing correction procedures during when the participants committed errors in intervention trials). In each session, the number of components implemented correctly by the experimenter was divided by the total number of components. The result was multiplied by 100 to obtain a percentage, which was 100% during assessment and intervention sessions with each participant.
Materials
The experimenter used a personalized sheet and pencil to systematically record responses emitted by the participants in assessment and teaching trials. Non-verbal stimuli were used consisting of pictures/scenes depicting behavioral sequences to (1) putting on a mask and keeping it on the face; (2) cleaning up the hands with hand sanitizer; (3) playing functionally with Mr. Potato Head; (4) performing daily living skill of brushing teeth. For each action from the behavioral sequences, besides the picture, a printed sentence to be read or echoed by the participant (one case or the other depending on his/her repertoire) was also used.
Table 1 shows all sequences, materials/items and scripts/sentences used in the research stages.
Dependent Variables and Independent Variables
The main dependent variable (DV) consisted of the emission of sentences with tact function [
11] related to pictures from the four programmed behavioral sequences for each participating child. Another DV corresponded to the emission of related non-verbal responses portrayed by the pictures (and verbalizations) from the sequences. The independent variables (IV) corresponded to differential reinforcement of independent responses, consisting of sentences with tact function, and script fading (regarding scripts that were read by the child or modeled by an experimenter so the child could echo). One child (P2) participated in a teaching condition involving scripts for reading because he could read sentences of varying lengths fluently. The remaining children (P1 and P3) participated in a teaching condition involving scripts that were dictated to them, so they could echo.
Experimental Design
The current study used a single case research design consisting of a concurrent multiple baseline design [
3] across different behavioral sequences for each participant, as a manner to ensure experimental control of the IV(s) (script fading and differential reinforcement of independent tacts) on the main DV (vocal verbal emission of sentences with tact function). Initially, each participant was assigned to a probe session to assess the vocal verbal emission of sentences under the control of pictures from the behavioral sequences (e.g., pictures representing the step-by-step sequence of brushing teeth) and, also, a probe session to assess the emission of non-verbal repertoire of emitting the actions related to the pictures (e.g., putting toothpaste on the toothbrush in presence of the picture representing this action). Once the participants did not demonstrate these skills during the initial probes, a baseline condition (concerning more sessions to assess the emission of sentences with tact function) was administered for each of the four behavioral sequences. After stability of data representing low performance was achieved, a gradual script fading intervention condition to teach the emission of tacts was established for one of the four behavioral sequences (while baseline sessions continued for the remaining sequences).
Once the repertoire of emitting sentences with tact function was established for the first behavioral sequence in the absence of scripts (either textual or dictated scripts for reading or echoic responding, respectively, depending on the participant), maintenance training was applied, and the intervention was also administered for the next sequence (second). In parallel, baseline condition continued for the third and fourth behavioral sequences. After a child acquired the tact repertoire for the second behavioral sequence, maintenance training was applied and the intervention also commenced for the third sequence. The fourth and last sequence continued in baseline until a learning criterion of the tact repertoire was also achieved for the third sequence (then maintenance training began for the third sequence, and the intervention was also administered for the fourth sequence until acquisition of the target skill). It is worth noting that whenever the learning criterion of tact training with a given behavioral sequence was reached, a new probe session to assess the possible emergence of non-verbal repertoires related to pictures and verbalizations from the sequences was also conducted.
Procedures
The research stages are presented in
Table 2 and described thoroughly next.
First stage (P2 from the condition with textual scripts): Initial probe and baseline of emission of sentences with tact function (first repertoire) and initial probe for the emission of non-verbal related responses (second repertoire). The experimenter presented four behavioral sequences for P2. Each sequence consisted of non-verbal stimuli (pictures) representing several actions. In the presence of each picture, when the experimenter said, “tell me what is happening here”, P2 had up to 5s to make a description (e.g., saying “put toothpaste on the toothbrush” under the control of its corresponding picture). The pictures were organized in a binder (one picture per page) and P2 could already turn the pages independently. In the initial probe session with each sequence, the number of trials to respond depended on the number of pictures representing the sequence. In parallel, one initial probe session for the emission of non-verbal responses concerning the pictures from the routines (e.g., the child put toothpaste on the toothbrush after seeing the related visual stimulus) was conducted. At this moment, the experimenter provided the binder to P2, as well as all necessary items to perform the target responses. When told to perform a given behavioral sequence (e.g., “show me how you brush your teeth”), P2 had up to 30s to emit the actions from the sequence using the binder with pictures. During this period, the experimenter oriented P2 to look at each picture in the binder. After the initial probe session for the two mentioned repertoires, a baseline condition concerning the first (sentences with tact function) was administered for each behavioral sequence. The only difference compared to the initial probe was that baseline involved several sessions (until stability with low performance was demonstrated). It is worth noting that, in all probe and baseline sessions concerning the two repertoires at this stage, differential consequences for correct and incorrect responses were not provided.
Second stage (P2 from the condition with textual scripts): Teaching script reading. For each behavioral sequence, the number of trials to be delivered per training session depended on the number of pictures from the sequence. Each picture from a behavioral sequence represented a trial to respond. It is important to mention that, at this stage, each picture was accompanied by a related printed sentence and that P2 had to read out loud. In each trial from a session, the experimenter presented the verbal instruction “read” and P2 had up to 5s to read. When a correct response was emitted, verbal praise and a token were delivered (cumulative tokens were, by the end of each intervention session, exchanged for a preferred item such as a toy or video appropriate for the participant’s age. This was also applied to sessions during the next intervention stage). When an error occurred, or no response was emitted in 5s, a correction procedure was administered, that is, the experimenter read the sentence so P2 could repeat the information. The termination criterion for this stage of the study, for each behavioral sequence, consisted of two sessions without errors.
Third stage (P2 from the condition with textual scripts): Script fading (fading of textual scripts). In this stage, unlike the previous one, textual scripts gradually faded out, that is, the words from the printed sentences concerning each behavioral sequence were gradually removed (from the end to the beginning of each sentence). The number of fading steps varied between three and four steps across the sentences from the behavioral sequences. The sentences were organized in a manner that the number of words was balanced (Regarding the original words in Brazilian Portuguese). As an example of fading steps, during first step of the sequence to clean up hands with hand sanitizer, in each sentence, the last word was omitted (e.g., “Apply the hand sanitizer to your …”). In this case, P2 had to textually respond (read) to the visible information and “fill in the blank”, that is, verbalize the missing word (e.g., “hand”). When P2 verbalized what was missing, he was praised and received a token. When an error occurred, or no response was emitted in 5s, the experimenter showed the previous sentence including the missing word, so the child could read. The transition to each of the following fading steps occurred as soon as the participant did not commit errors in two consecutive sessions. Along the fading process, once P2 managed to verbalize all sentences solely under the control of the pictures, this stage was terminated.
Fourth stage (P2 from the condition with textual scripts): Additional probes to assess the emergence of the emission of non-verbal responses concerning the behavioral sequences. Throughout the research, additional probe sessions were administered to assess if non-verbal responses would emerge as a function of systematically training the emission of sentences with a tact function. The additional probe sessions were conducted in a manner which was identical to what happened during the initial probe from the first stage of the research. Differential consequences for correct and incorrect responses were not provided as well.
Fifth stage (P2 from the condition with textual scripts): Follow up (two weeks after the end of research). Two weeks after the end of the fourth stage, it was conducted a probe to assess the emission of sentences with tact function. The task was identical to the initial probe from the first stage. The research stages for the remaining participants (P1 and P3), to whom dictated scripts for the emission of echoic responses were used during the teaching of tacts, are described in detail next.
First stage (P1 and P3 from the condition with dictated scripts for the emission of echoics): Initial probe and baseline of emission of sentences with tact function and initial probe for the emission of non-verbal related responses. This stage was identical to the one conducted for P2.
Second stage (P1 and P3 from the condition with dictated scripts for the emission of echoics): Teaching the emission of echoics under the control of dictated scripts. This case was also similar to the second stage for P2. In each trial from a session, the picture also involved a related printed sentence, but this time the experimenter was the one who read the sentence so a given participant could repeat (echo) it. Up to 5s were allowed to respond. An independent response of echoing the whole dictated sentence was followed by verbal praise and a token. When an error occurred, or no response after 5s, the experimenter dictated the sentence again as a second opportunity for the learner (however, only a correct response in the first opportunity was considered to reach the learning criterion). The learning criterion for each behavioral sequence consisted of two sessions without any errors.
Third stage (P1 and P3 from the condition with dictated scripts for the emission of echoics): Script fading (fading of dictated scripts). The characteristics of this stage were similar to those from the third stage for P2 with textual scripts. In the current cases of P1 and P3, the dictated scripts gradually faded out, that is, the words from the sentences were gradually omitted (from the end to the beginning of each sentence). During the first fading step, the last word from each sentence representing a given behavioral sequence was omitted, that is, the experimenter dictated the sentence until the moment of omission of information. The participant, then, had up to 5s to “fill in the blank”, that is, to verbalize what was omitted. An independent response resulted in verbal praise and a token. When an error occurred, or no response was emitted within 5s, the experimenter vocally provided the information omitted so the learner could repeat it. The transition to each of the following fading steps occurred as soon as the participant did not commit errors in two consecutive sessions. Along the fading process, once P1 and P3 managed to verbalize all sentences solely under the control of the pictures, this stage was terminated.
Fourth stage (P1 and P3 from the condition with dictated scripts for the emission of echoics): Additional probes to assess the emergence of the emission of non-verbal responses concerning the behavioral sequences. This stage was identical to the fourth stage for P2 with textual scripts.
Fifth stage (P1 and P3 from the condition with dictated scripts for the emission of echoics): Follow up (two weeks after the end of research). This stage was identical to the fifth stage for P2 with textual scripts.
Ethical Procedures
This study is related to a project, which was approved by an ethics committee in research with humans (authorization No. 6.585.992) from Federal University of Maranhão, Brazil, São Luís-MA. The participants’ parents signed an informed consent form. The participants themselves also signed an informed consent form. The identities were kept confidential, and participation could be terminated at any time and without any harm.
3. Results
For each of the three participants (P1, P2 and P3), the data was organized in the following manner: Data concerning the performance in the repertoires of emitting sentences with tact function and related non-verbal actions under the control of pictures from the behavioral sequences. Data on non-verbal actions were only collected in probes before and after teaching the emission of sentences with tact function.
The graphs from
Figure 1,
Figure 2 and
Figure 3 were organized to represent performances of P1, P2 and P3, respectively, in the mentioned target repertoires. During intervention to teach the emission of sentences, the script fading was established for each of the four behavioral sequences based on the number of words determined for each sentence. For example, three fading steps (F1 – F3) were defined for each sentence of the first sequence (cleaning up the hands). The unfaded first sentence, for example, corresponded to “take the bottle with hand sanitizer”. The fading steps of this case were organized in the following manner: F1 – “take the bottle with …”; F2 – “take the…”; and F3 (all words from the sentence omitted).
Figure 1, below, presents the results of participant P1.
One single initial probe session for tacts was administered for each behavioral sequence. In parallel it was administered one initial probe session for related non-verbal actions. According to data shown in
Figure 1, P1 did not show any correct responses in the initial probes for both repertoires in all sequences. The same case applied to baseline (performance was stable across several sessions). During intervention with the first behavioral sequence (cleaning up the hands), five sessions involving five trials each were necessary for P1 to reach the learning criterion during the condition with unfaded scripts. When script fading commenced, P1 needed three, four, two and four sessions to achieve criterion in F1, F2, F3 and F4 fading steps, respectively. In the end, P1 was able to emit all sentences solely under the control of pictures from the first behavioral sequence.
After P1 was able to describe all pictures from the first sequence without scripts, two additional probe sessions to assess non-verbal actions (AP-NVA) were administered for each of the four behavioral sequences. P1 was not able to emit the non-verbal actions under the control of the pictures. Then, maintenance tact training of the first sequence was applied. After that, all non-verbal actions emerged for the first sequence. This data was replicated across additional sessions (maintenance tact training – probing related non-verbal actions). Independent performance concerning both tact and related non-verbal repertoires was also demonstrated in follow up sessions for the first sequence. In parallel with this, tact training with unfaded scripts (US) was applied for the second behavioral sequence (putting on the mask and keeping it on the face).
Three sessions were needed during the US condition. Across script fading steps, P1 needed three, four, nine and three sessions to achieve criterion in F1, F2, F3 and F4, respectively, for the second sequence. Then, two AP-NVA probe sessions were administered for the second, third and fourth behavioral sequences. As it may be seen in
Figure 1, P1 was able to correctly emit all non-verbal actions regarding the second sequence. Besides, this data was replicated across several additional sessions (maintenance tact training – probing related non-verbal actions). Plus, independent performance concerning both tact and non-verbal repertoires was shown in follow up sessions.
Parallel to the acquisition of the repertoires from the second sequence, tact training concerning the third behavioral sequence (functional play) commenced. Three, two, two and two sessions were needed for P1 to achieve a learning criterion in US, F1, F2 and F3, respectively. Two AP-NVA probe sessions were administered, and P2 did not show any errors. Besides, independent performance concerning the repertoires was replicated across additional sessions (maintenance tact training – probing related non-verbal actions), as well as it was demonstrated across follow up probe sessions.
Parallel to the acquisition of the repertoires from the third sequence, tact training commenced with the fourth sequence (brushing teeth). P1 needed three, two, four and five sessions to reach criterion in US, F1, F2 and F3, respectively. Two AP-NVA probe sessions were conducted, and P1 showed no errors. Besides, as it was the case of the previous sequences, independent performance in both repertoires (maintenance tact training – probing related non-verbal actions) was verified, as well as it was demonstrated across follow up probe sessions. Next,
Figure 2 shows data from P2.
One initial tact probe session (IP-TA) and one initial related non-verbal action probe session (IP-NVA) were administered. P2, for all behavioral sequences, did not show correct responses in IP-TA. Regarding IP-NVA, a low performance between one and two correct responses was shown. During baseline for tact (BL-TA), no correct responses occurred across all behavioral sequences. During tact training with the first sequence (cleaning up the hands), two, seven, three, three were needed to reach criterion in US, F1, F2 and F3, respectively. During AP-NVA probe sessions, no errors were committed regarding the first behavioral sequence. In the cases of the remaining sequences, the performance was low, ranging from one and three correct responses. Besides, still considering first sequence, independent performance for both repertoires was shown in additional probe sessions (maintenance tact training – probing non-verbal actions), as well as across follow up probes. At the same time, tact teaching was defined for the second behavioral sequence (putting on the mask and keeping it on the face).
During the teaching with the second sequence, two, six, eleven, seven and four sessions were needed to reach criterion in US, F1, F2, F3 and F4. When AP-NVA probe sessions were conducted, no errors were committed with the second sequence. Regarding the third and fourth sequences, performance was low (ranging from two to three correct responses). Still considering the second sequence, no errors occurred for the repertoires across follow up probe sessions. In parallel, tact training was established with the third behavioral sequence (functional play).
During training with the third sequence, two, three, two and three sessions were needed to achieve the learning criterion in US, F1, F2 and F3. When AP-NVA probe sessions were applied, no errors occurred with the third sequence and performance was low with the fourth sequence. Still with the third sequence, independent performance for the repertoires was demonstrated in additional sessions (maintenance tact training – probing non-verbal actions), as well as across follow up probe sessions. In parallel, tact training was established with the fourth behavioral sequence (brushing teeth).
P2 needed two, three, two, two and two sessions to achieve criterion in US, F1, F2, F3 and F4. When AP-NVA probe sessions were applied, P2 did not show errors with the fourth sequence. Independent performance with the repertoires was replicated across additional sessions (maintenance tact training – probing non-verbal actions), as well as no errors occurred across follow up probe sessions.
Figure 3, next, shows data from P3.
When one initial probe session for tact (IP-TA) and one initial probe session for related non-verbal actions (IP-NVA) were administered, P3 demonstrated between only one and two correct tact responses concerning all four behavioral sequences. Likewise, only one correct non-verbal response occurred with the fourth sequence. During baseline for tact (BL-TA), overall, performance was low for all sequences. Across many sessions, P3 did not show beyond one correct response. Close to the end of baseline with the fourth behavioral sequence, however, three correct responses were emitted in three sessions. When tact training was defined for the first behavioral sequence (cleaning up the hands), two, five, two and six sessions were needed to achieve the learning criterion in US, F1, F2 and F3, respectively.
When AP-NVA sessions were administered, no correct responses were demonstrated across all four behavioral sequences. Nevertheless, after four maintenance tact training sessions, the related non-verbal repertoires emerged in two new AP-NVA probe sessions (considering the first behavioral sequence). Thereafter, independent performance in the repertoires were demonstrated in additional probe sessions (maintenance tact training – probing non-verbal actions), as well as in follow up probe sessions. Parallel to the acquisition of repertoires concerning the first behavioral sequence, tact training with the second sequence (putting on the mask and keeping it on the face) commenced for P3. Two, five, three, six and three sessions were needed to achieve the learning criterion in US, F1, F2, F3 and F4, respectively. When AP-NVA probe sessions were applied, as can be noticed in
Figure 3, no errors were committed concerning the second behavioral sequence, but no correct responses were shown with third and fourth sequences.
Still regarding the second sequence, independent performance for both repertoires were replicated in additional sessions (maintenance tact training – probing non-verbal actions), as well as it was the case of follow up probe sessions. Parallel to the acquisition of the repertoires for the second sequence, tact training with the third sequence (functional play) commenced. Two, three, two, two and two sessions were needed to achieve criterion in US, F1, F2, F3 and F4, respectively. When two AP-NVA sessions were conducted, P3 showed no errors within the third sequence. In the case of the fourth sequence, performance was low with two correct responses. Still regarding the third sequence, independent performance in the repertoires was demonstrated across additional sessions (maintenance tact training – probing non-verbal actions), as well as it was shown across follow up probe sessions.
Parallel to the acquisition of repertoires for the third sequence, tact training with the fourth behavioral sequence (brushing teeth) commenced. P3 needed two, three, two, two and two sessions to reach criterion in US, F1, F2, F3 and F4. When two AP-NVA probe sessions were conducted, no errors occurred. Independent performance concerning the target repertoires was also replicated across follow-up probe sessions.
4. Discussion
The previous literature on procedures comprising scripts and their fading (combined with differential reinforcement of independent performance) to teach vocal verbal communication using sentences, as well as the emission of important non-verbal behavioral chains (e.g., daily functional skills such getting dressed, brushing teeth and playing functionally) in children diagnosed with ASD and other cases of learning disabilities, demonstrated that such procedures are effective in establishing the mentioned repertoires, contributing to greater autonomy and functionality in life [
6,
7,
8,
9,
10,
12].
The current research systematically replicated the study by Matos et al. [
15], so that script fading was used to establish the vocal verbal emission of sentences with tact function under the control of pictures from behavioral sequences. The procedure was effective for all three children with ASD who participated. In this sense, the results were like those from the previous study, which was considered a pioneer in this topic. Besides, the current study extended the investigation by assessing the effects of tact training on the emission of related non-verbal actions depicted by the pictures from the behavioral sequences. In probe sessions that preceded tact training to all participants, performance concerning the emission of non-verbal actions involved few or no correct responses. As the teaching of tact relations was defined for each participant, the emergence of related non-verbal actions was demonstrated across all four behavioral sequences. Two of the sequences (cleaning up the hands; putting on the mask and keeping it on the face) are especially important as preventive measures against infectious diseases such as COVID-19.
Data suggested that, for certain learners diagnosed with ASD, the acquisition of behavioral chains, regarding functional play and other functional skills (e.g., Cleaning up the hands, using mask to prevent diseases, using the bathroom etc.), may not be established solely through direct teaching with prompting hierarchy and differential reinforcement of independent performance as happened in previous studies [
6,
7,
8,
9]. In other words, teaching the emission of sentences with tact function under the control of pictures from behavioral sequences may result in the emergence of related non-verbal repertoires without direct teaching (data from this research suggested that this happened for all participants). This seems to represent what the literature defines as correspondence between verbal and non-verbal behavior, or “say-do” correspondence [
16], since the participants in the research successfully demonstrated the non-verbal actions from the four sequences in probes, as they were taught to make verbal descriptions about the pictures in the sequences.
On the other hand, a limitation of this research was that the opposite path, that is, teaching the repertoire of non-verbal actions under the control of pictures from the sequences and measuring the effects on the emergence of emitting related sentences with tact function, was not investigated. Whether correspondence between non-verbal and verbal behavior, or “do-say” correspondence, in children with ASD like those from the current study is possible or not remains an empirical question. It is important, in this sense, that future studies investigate this possibility [
16]. The literature on the topic of “do-say” correspondence has discussed that the procedure of differential reinforcement has successfully reduced the emission of inaccurate reports about past behaviors related to item interaction, toy play and academic tasks [
17,
18,
19]. In other words, the procedure may reduce the possibility of lies. In a more recent investigation [
20], researchers were concerned in developing strategies that could both strengthen the emission of honest reports by young children and reduce transgressions.
In the research [
20], the children were first taught to emit honest reports about touching or not a disallowed item. Thereafter, the transgression of touching the not permitted item was decreased by making the children pause, put the item away or move away from it and play with other toys in the environment. Three children (one typical and two diagnosed with ASD), aged between 6 and 7 years old, demonstrated an increase in the emission of honest reports and decrease in transgressions to satisfying levels to their caregivers. A future study, in which the effects of teaching behavioral sequences (through prompting hierarchy and differential reinforcement) are measured on the establishment of corresponding reports through sentences and without direct teaching, may represent an alternative way of teaching and strengthening the emission of honest reports and corresponding socially approved behaviors in children with ASD by caregivers.
Another limitation corresponded to the selection of children with ASD eligible to participate in the current study. Overall, seven children were assessed as to demonstrating the repertoires of emitting sentences with tact function and related non-verbal actions under the control of pictures from the four sequences defined. Four children were dismissed because they were able to demonstrate all target repertoires without errors in initial probe sessions. However, the remaining three children defined as participants demonstrated few or no correct responses concerning both target repertoires in initial probe session and across baseline sessions. It is important that the investigation be extended across more children with ASD in future research, as a way of testing the generality of the procedures and results. Besides, not identifying participants, who could not show the emergence of non-verbal actions through the direct teaching of related tact relations, may also be interpreted as a limitation. Therefore, future investigations could involve participants less likely to show emergent related non-verbal repertoires, making it possible to assess if direct teaching procedures such as prompting hierarchy and differential reinforcement of independent performance will be effective, as it was the case of a previous literature [
6,
7,
8,
9,
10].
A limitation similar to what happened previously in Matos et al. [
15] was that a single sentence/script was defined for each of the pictures across the behavioral sentences. Although script fading was successful in establishing the emission of sentences free from script control in both studies (the previous and the current one as well), all participants showed the same sentences based on the scripts previously used during teaching. However, the literature discusses the importance of designing procedures that may favor the emission of varied sentences during communication, and that may not be only based on trained scripts [
12,
14]. Therefore, new investigations may consider using varied sentences during script fading training, and assessing the impact on communication with varied sentences, including possible verbal utterances not directly based on previously trained scripts. A possible future systematic replication of the current research might address this issue. It will be important to assess how script fading training to establish tacts with varied sentences impact the emergence of related non-verbal repertoires. Besides, it will be important that new studies also assess generalization across new environments and communication partners. Finally, another limitation of the current study was that it did not predict participation of children with ASD with more significant deficits in verbal skills, considering, for example, tact relations. Future research may reflect on possible repertoires that can be taught as an alternative to teaching tact relations.