COINCIDENCES AND DIVERGENCES BETWEEN AUDIO TRANSCRIPTION AND TEXTUALIZATION Coincidências e divergências entre transcrição e textualização de áudios

Forensic Science has been developed, studied and practiced in many countries for decades, contributing to justice in the most different knowledge areas. From the procedures involving Forensic Science with highlight in Human Communication, we highlight video and audio content analysis, transcripts, textualizations, communicative profile analysis, graphological-technical examinations, facial identification, and establishment of a causal link among hearing and/or vocals and occupational therapy. Among these, we highlight the identification COINCIDENCES AND DIVERGENCES BETWEEN AUDIO TRANSCRIPTION AND TEXTUALIZATION


INTRODUCTION
Forensic Science has been developed, studied and practiced in many countries for decades, contributing to justice in the most different knowledge areas.From the procedures involving Forensic Science with highlight in Human Communication, we highlight video and audio content analysis, transcripts, textualizations, communicative profile analysis, graphological-technical examinations, facial identification, and establishment of a causal link among hearing and/or vocals and occupational therapy.Among these, we highlight the identification

COINCIDENCES AND DIVERGENCES BETWEEN AUDIO TRANSCRIPTION AND TEXTUALIZATION
request can accelerate the processes, benefiting the judicial power 8,17 .
The need for the partial transcription or in its entirety for the audios and, if the same ones are considered expertise or documental proof [18][19][20][21] are ongoing discussions in the high level court in the country, considering the large volume of material to be analyzed, due to technological advancement, and cases of impact on the national political and economical scenario.Therefore, for this study, we will be considering the differences and similarities in carrying out transcription and textualization procedures as a way to contribute for the choice between one of the subjects in the process or law enforcement officers.
This study aims to investigate coincidences and divergences between audio material transcription and textualization in order to check the best applicability.

METHODS
This research was duly registered with the Brazil Platform having the approval of CEP (Committee of Ethic in Research) under number 274-742.This is a retrospective study.The audio material is used as a sample comes from wiretapping records that were used to identify speakers in the same lawsuit.As this is not a comparative study, the samples retain the code of secrecy, since it does not identify any given process as well as the speakers.The researcher in charge for this study is committed to maintaining the confidentiality signing a Term of Usage Commitment and Data Disclosure.
For the making up the sample, the audios coming from the same process were submitted to random sampling statistical treatment using the R and Tinn-R 22 software.239 audios that comprised the database were first registered by time, where the lowest audio had 0.13 min and the larger one 10.12 min.30 audios were selected randomly among the 239 available.The boxplot is a chart representing the distribution of a data set based on the median and other quartiles, was used to describe the boxplot representation of the sample (Figure 1). of speakers, used in civil and criminal proceedings as judicial evidence [1][2][3] .
In Brazil, although recent, Speech Pathologists are now being inserted in specialized sectors of institutions working directly with the law, and as the Department of Justice and the Institutes of Forensic Expertise e and more States.Forensic Speech Pathology is, then, described as the interface between law and science, applying technical and scientific knowledge of human communication in judicial issues, and aiming to clarify the facts under legal interest, by using the grounds of Speech Pathology and its specialties, which include the areas related to hearing, voice, speech, orofacial motricity, oral written language 4 .
Transcription and textualization, or de-recording, as it is known in the legal environment, are commonly performed procedures in the analysis of audio arising from wiretappings 5 .Authors experienced in the subject 6,7 , as well as the commands of the Civil and Criminal Process Code, stressed out the need for the procedure to be performed by a person who holds "expertise" on a given area of knowledge, duly registered on the board of class 8 and/or, in cases involving human communication, demonstrating knowledge in the areas related to syntax, semantics, morphology, lexicology, dialectology, sociolinguistics, psycholinguistics, in addition to articulatory phonetics and acoustic phonetics [9][10][11] .The linguistic experience of the person performing it is also considered, with the proposal to build up the most relevant pieces of content, through the reproduction of discourses, intentions, situations, relationships and correlates chronologically chained 1,12 .
The de-recording of audio material can be made using the transcription, which consists in transforming in writing exactly what is being heard, keeping the phonemic content and traces of prosody; or textualization which is based on a written narrative on the speaker's communicative intents.Studies on the contribution of textualization and/or transcription in a given audio material can guide the application of justice in either procedure, depending on the nature of the process at issue [13][14][15][16] .A timely of words used (Figures 2 and 3).The choice of the parts was done by draw.Standards of Conversation Analysis proposed by Marcuschi 18 were used to carry out transcription and textualization.
Comparative analysis regarding the maintenance of the main audio content highlights was performed by a Speech Pathologist, PhD in Human Communication Disorders and with Training in Forensic Speech Pathology.The Speech Pathologist listed the main content highlights contained in the transcription and verified whether they were kept in the corresponding textualization (Figure 4).
The results were compiled in tables and subsequently analyzed statistically.We performed the statistical analysis under R environment, with Tinn R interface.We used the Wilcoxon-Mann-Whitney 22 test for comparative analyzes, with 5% (0.05) significance level.
To check the coincidences and divergences among transcription and textualization procedures we considered:   1, illustrative charts in boxplot format 2 and 3.
In the comparative analysis as for the maintenance of the main content highlights contained in the transcriptions found in the textualization, it was possible to verify that 983% of the highlights were kept, p=0.61%.These data are shown in table 2, illustrative chart boxplot 4 format.

RESULTS
The mean number of words used in of audio file transcription was 27% greater than the number of words used in textualization, p=0.52.The mean time in minutes required to perform the transcription was double the time required to perform the textualization, p=0.013.The evaluators transcribed on average 12.44 words per minute and textualized

DISCUSSION
The search for experts in the human communication field has been increasingly common in Brazil, a possible reflection on the commitment of the Brazilian judiciary in order to make justice more precise and transparent 23,24 .
In this context, there are several requests for audio transcription and textualization arising mainly from wiretappings 23,24 .It falls to the expert not only to translate the audio material for writing, but also, in some cases, to identify the speakers' voices 25,26 .This study sought to elucidate coincidences and divergences among audio transcription and textualization in order contribute with the judiciary, law enforcement officers and/or subjects of the process, in the option of requesting either procedure.
According to the data found during the analysis of the number of words used during transcription and textualization, the following data were obtained: the total number of words used in the transcription was, on average, 27% higher than in textualization.However the coefficient of variation was very high, 35%, which leads to no statistical significance of this difference.These data are shown in table 1 and illustrative chart 2. The insignificant difference indicates that the text size and thus the reading time would be about the same in both procedures, and therefore, not constituting an important factor in choosing one of the two processes 27,28 .
It is noteworthy that the act of transcribing is a procedure that is directly related to the linguistic baggage of the subject and the ability to interpret what is being heard, factors directly related to their academic grade and social-cultural level 27 .This were held between the two methods.The difference among the medians as for the number of existing highlights in both methods, was not statistically significant (P=0.61),which confirms the similarity of content between the two processes.These data are shown in table 2 and illustrative Figure 4. Different highlights may cause serious problems of understanding, preventing, sometimes, the establishment of coherence 34 .This study showed no significant difference, demonstrating that there is no damage to the preservation of contents, opting for either procedure.
Whereas transcription and textualization must retain the content of the links, to avoid changing the original message and the findings showing that the central highlight of the messages is maintained, it is possible to emphasize that textualization, due to its execution speed, becomes more feasible in content analysis of intercepted calls, providing the speed of the procedure and performing a greater number of analyses 10,28. .Thus, although many studies seek to understand the automatic speech recognition and transcription programs, the results found in this study show that speech can not be easily analyses because it involves relevant factors, such as linguistic knowledge, practice of their transcriber, time to perform the work and perception of non linguistic signs 35,36 .

CONCLUSION
According to this study's results, which aimed to verify coincidences and divergences between audio transcription and textualization, it is concluded that the similarities among the processes of transcription and textualization are related with the median number of words and the maintenance of the median number of content highlights.There was divergence as for the implementation time, significantly lower in textualization.Considering the data obtained in this study, textualization process proved to be the most suitable in audio de-recording.
research sought to minimize the intra-subject differences suggesting that the transcription and textualization tasks were to be performed by professionals with academic training and converging socio-cultural level.
Analyzing the time spent for each task, it was possible to verify that the transcriptions were performed, on average, twice as long as the textualizations, being it a statistically significant difference (P=0.01).Such data are important enough to argue that when the time factor is involved, from the time when material quality denoted not being impaired, this procedure can be used without compromising the result.These data are shown in table 1, illustrative chart 3.
The evaluators transcribed on average 12.44 words per minute and textualized 18.79 words per minute.The time for completing textualization was on average half the time needed to perform the transcription.Time is an important factor considering the need to expedite legal proceedings and minimize costs.The need for a faster procedure is real respecting the limitation of human and technical resources made available to the authorities 20 .This study shows that the choice for textualization significantly save time for audio treatments 29,30 , i.e., with the time factor being significant, the judiciary power should opt for textualization.
The slower speed in transcription is probably due to the very process that by itself requires all words to be accurately reproduced 31 .Although textualization depends on the skill and knowledge of the textualizer and Portuguese idiom domain, more swiftness was attributed to the fact of this power interpreting the contents by means of context, while keeping the highlights, without the need for literal understanding of all words.Such data corroborate other studies that classify the transcription as a complex process that involves numerous aspects such as conversation, performance time, nonverbal actions, speaker/listener relationship and physical orientation 32,33 .

1 .Figure 1 -
Figure 1 -Representativeness of sample by audio time distribution in minutes.

Figure 2 -Figure 3 -
Figure 2 -Illustrative image in boxplot format for the distribution of the data set on number of words in the transcription and number of words in textualization

Figure 4 -
Figure 4 -Illustrative chart in boxplot format on the distribution of the data set on the number of highlights in the transcription and the number of highlights preserved in the textualization