Blog – Saul Albert

What Conversational AI Can’t Do

April 9, 2025

Repairing the Common Ground between Conversation Analysis and Conversational Technologies

Abstract

Computational linguistics and dialogue systems research share many terms and concepts with conversation analysis, but there are some irreconcilable di5erences in how key conversational phenomena are understood and operationalised between these fields. This leads to misunderstandings (at best) and fully fledged category errors (at worst) when we attempt to collaborate across disciplines that have much to gain from closer cooperation. In this talk, I will use examples from a recent special issue of Discourse Studies (Stokoe, Albert, Buschmeier & Stommel, 2024) to identify opportunities for reconciliation and targets for future cross-disciplinary work.

References

Albert, H., Housley, W., Sikveland, R. O., & Stokoe, E. (forthcoming). The conversational action test: Detecting the artificial sociality of AI. New Media & Society.

Albert, S., & Hall, L. (2024). Distributed agency in smart homecare interactions: A conversation analytic case study. Discourse & Communication, 18(6), 892–904. https://doi.org/10.1177/17504813241267059

Albert, S., Hamann, M., & Stokoe, E. (2023). Conversational user interfaces in smart homecare interactions: A conversation analytic case study. In Proceedings of the 5th International Conference on Conversational User Interfaces (pp. 1–12). ACM. https://doi.org/10.1145/3571884.3597140

Alač, M., Gluzman, Y., Aflatoun, T., Bari, A., Jing, B., & Mozqueda, G. (2020). How everyday interactions with digital voice assistants resist a return to the individual. Evental Aesthetics, 9(1), 51.

Antaki, C., & Crompton, R. J. (2015). Conversational practices promoting a discourse of agency for adults with intellectual disabilities. Discourse & Society, 26(6), 645–661. https://doi.org/10.1177/0957926515592774

Antaki, C., & Kent, A. (2012). Telling people what to do (and, sometimes, why): Contingency, entitlement and explanation in staff requests to adults with intellectual impairments. Journal of Pragmatics, 44(6), 876–889. https://doi.org/10.1016/j.pragma.2012.03.014

Brooker, P., Dutton, W., & Mair, M. (2019). The new ghosts in the machine: ‘Pragmatist’ AI and the conceptual perils of anthropomorphic description. Ethnographic Studies, 16, 272–298. https://doi.org/10.5281/zenodo.3459327

Button, G. (Ed.). (1995). Computers, minds and conduct. Polity Press.

Cooper, S., Di Fava, A., Vivas, C., Marchionni, L., & Ferro, F. (2020). ARI: The social assistive robot and companion. In 2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN) (pp. 745–751). IEEE. https://doi.org/10.1109/RO-MAN47096.2020.9223470

Craven, A., & Potter, J. (2010). Directives: Entitlement and contingency in action. Discourse Studies, 12(4), 419–442. https://doi.org/10.1177/1461445610370126

Curl, T. S., & Drew, P. (2008). Contingency and action: A comparison of two forms of requesting. Research on Language and Social Interaction, 41(2), 129–153. https://doi.org/10.1080/08351810802028613

Dingemanse, M. (2020). Recruiting assistance and collaboration: A West-African corpus study. In S. Floyd, G. Rossi, & N. J. Enfield (Eds.), Getting others to do things: A pragmatic typology of recruitments (pp. 369–421). Language Science Press. https://doi.org/10.5281/zenodo.4018388

Dreyfus, H. L. (1972). What computers can’t do. MIT Press.

Edwards, D. (1994). Imitation and artifice in apes, humans, and machines. American Behavioral Scientist, 37(6), 754–771. https://doi.org/10.1177/0002764294037006004

Floyd, S., Rossi, G., & Enfield, N. J. (2020). Getting others to do things: A pragmatic typology of recruitments. Zenodo. https://doi.org/10.5281/zenodo.4017493

Garfinkel, H. (2021). Ethnomethodological misreading of Aron Gurwitsch on the phenomenal field. Human Studies, 44(1), 19–42. https://doi.org/10.1007/s10746-020-09566-z

Goodwin, C. (1984). Notes on story structure and the organization of participation. In J. M. Atkinson & J. Heritage (Eds.), Structures of social action: Studies in conversation analysis (pp. 225–246). Cambridge University Press.

Goodwin, C. (2007). Interactive footing. In E. Holt & R. Clift (Eds.), Reporting talk (pp. 16–46). Cambridge University Press. https://doi.org/10.1017/CBO9780511486654.003

Goodwin, C. (2017). Co-operative action. Cambridge University Press. https://doi.org/10.1017/9781139016735

Hall, L., Albert, S., & Peel, E. (2024). Doing virtual companionship with Alexa. Social Interaction. Video-Based Studies of Human Sociality, 7(3), Article 3. https://doi.org/10.7146/si.v7i3.150089

Heinemann, T. (2006). ‘Will you or can’t you?’: Displaying entitlement in interrogative requests. Journal of Pragmatics, 38(7), 1081–1104. https://doi.org/10.1016/j.pragma.2005.09.013

Ivarsson, J., & Lindwall, O. (2023). Suspicious minds: The problem of trust and conversational agents. Computer Supported Cooperative Work (CSCW). https://doi.org/10.1007/s10606-023-09465-8

Jackson, L., Haagaard, A., & Williams, R. (2022). Disability dongle. Platypus: The CASTAC Blog. https://blog.castac.org/2022/04/disability-dongle/

Jaton, F., & Sormani, P. (2023). Enabling ‘AI’? The situated production of commensurabilities. Social Studies of Science, 53(5), 625–634. https://doi.org/10.1177/03063127231194591

Jefferson, G. (1989). Letter to the editor re: Anita Pomerantz’ epilogue to the special issue on sequential organization of conversational activities, Spring 1989. Western Journal of Speech Communication, 53(Fall), 427–429.

Kendrick, K. H., & Drew, P. (2016). Recruitment: Offers, requests, and the organization of assistance in interaction. Research on Language and Social Interaction, 49(1), 1–19. https://doi.org/10.1080/08351813.2016.1126436

Liesenfeld, A., & Dingemanse, M. (2024). Interactive probes: Towards action-level evaluation for dialogue systems. Discourse & Communication. Advance online publication. https://doi.org/10.1177/17504813241267071

Mlynář, J., de Rijk, L., Liesenfeld, A., Stommel, W., & Albert, S. (2024). AI in situated action: A scoping review of ethnomethodological and conversation analytic studies. AI & SOCIETY. https://doi.org/10.1007/s00146-024-01919-x

Pino, M., & Land, V. (2022). How companions speak on patients’ behalf without undermining their autonomy: Findings from a conversation analytic study of palliative care consultations. Sociology of Health & Illness, 44(2), 395–415. https://doi.org/10.1111/1467-9566.13427

Porcheron, M., Fischer, J. E., Reeves, S., & Sharples, S. (2018). Voice interfaces in everyday life. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (pp. 1–12). ACM. https://doi.org/10.1145/3173574.3174214

Rudaz, D., & Licoppe, C. (2024). ‘Playing the robot’s advocate’: Bystanders’ descriptions of a robot’s conduct in public settings. Discourse & Communication. Advance online publication. https://doi.org/10.1177/17504813241271481

Schütz, A. (2007). The phenomenology of the social world (1932). In Contemporary sociological theory (2nd ed., p. 32). [Original work published 1932]

Stokoe, E., Sikveland, R. O., Albert, S., Hamann, M., & Housley, W. (2020). Can humans simulate talking like other humans? Comparing simulated clients to real customers in service inquiries. Discourse Studies, 22(1), 87–109. https://doi.org/10.1177/1461445619887537

Suchman, L. (2023). The uncontroversial ‘thingness’ of AI. Big Data & Society, 10(2), 20539517231206794. https://doi.org/10.1177/20539517231206794

What Conversational AI Can’t Do Read More »

What ‘counts’ as explanation in social interaction?

November 6, 2023

Saul Albert∗, Hendrik Buschmeier, Katharina Cyra, Christiane Even, Magnus Hamann, Jakub Mlynář, Hannah Pelikan, Martin Porcheron, Stuart Reeves, Philippe Sormani & Sylvaine Tuncer†

Citation: Albert, S., Buschmeier, H. Cyra, K., Even, C., Hamann, M., Licoppe, C., Mlynář, J., Pelikan, H., Porcheron, M., Reeves, S., Rudaz, D., Sormani, P., Tuncer, S. (2023, November 6-7). What ‘counts’ as an explanation in social interaction? 2^ndTRR 318 Conference Measuring Understanding, University of Paderborn, Paderborn, Germany.

Background

Measuring explainability in explainable AI (X-AI) usually involves technical methods for evaluating and auditing automated decision-making processes to highlight and eliminate potential sources of bias. By contrast, human practices of explaining usually involve doing explanation as a social action (Miller, 2019). X-AI’s transparent machine learning models can help to explain the proprietary ‘black boxes’ often used by high-stakes decision support systems in legal, financial, or diagnostic contexts (Rudin, 2019). However, as Rohlfing et al. (2021) point out, effective explanations (however technically accurate they may be), always involve processes of co-construction and mutual comprehension. Explanations usually involve at least two parties: the system and the user interacting with the system at a particular point in time, and ongoing contributions from both explainer and explainee are required. Without accommodating action, X-AI models appear to offer context-free, one-size-fits-all technical solutions that may not satisfy users’ expectations as to what constitutes a proper explanation.

What counts as an explanation?

If we accept that explanations are not simply stand-alone statements of causal relation, it can be hard to identify what should ‘count’ as an explanation in interaction (Ingram, Andrews, and Pitt, 2019). Research into explanation in ordinary human conversation has shown that explanations can be achieved through various practices tied to the local context of production cf. Schegloff, 1997. Moreover, explanations do not just appear anywhere in an interaction, but they are recurrently produced as responsive actions to fit an interactional ‘slot’ where someone has been called to account for something (Antaki, 1996). Sometimes explanations may also be produced as ‘initial’ moves in a sequence of action. In such cases, they are often designed to anticipate resistance and deal with. e.g., the routine contingencies that people cite when refusing to comply with an instruction (Antaki and Kent, 2012). Explanations as actions also perform and ‘talk into being’ social and institutional relationships such as doctor/patient, or teacher/student (Heritage and Clayman, 2010). Explainable AI systems, in this sense, become ‘accountable’ or ’transparent’ through their social uses (Button, 2003; Ehsan et al., 2021). We draw on concepts of explanation from Ethnomethodology and Conversation Analysis (Garfinkel, 1967; Garfinkel, 2002; Sacks, Schegloff, and Jefferson, 1974), Discursive Psychology (Edwards and Potter, 1992; Wiggins, 2016), and cognate fields like Distributed Cognition (Hutchins, 1995), and Enactivism (Di Paolo, Cuffari, and De Jaegher, 2018) to outline an empirical approach to explanation as a context-sensitive situated social practice (Suchman, 1987).

Explanations as joint actions Shared understanding is co-constructed through the achievement of coordinated social action (see e.g., Clark, 1996; Linell, 2009; Goodwin, 2017). Necessary and sufficient explanations cannot, therefore, be predefined by AI designers. Instead, explanation may be achieved through the achievement of joint actions with an AI in a specific context. If a system displays its capabilities in ways that match users’ expectations, they may achieve explanation (as contingently shared understanding) for all present intents and purposes. Explanation in this sense can never be considered complete – it could always be elaborated (cf. Garfinkel, 1967, pp. 73–75). While similar situations would involve predictability and regularity, this concept of explanation requires that participants jointly establish the relevant criteria and form for sufficient explanation with reference to the tasks and present purposes at hand, such as formulations of examples (Lee and Mlynář, 2023).

The self-explanatory nature of the social world Explanations are not only explicitly formulated, but are an inherent feature of the social world. Even without giving an explicit explanation, the design of an object provides ‘implicit’ explanations. Gibson (1979)’s concept of affordances, often used in system development, highlights that specific design features make specific actions relevant, e.g., a button that should be pressed, a lever that should be pulled (Norman, 1990). Situated social actions are also inherently recognisable (Levinson, 2013), even when mediated through AI such as in autonomous driving systems (Stayton, 2020). A slowly driving car will be recognised as not from the area (see, e.g., (Stayton, 2020)) and moving in certain ways can be recognised as, e.g., giving way to a pedestrian (Moore et al., 2019; Haddington and Rauniomaa, 2014). We could harness self-explanatory visibility (Nielsen, 1994) to design AI behaviours that are recognisable as specific social actions.

Miscommunication as explanation The practices of repair – the methods we use to recognise and deal with miscommunication (problems of speaking, hearing and understanding), as they occur in everyday interaction (Schegloff, Jefferson, and Sacks, 1977) – constitute pragmatic forms of explanation when they allow us to identify and resolve breakdowns of mutual understanding. For example, when someone says “huh?” in response to a ‘trouble source’ turn in spoken conversation, the speaker usually repeats the entire prior turn (Dingemanse, Torreira, and Enfield, 2013), whereas if the recipient had said “where?”, their response might only have solicited a repeat or reformulation of only the misheard place reference. These practices range from from tacit displays of uncertainty to explicit requests for clarification that solicit fully formed explanations and accounts (Raymond and Sidnell, 2019). These methods for real-time resolution of mutual (mis)understanding allows us, at least, to proceed with joint action in ways that establish and uphold explanation in action.

Explainability in action

This paper will use Conversation Analysis to examine episodes of Human-AI interactions, from a wide range of everyday interactional settings, and involving different technologies, user groups, and task orientations. Rather than attempting to establish a systematic or generalisable metric for explainability across interactional settings, the aim here is to encourage an extension of – and critical reflections on – our technical conceptualisation of explanation in X-AI.

References

Antaki, C. (1994). Explaining and Arguing: The Social Organization of Accounts. Sage.

Antaki, Charles (1996). “Explanation slots as resources in interaction”. In: British Journal of Social Psychology 35, pp. 415–432. doi: 10.1111/j.2044-8309.1996.tb01105.x.

Antaki, Charles and Alexandra Kent (2012). “Telling people what to do (and, sometimes, why): Contingency, entitlement and explanation in staff requests to adults with intellectual impairments”. In: Journal of Pragmatics 44, pp. 876–889. doi: 10.1016/j.pragma.2012.03.014.

Button, Graham (2003). “Studies of work in Human-Computer Interaction”. In: HCI Models, Theories, and Frameworks. Toward a Multidisciplinary Science. Ed. by John M. Carroll. San Francisco, CA, USA: Morgan Kaufmann, pp. 357–380. doi: 10.1016/b978-155860808-5/50013-7.

Clark, Herbert H. (1996). Using Language. Cambridge, UK: Cambridge University Press. doi: 10.1017/CBO9780511620539.

Depperman, A., & Haugh, M. (Eds.). (2021). Action ascription in interaction. Cambridge University Press.

Di Paolo, Ezequiel A., Elena Clare Cuffari, and Hanne De Jaegher (2018). Linguistic Bodies The Continuity between Life and Language. Cambridge, MA, USA: The MIT Press. doi: 10.7551/mitpress/11244.001.0001.

Dingemanse, Mark, Francisco Torreira, and Nick J. Enfield (2013). “Is “huh?” a universal word? Conversational infrastruc- ture and the convergent evolution of linugistic items”. In: PLoS ONE 8, e13636. doi: 10.1371/journal.pone.0078273.

Edwards, Derek and Jonathan Potter (1992). Discursive Psychology. London, UK: Sage.

Ehsan, Upol et al. (2021). “Expanding explainability: Towards social Transparency in AI systems”. In: Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. Yokohama, Japan. doi: 10.1145/3411764.3445188.

Garfinkel, Harold (1967). Studies in Ethnomethodology. Englewood Cliffs, NJ, USA: Prentice Hall.

Garfinkel, Harold (2002). Ethnomethodology’s program: Working out Durkheim’s aphorism. Lanham, Boulder, New York, Oxford: Rowman & Littlefield Publishers.

Garfinkel, H. (2021). Ethnomethodological Misreading of Aron Gurwitsch on the Phenomenal Field. Human Studies, 44(1), 19–42. https://doi.org/10.1007/s10746-020-09566-z

Gibson, James J. (1979). The Ecological Approach to Visual Perception. New York, NY, USA: Psychology Press. doi:10.4324/9781315740218.

Gill, V. T., & Maynard, D. W. (2006). Explaining illness: Patients’ proposals and physicians’ responses. In D. W. Maynard & J. Heritage (Eds.), Communication in Medical Care: Interaction between Primary Care Physicians and Patients (pp. 115–150). Cambridge University Press. https://doi.org/10.1017/CBO9780511607172.007

Goodwin, Charles (2017). Co-Operative Action. New York, NY, USA: Cambridge University Press. doi: 10 . 1017 / 9781139016735.

Haddington, Pentti and Mirka Rauniomaa (2014). “Interaction between road users”. In: Space and Culture 17, pp. 176–190. doi: 10.1177/1206331213508498.

Heller, V. (2016). Meanings at hand: Coordinating semiotic resources in explaining mathematical terms in classroom discourse. Classroom Discourse, 7(3), 253–275. https://doi.org/10.1080/19463014.2016.1207551

Heritage, J. (1988). Explanations as accounts: A conversation analytic perspective. In C. Antaki (Ed.), Analysing Everyday Explanation: A Casebook of Methods (pp. 127–144). Sage Publications.

Heritage, John and Steven Clayman (2010). Talk in Action. Interactions, Identities, and Institutions. Chichester, UK: Wiley. doi: 10.1002/9781444318135.

Hindmarsh, J., Reynolds, P., & Dunne, S. (2011). Exhibiting understanding: The body in apprenticeship. Journal of Pragmatics, 43(2), 489–503. https://doi.org/10.1016/j.pragma.2009.09.008

Hutchins, Edwins (1995). Cognition in The Wild. Cambridge, MA, USA: The MIT Press.

Ingram, J., Nick Andrews, and Andrea Pitt (2019). “When students offer explanations without the teacher explicitly asking them to”. In: Educational Studies in Mathematics 101, pp. 51–66. doi: 10.1007/s10649-018-9873-9.

Lee, Yeji and Jakub Mlynář (2023). ““For example” formulations and the interactional work of exemplification”. In: Human Studies. doi: 10.1007/s10746-023-09665-7.

Levinson, Stephen C. (2013). “Action formation and ascription”. In: The Handbook of Conversation Analysis. Ed. by Jack Sidnell and Tanya Stivers. Chichester, UK: Wiley-Blackwell, pp. 101–130. doi: https://doi.org/10.1002/ 9781118325001.ch6.

Linell, P (2009). Rethinking language, mind, and world dialogically: Interactional and contextual theories of human sense- making. Charlotte: Information Age Publishing inc.

Miller, Tim (2019). “Explanation in Artificial Intelligence: Insights from the social sciences”. In: Artificial Intelligence 267, pp. 1–38. doi: 10.1016/j.artint.2018.07.007.

Moore, Dylan et al. (2019). “The Case for Implicit External Human-Machine Interfaces for Autonomous Vehicles”. In: Proceedings of the 11th International Conference on Automotive User Interfaces and Interactive Vehicular Applications. Utrecht, The Netherlands, pp. 295–307. doi: 10.1145/3342197.3345320.

Nielsen, Jakob (1994). “Enhancing the explanatory power of usability heuristics”. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. Boston, MA, USA, pp. 152–158. doi: 10.1145/191666.191729.

Norman, Donald A. (1990). The Design of Everyday Things. New York, NY, USA: Basic Books.

Raymond, Geoffrey and Jack Sidnell (2019). “Interaction at the boundaries of a world known in common: initiating repair with “What do you nean?”” In: Research on Language and Social Interaction 52, pp. 177–192. doi: 10.1080/08351813. 2019.1608100.

Robinson, J. D. (2016). Accountability in Social Interaction. In J. D. Robinson (Ed.), Accountability in Social Interaction (pp. 1–44). Oxford University Press (OUP). https://doi.org/10.1093/acprof:oso/9780190210557.003.0001

Rohlfing, Katharina et al. (2021). “Explanation as a social practice: Toward a conceptual framework for the social design of AI systems”. In: IEEE Transactions on Cognitive and Developmental Systems 13, pp. 717–728. doi: 10.1109/TCDS. 2020.3044366.

Rudin, Cynthia (2019). “Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead”. In: Nature Machine Intelligence 1, pp. 206–215. doi: 10.1038/s42256-019-0048-x.

Sacks, Harvey, Emanuel A. Schegloff, and Gail Jefferson (1974). “A simplest systematics for the organization of turn-taking for conversation”. In: Language 50, pp. 696–735.

Schegloff, Emanuel A. (1997). “Whose Text? Whose Context?” In: Discourse & Society 8, pp. 165–187. doi: 10.1177/ 0957926597008002002.

Schegloff, Emanuel A., Gail Jefferson, and Harvey Sacks (1977). “The preference for self-correction in the organization of repair in conversation”. In: Language 53, pp. 361–382.

Sidnell, J. (2012). Declaratives, Questioning, Defeasibility. Research on Language & Social Interaction, 45(1), 53–60. https://doi.org/10.1080/08351813.2012.646686

Stayton, Erik Lee (2020). “Humanizing Autonomy: Social scientists’ and engineers’ futures for robotic cars”. PhD thesis.

Cambridge, MA, USA: Massachusetts Institute of Technology. doi: 1721.1/129050.

Suchman, Lucy A. (1987). Plans and Situated Actions: The Problem of Human-Machine Communication. Cambridge, UK: Cambridge University Press.

Wiggins, Sally (2016). Discursive Psychology. Theory, Method and Applications. Sage. doi: 10.4135/9781473983335.

∗Corresponding author: s.b.albert@lboro.ac.uk. Order of authors is alphabetical.

†SA, MH: Loughborough University; HB: Bielefeld University; KC: University of Duisburg-Essen; CE: Ruprecht Karl University of Heidelberg; JM:HES-SO Valais-Wallis; HP: Linköping University; MP: Swansea University; SR: University of Nottingham; PS: University of Lausanne; ST: King’s College

What ‘counts’ as explanation in social interaction? Read More »

The Atypical Interactant in a Smart Homecare Participation Framework

July 14, 2023

This paper contributes to research on the role of technology in ‘atypical’ interaction by examining a situation in which the technology takes on the stigma of atypicality. Building on our analysis, we argue that this approach provides a model for assistive technology research and development that moves away from a techno-medical model and focuses on how typicality (and atypicality) are achieved interactionally.

References

Alač, M., Gluzman, Y., Aflatoun, T., Bari, A., Jing, B., & Mozqueda, G. (2020). How Everyday Interactions with Digital Voice Assistants Resist a Return to the Individual. Evental Aesthetics, 9(1), 51.
Albert, S., Hamann, M., & Stokoe, E. (2023). Conversational User Interfaces in Smart Homecare Interactions: A Conversation Analytic Case Study. In ACM conference on Conversational User Interfaces (CUI ’23),
July 19–21, 2023, Eindhoven, Netherlands. ACM, New York, NY, USA 12 https://doi.org/10.1145/3571884.3597140
Amazon Echo (Director). (2019). Amazon Echo & Alexa—Morning Ritual (60s). https://www.youtube.com/watch?v=rHsO-rXrLLo
Amazon Echo (Director). (2019). Amazon Alexa: Sharing is Caring. https://www.youtube.com/watch?v=225Wlg3pkdo
Antaki, C., & Wilkinson, R. (2012). Conversation Analysis and the Study of Atypical Populations. In The Handbook of Conversation Analysis (pp. 533–550). John Wiley & Sons, Ltd. https://doi.org/10.1002/9781118325001.ch26
Barnes, S., & Bloch, S. (2020). Communication disorders, enchrony, and other-participation in repair. Clinical Linguistics & Phonetics, 34(10–11), 887–893. https://doi.org/10.1080/02699206.2020.1749886
Bedaf, S., Gelderblom, G. J., de Witte, L., Syrdal, D., Lehmann, H., Amirabdollahian, F., Dautenhahn, K., & Hewson, D. (2013). Selecting services for a service robot: Evaluating the problematic activities threatening the independence of elderly persons. 2013 IEEE 13th International Conference on Rehabilitation Robotics (ICORR), 1–6. https://doi.org/10.1109/ICORR.2013.6650458
Bottema-Beutel, K., Kapp, S. K., Lester, J. N., Sasson, N. J., & Hand, B. N. (2021). Avoiding Ableist Language: Suggestions for Autism Researchers. Autism in Adulthood, 3(1), 18–29. https://doi.org/10.1089/aut.2020.0014
Ekberg, K., Hickson, L., & Lind, C. (2020). Practices of Negotiating Responsibility for Troubles in Interaction Involving People with Hearing Impairment. In R. Wilkinson, J. P. Rae, & G. Rasmussen (Eds.), Atypical Interaction: The Impact of Communicative Impairments within Everyday Talk (pp. 409–433). Springer International Publishing. https://doi.org/10.1007/978-3-030-28799-3_14
García-Soler, Á., Facal, D., Díaz-Orueta, U., Pigini, L., Blasi, L., & Qiu, R. (2018). Inclusion of service robots in the daily lives of frail older users: A step-by-step definition procedure on users’ requirements. Archives of Gerontology and Geriatrics, 74, 191–196. https://doi.org/10.1016/j.archger.2017.10.024
Goodwin, C. (2007). Interactive footing. In E. Holt & R. Clift (Eds.), Reporting Talk (pp. 16–46). Cambridge University Press. https://doi.org/10.1017/CBO9780511486654.003
Jackson, L., Haagaard, A., & Williams, R. (2022). Disability Dongle | Platypus. https://blog.castac.org/2022/04/disability-dongle/
Kachouie, R., Sedighadeli, S., Khosla, R., & Chu, M.-T. (2014). Socially Assistive Robots in Elderly Care: A Mixed-Method Systematic Literature Review. International Journal of Human–Computer Interaction, 30(5), 369–393. https://doi.org/10.1080/10447318.2013.873278
Kendrick, K. H., & Drew, P. (2015). Recruitment: Offers, Requests, and the Organization of Assistance in Interaction. Research on Language & Social Interaction, 49(1), 1–19. https://doi.org/10.1080/08351813.2016.1126436
Maguire, D., Honeyman, M., Fenney, D., & Jabbal, J. (2021). Shaping the future of digital technology in health and social care. The King’s Fund. https://www.kingsfund.org.uk/publications/future-digital-technology-health-social-care
Porcheron, M., Fischer, J. E., Reeves, S., & Sharples, S. (2018). Voice Interfaces in Everyday Life. Proceedings of the 2018 ACM Conference on Human Factors in Computing Systems (CHI’18). https://doi.org/10.1145/3173574.3174214
Robinson, J. D. (2006). Managing Trouble Responsibility and Relationships During Conversational Repair. Communication Monographs, 73(2), 137–161. https://doi.org/10.1080/03637750600581206
Sacks, H. (1984). On doing ‘being ordinary’. In J. Heritage & J. M. Atkinson (Eds.), Structures of social action: Studies in conversation analysis (pp. 413–429). Cambridge University Press.
Scherer, M. J. (2020). It is time for the biopsychosocialtech model. Disability and Rehabilitation: Assistive Technology, 15(4), 363–364. https://doi.org/10.1080/17483107.2020.1752319
Tuisku, O., Pekkarinen, S., Hennala, L., & Melkas, H. (2018). “Robots do not replace a nurse with a beating heart”: The publicity around a robotic innovation in elderly care. Information Technology & People, 32(1), 47–67. https://doi.org/10.1108/ITP-06-2018-0277
White, G. W., Lloyd Simpson, J., Gonda, C., Ravesloot, C., & Coble, Z. (2010). Moving from Independence to Interdependence: A Conceptual Model for Better Understanding Community Participation of Centers for Independent Living Consumers. Journal of Disability Policy Studies, 20(4), 233–240. https://doi.org/10.1177/1044207309350561
Wilkinson, R. (2019). Atypical Interaction: Conversation Analysis and Communicative Impairments. Research on Language and Social Interaction, 52(3), 281–299. https://doi.org/10.1080/08351813.2019.1631045
Wright, J. (2019). Robots vs migrants? Reconfiguring the future of Japanese institutional eldercare. Critical Asian Studies, 51(3), 331–354. https://doi.org/10.1080/14672715.2019.1612765
Wright, J. (2023). Robots won’t save Japan: An ethnography of eldercare automation. ILR Press, an imprint of Cornell University Press.

The Atypical Interactant in a Smart Homecare Participation Framework Read More »

Conversational User Interfaces in Smart Homecare Interactions: A Conversation Analytic Case Study

October 20, 2021

Saul Albert, Magnus Hamann, Elizabeth Stokoe

Abstract:

Policymakers are increasingly interested in using virtual assistants to augment social care services in the context of a demographic ageing crisis. At the same time, technology companies are market- ing conversational user interfaces (CUIs) and smart home systems as assistive technologies for elderly and disabled people. However, we know relatively little about how today’s commercially available CUIs are used to assist in everyday homecare activities, or how care service users and human care assistants interpret and adapt these technologies in practice. Here we report on a longitudinal conversation analytic case study to identify, describe, and share how CUIs can be used as assistive conversational agents in practice. The analysis reveals that, while CUIs can augment and support new capabilities in a homecare environment, they cannot replace the delicate interactional work of human care assistants. We ar- gue that CUI design is= best inspired and underpinned by a better understanding of the joint coordination of homecare activities

References

Alač, M., Gluzman, Y., Aflatoun, T., Bari, A., Jing, B., & Mozqueda, G. (2020). How Everyday Interactions with Digital Voice Assistants Resist a Return to the Individual. Evental Aesthetics, 9(1), 51.

Albert, S., & Hamann, M. (2021). Putting wake words to bed: We speak wake words with systematically varied prosody, but CUIs don’t listen. CUI 2021 – 3rd Conference on Conversational User Interfaces, 1–5. https://doi.org/10.1145/3469595.3469608

Amazon Echo. (2019). Amazon Alexa: Sharing is Caring. https://www.youtube.com/watch?v=225Wlg3pkdo

Archibald, M. M., & Barnard, A. (2018). Futurism in nursing: Technology, robotics and the fundamentals of care. Journal of Clinical Nursing, 27(11–12), 2473–2480. https://doi.org/10.1111/jocn.14081

Bedaf, S., Gelderblom, G. J., de Witte, L., Syrdal, D., Lehmann, H., Amirabdollahian, F., Dautenhahn, K., & Hewson, D. (2013). Selecting services for a service robot: Evaluating the problematic activities threatening the independence of elderly persons. 2013 IEEE 13th International Conference on Rehabilitation Robotics (ICORR), 1–6. https://doi.org/10.1109/ICORR.2013.6650458

Casey, D., Felzmann, H., Pegman, G., Kouroupetroglou, C., Murphy, K., Koumpis, A., & Whelan, S. (2016). What People with Dementia Want: Designing MARIO an Acceptable Robot Companion. In K. Miesenberger, C. Bühler, & P. Penaz (Eds.), Computers Helping People with Special Needs (pp. 318–325). Springer International Publishing. https://doi.org/10.1007/978-3-319-41264-1_44

Chappell, N. L., Dlitt, B. H., Hollander, M. J., Miller, J. A., & McWilliam, C. (2004). Comparative Costs of Home Care and Residential Care. The Gerontologist, 44(3), 389–400. https://doi.org/10.1093/geront/44.3.389

Dowling, S., Williams, V., Webb, J., Gall, M., & Worrall, D. (2019). Managing relational autonomy in interactions: People with intellectual disabilities. Journal of Applied Research in Intellectual Disabilities, 32(5), 1058–1066. https://doi.org/10.1111/jar.12595

García-Soler, Á., Facal, D., Díaz-Orueta, U., Pigini, L., Blasi, L., & Qiu, R. (2018). Inclusion of service robots in the daily lives of frail older users: A step-by-step definition procedure on users’ requirements. Archives of Gerontology and Geriatrics, 74, 191–196. https://doi.org/10.1016/j.archger.2017.10.024

Goodwin, C. (2000). Action and embodiment within situated human interaction. Journal of Pragmatics, 32(10), 1489–1522. https://doi.org/10.1016/S0378-2166(99)00096-X

Harmo, P., Taipalus, T., Knuuttila, J., Vallet, J., & Halme, A. (2005). Needs and solutions—Home automation and service robots for the elderly and disabled. 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 3201–3206. https://doi.org/10.1109/IROS.2005.1545387

House of Lords. (2021). Ageing: Science, Technology and Healthy Living (p. 132). House of Lords Science and Technology Select Committee. https://publications.parliament.uk/pa/ld5801/ldselect/ldsctech/183/183.pdf

Jackson, L., Haagaard, A., & Williams, R. (2022). Disability Dongle | Platypus. https://blog.castac.org/2022/04/disability-dongle/

Kachouie, R., Sedighadeli, S., Khosla, R., & Chu, M.-T. (2014). Socially Assistive Robots in Elderly Care: A Mixed-Method Systematic Literature Review. International Journal of Human–Computer Interaction, 30(5), 369–393. https://doi.org/10.1080/10447318.2013.873278

Kendrick, K. H., & Drew, P. (2016). Recruitment: Offers, Requests, and the Organization of Assistance in Interaction. Research on Language and Social Interaction, 49(1), 1–19. https://doi.org/10.1080/08351813.2016.1126436

Kingston, A., Comas-Herrera, A., & Jagger, C. (2018). Forecasting the care needs of the older population in England over the next 20 years: Estimates from the Population Ageing and Care Simulation (PACSim) modelling study. The Lancet Public Health, 3(9), e447–e455. https://doi.org/10.1016/S2468-2667(18)30118-X

Krummheuer, A. L., Rehm, M., & Rodil, K. (2020). Triadic Human-Robot Interaction. Distributed Agency and Memory in Robot Assisted Interactions. Companion of the 2020 ACM/IEEE International Conference on Human-Robot Interaction, 317–319. https://doi.org/10.1145/3371382.3378269

Levine, D. M., Ouchi, K., Blanchfield, B., Diamond, K., Licurse, A., Pu, C. T., & Schnipper, J. L. (2018). Hospital-Level Care at Home for Acutely Ill Adults: A Pilot Randomized Controlled Trial. Journal of General Internal Medicine, 33(5), 729–736. https://doi.org/10.1007/s11606-018-4307-z

Lipp, B. (2022). Caring for robots: How care comes to matter in human-machine interfacing. Social Studies of Science, 03063127221081446. https://doi.org/10.1177/03063127221081446

Maguire, D., Honeyman, M., Fenney, D., & Jabbal, J. (2021). Shaping the future of digital technology in health and social care. The King’s Fund. https://www.kingsfund.org.uk/publications/future-digital-technology-health-social-care

Sacks, H. (1984). On doing ‘being ordinary’. In J. Heritage & J. M. Atkinson (Eds.), Structures of social action: Studies in conversation analysis (pp. 413–429). Cambridge University Press.

Share, P., & Pender, J. (2018). Preparing for a Robot Future? Social Professions, Social Robotics and the Challenges Ahead. Irish Journal of Applied Social Studies, 18(1). https://doi.org/10.21427/D7472M

Stokoe, E., Sikveland, R. O., Albert, S., Hamann, M., & Housley, W. (2020). Can humans simulate talking like other humans? Comparing simulated clients to real customers in service inquiries. Discourse Studies, 22(1), 87–109. https://doi.org/10.1177/1461445619887537

Topol, E. (2019). The Topol Review: Preparing the healthcare workforce to deliver the digital future (p. 103). Health Education England. https://topol.hee.nhs.uk/wp-content/uploads/HEE-Topol-Review-2019.pdf

Tuisku, O., Pekkarinen, S., Hennala, L., & Melkas, H. (2018). “Robots do not replace a nurse with a beating heart”: The publicity around a robotic innovation in elderly care. Information Technology & People, 32(1), 47–67. https://doi.org/10.1108/ITP-06-2018-0277

White, G. W., Lloyd Simpson, J., Gonda, C., Ravesloot, C., & Coble, Z. (2010). Moving from Independence to Interdependence: A Conceptual Model for Better Understanding Community Participation of Centers for Independent Living Consumers. Journal of Disability Policy Studies, 20(4), 233–240. https://doi.org/10.1177/1044207309350561Post navigation

Wright, J. (2021). The Alexafication of Adult Social Care: Virtual Assistants and the Changing Role of Local Government in England. International Journal of Environmental Research and Public Health, 18(2), Article 2. https://doi.org/10.3390/ijerph18020812

Wright, J. (2023). Robots won’t save Japan: An ethnography of eldercare automation. ILR Press, an imprint of Cornell University Press.

Conversational User Interfaces in Smart Homecare Interactions: A Conversation Analytic Case Study Read More »

An artificial turn in social interaction research?

August 21, 2021

Jakub Mlynář, Andreas Liesenfeld, Lynn de Renata Topinková, Wyke Stommel, Lynn de Rijk, and Saul Albert for the 6th Copenhagen Multimodality Day: Interacting with AI

The turn towards multimodality and embodiment in interaction research has yielded new terminology and representational schema in key publications (Nevile 2015). At the intersections between multidisciplinary fields, e.g., ethnomethodological and conversation analytic (EMCA) research exploring interactions between humans and ‘AI’, social robots, and conversational user interfaces, such methodological changes are even harder to track. How do these approaches to the meticulous, naturalistic study of technologies in (and of) social interaction reframe the key terms, schema and practices that constitute AI as a field of technosocial activity? Largely grounded in the EMCA Wiki bibliography, we map this emerging field and report on a bibliometric review of 90 publications directly relevant to EMCA studies of AI (broadly defined) including social robots and their components such as voice interfaces.

We found that the most works cited in the EMCA+AI corpus are classics from the canon of human interaction research (Garfinkel, Sacks, Schegloff, Goffman), including multimodality (Goodwin, Heath), human-machine interaction (Suchman), and STS (Latour). The most frequently cited texts are: Sacks, Schegloff and Jefferson’s (1974) ‘turn-taking paper’ (in 45% of items from the corpus), Garfinkel’s (1967) Studies (40%), and Suchman’s (1987) book (31%). Dealing specifically with AI from an EMCA perspective, Porcheron et al.’s 2018 paper on voice user interfaces is the most cited (11%). Apart from this one, two other texts feature as citation hubs: Alač’s (2016) and Pitsch et al.’s (2013) papers on social robots and embodiment. The study aims to provide a starting point for discussion about how concepts such as embodiment, agency and interaction are shared, used and understood through the practice of academic citation.

References

Nevile, M. (2015). The Embodied Turn in Research on Language and Social Interaction. Research on Language and Social Interaction, 48(2), 121–151.

An artificial turn in social interaction research? Read More »

The interactional coordination of virtual and personal assistants in a homecare setting

August 21, 2021

Saul Albert, Magnus Hamann & Elizabeth Stokoe (for the 6th Copenhagen Multimodality Day), October 2021.

Abstract

Policymakers and care service providers are increasingly looking to technological developments in AI and robotics to augment or replace health and social care services in the context of a demographic ageing crisis (House of Lords, 2021; Kingston et al., 2018; Topol, 2019, pp. 54–55). However, there is still little evidence as to how these technologies might be applied to everyday social care situations (Maguire et al., 2021). This paper uses conversation analysis of ~100 hours of video recorded interactions between a disabled person, their virtual assistant (Alexa), and their (human) personal assistant to explore how routine care tasks are organized in a domestic setting. We focus on how the human participants organize conversational turn-space around ‘turns-at-use’ with the virtual assistant. Specifically, how turns-at-use ostensibly designed for the virtual assistant can recruit overhearing others. Further, we show how participants include the virtual assistant in their shared taskscape by, for example, putting ongoing activities and conversations on hold, visibly reorienting their bodies, or explicitly making themselves available for – or requesting – assistance when coordination trouble emerges between the machine-human dyad. Our findings show that virtual assistants expand the affordances of a homecare environment but do not replace the work of personal assistants.

References

Amazon Echo. (2019). Amazon Alexa: Sharing is Caring. https://www.youtube.com/watch?v=225Wlg3pkdo

Goodwin, C. (2000). Action and embodiment within situated human interaction. Journal of Pragmatics, 32(10), 1489–1522. https://doi.org/10.1016/S0378-2166(99)00096-X

Stokoe, E., Sikveland, R. O., Albert, S., Hamann, M., & Housley, W. (2020). Can humans simulate talking like other humans? Comparing simulated clients to real customers in service inquiries. Discourse Studies, 22(1), 87–109. https://doi.org/10.1177/1461445619887537

The interactional coordination of virtual and personal assistants in a homecare setting Read More »

Putting wake words to bed

August 2, 2021

Magnus Hamann and I wrote a provocation paper for the third conference on Conversational User Interfaces 2021.

In it, we argue (hopefully provocatively), that voice user interface designers should stop using wake words like “Alexa” and “Hey Siri” that are crowding each other out of the audible environment of the smart home. Our point is that, as interface elements, wake words are misleading for users who seem to treat them like fully-fledged interactional summons, when they’re really little more than glorified ‘on’ buttons.

We got a surprisingly positive response from the technically-inclined audience at the conference. I found it surprising mostly because wake words are so ubiquitous and central to the branding and functionality of today’s voice interfaces that it seems hard to imagine them being phased out in favour of something more prosaic.

You can read the full paper on the ACM site, or a preprint here.

References

Charles Goodwin. 2007. Interactive footing. In Reporting Talk, Elizabeth Holt and Rebecca Clift (eds.). Cambridge University Press, Cambridge, 16–46. DOI:https://doi.org/10.1017/CBO9780511486654.003
Alexa Hepburn and Galina B Bolden. 2017. Transcribing for social research. Sage, London.
William Housley, Saul Albert, and Elizabeth Stokoe. 2019. Natural Action Processing. In Proceedings of the Halfway to the Future Symposium 2019 (HTTF 2019), Association for Computing Machinery, Nottingham, United Kingdom, 1–4. DOI:https://doi.org/10.1145/3363384.3363478
Razan Jaber, Donald McMillan, Jordi Solsona Belenguer, and Barry Brown. 2019. Patterns of gaze in speech agent interaction. In Proceedings of the 1st International Conference on Conversational User Interfaces – CUI ’19, ACM Press, Dublin, Ireland, 1–10. DOI:https://doi.org/10.1145/3342775.3342791
Seung-Hee Lee. 2006. Second summonings in Korean telephone conversation openings. Language in Society. 35, 02. DOI:https://doi.org/10.1017/S0047404506060118
Gene H Lerner. 2003. Selecting next speaker: The context-sensitive operation of a context-free organization. Language in Society. 32, 02, 177–201. DOI:https://doi.org/10.1017/S004740450332202X
Ewa Luger and Abigail Sellen. 2016. “Like Having a Really Bad PA”: The Gulf between User Expectation and Experience of Conversational Agents. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI ’16), Association for Computing Machinery, New York, NY, USA, 5286–5297. DOI:https://doi.org/10.1145/2858036.2858288
Robert J. Moore and Raphael Arar. 2019. Conversational UX design: A practitioner’s guide to the natural conversation framework. Association for Computing Machinery, New York, NY, USA.
Clifford Nass and Youngme Moon. 2000. Machines and Mindlessness: Social Responses to Computers. Journal of Social Issues 56, 1 (2000), 81–103. DOI:https://doi.org/10.1111/0022-4537.00153
Hannah R. M. Pelikan and Mathias Broth. 2016. Why That Nao? In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems – CHI \textquotesingle16, ACM Press. DOI:https://doi.org/10.1145/2858036.2858478
Danielle Pillet-Shore. 2018. How to Begin. Research on Language and Social Interaction 51, 3 (July 2018), 213–231. DOI:https://doi.org/10.1080/08351813.2018.1485224
Martin Porcheron, Joel E Fischer, Stuart Reeves, and Sarah Sharples. 2018. Voice Interfaces in Everyday Life. In Proceedings of the 2018 ACM Conference on Human Factors in Computing Systems – CHI’18, ACM Press. DOI:https://doi.org/10.1145/3173574.3174214
Stuart Reeves, Martin Porcheron, and Joel Fischer. 2018. “This is not what we wanted”: designing for conversation with voice interfaces. Interactions 26, 1, 46–51. DOI:https://doi.org/10.1145/3296699
Harvey Sacks. 1995. Lectures on conversation. Wiley-Blackwell, London.
Emanuel A Schegloff. 1968. Sequencing in Conversational Openings. American Anthropologist 70, 6, 1075–1095. DOI:https://doi.org/10.1525/aa.1968.70.6.02a00030
Emanuel A Schegloff. 1988. Presequences and indirection: Applying speech act theory to ordinary conversation. Journal of Pragmatics 12, 1 (1988), 55–62.
Emanuel A Schegloff. 2007. Sequence organization in interaction: Volume 1: A primer in conversation analysis. Cambridge University Press, Cambridge.

Putting wake words to bed Read More »

Digital transcription for EM/CA research

July 25, 2021

I have put my introduction to digital transcription workshop materials and tutorials online, here’s a little blog outlining some of the reasons I started developing the workshop, and how I hope researchers will use it.

There are very few – if any – software tools designed specifically for conversation analytic transcription, partly because so few conversation analysts use them, so there’s not really a ‘market’ for software developers to cater to.

Instead, we have to make do with tools that were designed for more generic research workflows, and which often build in analytic assumptions, constraints and visual metaphors that don’t necessarily correspond with EM/CA’s methodological priorities.

Nonetheless, most researchers that use digital transcription systems choose between two main paradigms.

the ‘list-of-turns’-type system represents interaction much like a Jeffersonian transcript: a rendering of turn-by-turn talk, line by line, laid out semi-diagrammatically so that lines of overlapping talk are vertically aligned on the page.
the ‘tiers-of-timelines’ system uses a horizontal scrolling timeline like a video editing interface, with multiple layers or ‘tiers’ representing e.g., each participant’s talk, embodied actions, and other types of action annotated over time.

CLAN’s interface (left) and ELAN’s (right) with transcripts of the same bit of audiovisual data

A key utility of both kinds of digital transcription systems is that they allow researchers to align media and transcript, and to use very precise timing tools to check the order and timing of their analytic observations.

I used these terms to describe this distinction between representational schema in a short ‘expert box’ for Alexa Hepburn and Galina Bolden’s excellent (2017) book Transcribing for Social Research entitled “how to choose transcription software for conversation analysis“, where I tried to explain what is at stake in choosing one or the other type of system .

For the most part, researchers choose lists-of-turns tools when their analysis is focused on conversation and audible turn-space, and tiers-of-timelines when their analysis focuses on video analysis of visible bodily action.

The problem for EM/CA researchers working with both these approaches, however, is that neither representational schema on its own, (nor any schema save whatever schema may have been constituted through the original interaction itself), is ideal for exploring and describing participants’ sense-making processes and resources.

Tiers-of-timelines representations are great for showing the temporal unfolding of simultaneous action, but it is hard to read more than a few seconds of activity at a glance. By contrast, lists-of-turns use the same basic schema as our well-practiced, mundane reading abilities to scan a page of text and take in the overall structure of a conversation, but reduce the fine-grained timing and multi-activity organization of complex embodied activities.

In any case, neither of these representational schema, nor any currently available transcription tools adequately capture the dynamics of movement in the way that, for example, specialized graphical methods a nd life drawing techniques were developed to achieve (although our Drawing Interactions prototype points to some possibilities).

The reason I put this digital transcription workshop together was to combine existing, well-used software tools for digital transcription from both major paradigms, and to show how to work on a piece of data using both approaches. It’s not intended as a comprehensive ‘solution’, and there are many unresolved practical and conceptual issues, but I think it gives researchers the best chance to address their empirical concerns to help break away from the conceptual and disciplinary constraints that come from analyzing data using one, uniform type of user interface.

The workshop materials include slides (so people can use them to teach collaborators/students) as well as a series of short tutorial videos accompanying each practical exercise in the slides, along with some commentary from me.

My hope is that researchers will use and improve these materials, and possibly extend them to include additional tools (e.g., EXMARaLDA project tools, with which I’m less familiar). If you do, and you find ways to improve them with additional tips, hacks, or updated instructions that take into account new versions, please do let me know.

Digital transcription for EM/CA research Read More »

Moving into step: The embodiment of social structures of action

December 22, 2020

The abstract for a forthcoming article by myself and Dirk vom Lehn, soon to be liberated from the stalled pandemic year R&R cycle. Draft available if you’re willing to give feedback!

Abstract

While dance has often featured in sociological theory, there are relatively few empirical studies that explore the social practices through which people learn to dance together. This paper takes as its point of departure the way that partner dance is often featured as a metaphor to illustrate theories about social order and interaction. We examine a corpus of video data gathered as part of a day-long workshop and explore how novice dancers learn to perform some of the basic steps of a social dance in time with their partner and with the rhythmical environment. The analysis shows how dancers use rhythm, bodies, language and other resources to organize their social interactions and shows how ethnomethodology and conversation analysis provide a critical standpoint for examining sociological theories about the relationship between the body and the social.

Keywords: ethnomethodology, conversation analysis, multimodality, dance, culture,

Moving into step: The embodiment of social structures of action Read More »

Three meeting points between CA and AI

June 29, 2020

I gave this keynote at the first European Conference on Conversation Analysis (ECCA 2020), which, due to C-19, had to be delivered as a video instead of a stand-up talk.

I tried to make a mix between a film essay and a research presentation of work-in-progress, so it didn’t always work to put references on every slide. I’ve added them below with links to the data used where available.

Abstract

Sacks’ (1963) first published paper on ‘sociological description’ uses the metaphor of a mysterious ‘talking-and-doing’ machine, where researchers from different disciplines come up with incompatible, contradictory descriptions of its functionality. We may soon find ourselves in a similar situation to the one Sacks describes as AI continues to permeate the social sciences, and CA begins to encounter AI either as a research object, as a research tool, or more likely as a pervasive feature of both.

There is now a thriving industry in ‘Conversational AI’ and AI-based tools that claim to emulate or analyse talk, but both the study and use of AI within CA is still unusual. While a growing literature is using CA to study social robotics, voice interfaces, and conversational user experience design (Pelikan & Broth, 2016; Porcheron et al., 2018), few conversation analysts even use digital tools, let alone the statistical and computational methods that underpin conversational AI. Similarly, researchers and developers of conversational AI rarely cite CA research and have only recently become interested in CA as a possible solution to hard problems in natural language processing (NLP). This situation presents an opportunity for mutual engagement between conversational AI and CA (Housley et al., 2019). To prompt a debate on this issue, I will present three projects that combine AI and CA very differently and discusses the implications and possibilities for combined research programmes.

The first project uses a series of single case analyses to explore recordings in which an advanced conversational AI successfully makes appointments over the phone with a human call-taker. The second revisits debates on using automated speech recognition for CA transcription (Moore, 2015) in light of significant recent advances in AI-based speech-to-text, and includes a live demo of ‘Gailbot’, a Jeffersonian automated transcription system. The third project both uses and studies AI in an applied CA context. Using video analysis, it asks how a disabled man and his care worker interact while using AI-based voice interfaces and a co-designed ‘home automation’ system as part of a domestic routine of waking, eating, and personal care. Data are drawn from a corpus of ~500 hours of video data recorded by the participants using a voice-controlled, AI-based ‘smart security camera’ system.

These three examples of CA’s potential interpretations and uses of AI’s ‘talking-and-doing’ machines provide material for a debate about how CA research programmes might conceptualize AI, and use or combine it with CA in a mutually informative way.

Videos (in order of appearance)

The Senster. (2007, March 29). https://www.youtube.com/watch?v=wY85GrYGnyw

MIT AI Lab. (2011, September 25). https://www.youtube.com/watch?v=hp9NHNKTV-M

Keynote (Google I/O ’18). (2018, May 9). https://www.youtube.com/watch?v=ogfYd705cRs

Online Data

Linguistic Data Consortium. (2013). CABank CallHome English Corpus [Data set]. Talkbank. https://ca.talkbank.org/access/CallHome/eng.html

Jefferson, G. (2007). CABank English Jefferson NB Corpus [Data set]. TalkBank. https://doi.org/10.21415/T58P4Z

Bibliography

Agre, P. (1997). Toward a critical technical practice: Lessons learned in trying to reform AI. Social Science, Technical Systems and Cooperative Work: Beyond the Great Divide. Erlbaum.

Berger, I., Viney, R., & Rae, J. P. (2016). Do continuing states of incipient talk exist? Journal of Pragmatics, 91, 29–44. https://doi.org/10.1016/j.pragma.2015.10.009

Bolden, G. B. (2015). Transcribing as Research: “Manual” Transcription and Conversation Analysis. Research on Language and Social Interaction, 48(3), 276–280. https://doi.org/10.1080/08351813.2015.1058603

Brooker, P., Dutton, W., & Mair, M. (2019). The new ghosts in the machine: “Pragmatist” AI and the conceptual perils of anthropomorphic description. Ethnographic Studies, 16, 272–298. https://doi.org/10.5281/zenodo.3459327

Button, Graham. (1990). Going Up a Blind Alley: Conflating Conversation Analysis and Computational Modelling. In P. Luff, N. Gilbert, & D. Frolich (Eds.), Computers and Conversation (pp. 67–90). Academic Press. https://doi.org/10.1016/B978-0-08-050264-9.50009-9

Button, Graham, & Dourish, P. (1996). Technomethodology: Paradoxes and possibilities. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. http://dl.acm.org/citation.cfm?id=238394

Button, G., & Sharrock, W. (1996). Project work: The organisation of collaborative design and development in software engineering. Computer Supported Cooperative Work (CSCW), 5(4), 369–386. https://doi.org/10.1007/BF00136711

Casino, T., & Freenor, Michael. (2018). An introduction to Google Duplex and natural conversations, Willowtree. https://willowtreeapps.com/ideas/an-introduction-to-google-duplex-and-natural-conversations

Duca, D. (2019). Who’s disrupting transcription in academia? — SAGE Ocean | Big Data, New Tech, Social Science. SAGE Ocean. https://ocean.sagepub.com/blog/whos-disrupting-transcription-in-academia

Fischer, J. E., Reeves, S., Porcheron, M., & Sikveland, R. O. (2019). Progressivity for voice interface design. Proceedings of the 1st International Conference on Conversational User Interfaces – CUI ’19, 1–8. https://doi.org/10.1145/3342775.3342788

Garfinkel, H. (1967). Studies in ethnomethodology. Prentice-Hall.

Goodwin, C. (1996). Transparent vision. In E. A. Schegloff & S. A. Thompson (Eds.), Interaction and Grammar (pp. 370–404). Cambridge University Press.

Heath, C., & Luff, P. (1992). Collaboration and control: Crisis management and multimedia technology in London Underground Line Control Rooms. Computer Supported Cooperative Work (CSCW), 1(1–2), 69–94.

Heritage, J. (1984). Garfinkel and ethnomethodology. Polity Press.

Heritage, J. (1988). Explanations as accounts: A conversation analytic perspective. In C. Antaki (Ed.), Analysing Everyday Explanation: A Casebook of Methods (pp. 127–144). Sage Publications.

Hoey, E. M. (2017). Lapse organization in interaction [PhD Thesis, Max Planck Institute for Psycholinguistics, Radbound University, Nijmegen]. http://bit.ly/hoey2017

Housley, W., Albert, S., & Stokoe, E. (2019). Natural Action Processing. In J. E. Fischer, S. Martindale, M. Porcheron, S. Reeves, & J. Spence (Eds.), Proceedings of the Halfway to the Future Symposium 2019 (pp. 1–4). Association for Computing Machinery. https://doi.org/10.1145/3363384.3363478

Kendrick, K. H. (2017). Using Conversation Analysis in the Lab. Research on Language and Social Interaction, 50(1), 1–11. https://doi.org/10.1080/08351813.2017.1267911

Lee, S.-H. (2006). Second summonings in Korean telephone conversation openings. Language in Society, 35(02). https://doi.org/10.1017/S0047404506060118

Leviathan, Y., & Matias, Y. (2018). Google Duplex: An AI System for Accomplishing Real-World Tasks Over the Phone [Blog]. Google AI Blog. http://ai.googleblog.com/2018/05/duplex-ai-system-for-natural-conversation.html

Local, J., & Walker, G. (2005). Methodological Imperatives for Investigating the Phonetic Organization and Phonological Structures of Spontaneous Speech. Phonetica, 62(2–4), 120–130. https://doi.org/10.1159/000090093

Luff, P., Gilbert, N., & Frolich, D. (Eds.). (1990). Computers and Conversation. Academic Press.

Moore, R. J. (2015). Automated Transcription and Conversation Analysis. Research on Language and Social Interaction, 48(3), 253–270. https://doi.org/10.1080/08351813.2015.1058600

Ogden, R. (2015). Data Always Invite Us to Listen Again: Arguments for Mixing Our Methods. Research on Language and Social Interaction, 48(3), 271–275. https://doi.org/10.1080/08351813.2015.1058601

O’Leary, D. E. (2019). Google’s Duplex: Pretending to be human. Intelligent Systems in Accounting, Finance and Management, 26(1), 46–53. https://doi.org/10.1002/isaf.1443

Pelikan, H. R. M., & Broth, M. (2016). Why That Nao? Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems – CHI \textquotesingle16. https://doi.org/10.1145/2858036.2858478

Pelikan, H. R. M., Broth, M., & Keevallik, L. (2020). “Are You Sad, Cozmo?”: How Humans Make Sense of a Home Robot’s Emotion Displays. Proceedings of the 2020 ACM/IEEE International Conference on Human-Robot Interaction, 461–470. https://doi.org/10.1145/3319502.3374814

Porcheron, M., Fischer, J. E., Reeves, S., & Sharples, S. (2018). Voice Interfaces in Everyday Life. Proceedings of the 2018 ACM Conference on Human Factors in Computing Systems (CHI’18).

Reeves, S. (2017). Some conversational challenges of talking with machines. Talking with Conversational Agents in Collaborative Action, Workshop at the 20th ACM Conference on Computer-Supported Cooperative Work and Social Computing. http://eprints.nottingham.ac.uk/40510/

Relieu, M., Sahin, M., & Francillon, A. (2019). Lenny the bot as a resource for sequential analysis: Exploring the treatment of Next Turn Repair Initiation in the beginnings of unsolicited calls. https://doi.org/10.18420/muc2019-ws-645

Robles, J. S., DiDomenico, S., & Raclaw, J. (2018). Doing being an ordinary technology and social media user. Language & Communication, 60, 150–167. https://doi.org/10.1016/j.langcom.2018.03.002

Sacks, H. (1984). On doing “being ordinary.” In J. Heritage & J. M. Atkinson (Eds.), Structures of social action: Studies in conversation analysis (pp. 413–429). Cambridge University Press.

Sacks, H. (1987). On the preferences for agreement and contiguity in sequences in conversation. In G Button & J. R. Lee (Eds.), Talk and social organization (pp. 54–69). Multilingual Matters.

Sacks, H. (1995a). Lectures on conversation: Vol. II (G. Jefferson, Ed.). Wiley-Blackwell.

Sacks, H., Schegloff, E. A., & Jefferson, G. (1974). A simplest systematics for the organization of turn-taking for conversation. Language, 50(4), 696–735. https://doi.org/10.2307/412243

Sahin, M., Relieu, M., & Francillon, A. (2017). Using chatbots against voice spam: Analyzing Lenny’s effectiveness. Proceedings of the Thirteenth Symposium on Usable Privacy and Security, 319–337.

Schegloff, E. A. (1988). On an Actual Virtual Servo-Mechanism for Guessing Bad News: A Single Case Conjecture. Social Problems, 35(4), 442–457. https://doi.org/10.2307/800596

Schegloff, E. A. (1993). Reflections on Quantification in the Study of Conversation. Research on Language & Social Interaction, 26(1), 99–128. https://doi.org/10.1207/s15327973rlsi2601_5

Schegloff, E. A. (2004). Answering the Phone. In G. H. Lerner (Ed.), Conversation Analysis: Studies from the First Generation (pp. 63–109). John Benjamins Publishing Company.

Schegloff, E. A. (2010). Some Other “Uh(m)s.” Discourse Processes, 47(2), 130–174. https://doi.org/10.1080/01638530903223380

Soltau, H., Saon, G., & Kingsbury, B. (2010). The IBM Attila speech recognition toolkit. 2010 IEEE Spoken Language Technology Workshop, 97–102. https://doi.org/10.1109/SLT.2010.5700829

Stivers, T. (2015). Coding Social Interaction: A Heretical Approach in Conversation Analysis? Research on Language and Social Interaction, 48(1), 1–19. https://doi.org/10.1080/08351813.2015.993837

Stokoe, E. (2011). Simulated Interaction and Communication Skills Training: The `Conversation-Analytic Role-Play Method’. In Applied Conversation Analysis (pp. 119–139). Palgrave Macmillan UK. https://doi.org/10.1057/9780230316874_7

Stokoe, E. (2013). The (In)Authenticity of Simulated Talk: Comparing Role-Played and Actual Interaction and the Implications for Communication Training. Research on Language & Social Interaction, 46(2), 165–185. https://doi.org/10.1080/08351813.2013.780341

Stokoe, E. (2014). The Conversation Analytic Role-play Method (CARM): A Method for Training Communication Skills as an Alternative to Simulated Role-play. Research on Language and Social Interaction, 47(3), 255–265. https://doi.org/10.1080/08351813.2014.925663

Stokoe, E., Sikveland, R. O., Albert, S., Hamann, M., & Housley, W. (2020). Can humans simulate talking like other humans? Comparing simulated clients to real customers in service inquiries. Discourse Studies, 22(1), 87–109. https://doi.org/10.1177/1461445619887537

Turing, A. (1950). Computing machinery and intelligence. Mind, 49, 433–460.

Walker, G. (2017). Pitch and the Projection of More Talk. Research on Language and Social Interaction, 50(2), 206–225. https://doi.org/10.1080/08351813.2017.1301310

Wong, J. C. (2019, May 29). “A white-collar sweatshop”: Google Assistant contractors allege wage theft. The Guardian. https://www.theguardian.com/technology/2019/may/28/a-white-collar-sweatshop-google-assistant-contractors-allege-wage-theft

Three meeting points between CA and AI Read More »