Hoppa till huvudinnehållet
Till KTH:s startsida

Publikationer av Jens Edlund

Refereegranskade

Artiklar

[1]
Ekström, A. G., Gannon, C., Edlund, J., Moran, S. & Lameira, A. R. (2024). Chimpanzee utterances refute purported missing links for novel vocalizations and syllabic speech. Scientific Reports, 14(1).
[2]
Ekström, A. G. & Edlund, J. (2023). Evolution of the human tongue and emergence of speech biomechanics. Frontiers in Psychology, 14.
[3]
Strömbergsson, S., Götze, J., Edlund, J. & Nilsson Björkenstam, K. (2021). Simulating Speech Error Patterns Across Languages and Different Datasets. Language and Speech, 1-38.
[4]
Strombergsson, S., Edlund, J., McAllister, A. & Lagerberg, T. (2021). Understanding acceptability of disordered speech through Audience Response Systems-based evaluation. Speech Communication, 131, 13-22.
[5]
Strombergsson, S., Holm, K., Edlund, J., Lagerberg, T. & McAllister, A. (2020). Audience Response System-Based Evaluation of Intelligibility of Children's Connected Speech - Validity, Reliability and Listener Differences. Journal of Communication Disorders, 87.
[6]
Clark, L., Doyle, P., Garaialde, D., Gilmartin, E., Schloegl, S., Edlund, J. ... Cowan, B. R. (2019). The State of Speech in HCI : Trends, Themes and Challenges. Interacting with computers, 31(4), 349-371.
[7]
Oertel, C., Cummins, F., Edlund, J., Wagner, P. & Campbell, N. (2013). D64 : A corpus of richly recorded conversational interaction. Journal on Multimodal User Interfaces, 7(1-2), 19-28.
[8]
Al Moubayed, S., Edlund, J. & Beskow, J. (2012). Taming Mona Lisa : communicating gaze faithfully in 2D and 3D facial projections. ACM Transactions on Interactive Intelligent Systems, 1(2), 25.
[9]
Heldner, M. & Edlund, J. (2010). Pauses, gaps and overlaps in conversations. Journal of Phonetics, 38(4), 555-568.
[10]
Edlund, J. & Beskow, J. (2009). MushyPeek : A Framework for Online Investigation of Audiovisual Dialogue Phenomena. Language and Speech, 52, 351-367.
[11]
Hincks, R. & Edlund, J. (2009). PROMOTING INCREASED PITCH VARIATION IN ORAL PRESENTATIONS WITH TRANSIENT VISUAL FEEDBACK. Language Learning & Technology, 13(3), 32-50.
[12]
Edlund, J., Gustafson, J., Heldner, M. & Hjalmarsson, A. (2008). Towards human-like spoken dialogue systems. Speech Communication, 50(8-9), 630-645.
[13]
Heldner, M. & Edlund, J. (2007). What turns speech into conversation? : A project description. TMH-QPSR, 50(1), 45-48.

Konferensbidrag

[14]
Tånnander, C., O'Regan, J., House, D., Edlund, J., Beskow, J. (2024). Prosodic characteristics of English-accented Swedish neural TTS. I Proceedings of Speech Prosody 2024. (s. 1035-1039). Leiden, The Netherlands: International Speech Communication Association.
[15]
Tånnander, C., Edlund, J., Gustafsson, J. (2024). Revisiting Three Text-to-Speech Synthesis Experiments with a Web-Based Audience Response System. I 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings. (s. 14111-14121). European Language Resources Association (ELRA).
[16]
Esfandiari-Baiat, G., Edlund, J. (2024). The MEET Corpus: Collocated, Distant and Hybrid Three-party Meetings with a Ranking Task. I ISA 2024: 20th Joint ACL - ISO Workshop on Interoperable Semantic Annotation at LREC-COLING 2024, Workshop Proceedings. (s. 1-7). European Language Resources Association (ELRA).
[17]
Tånnander, C., House, D., Edlund, J. (2023). Analysis-by-synthesis : phonetic-phonological variation indeep neural network-based text-to-speech synthesis. I Proceedings of the 20th International Congress of Phonetic Sciences, Prague 2023. (s. 3156-3160). Prague, Czech Republic: GUARANT International.
[18]
Fallgren, P., Edlund, J. (2023). Crowdsource-based validation of the audio cocktail as a sound browsing tool. I Interspeech 2023. (s. 2178-2182). International Speech Communication Association.
[19]
Pandey, A., Edlund, J., Le Maguer, S., Harte, N. (2023). Listener sensitivity to deviating obstruents in WaveNet. I Interspeech 2023. (s. 1080-1084). International Speech Communication Association.
[20]
Edlund, J., Brodén, D., Fridlund, M., Lindhé, C., Olsson, L. -., Ängsal, M., Öhberg, P. (2022). A Multimodal Digital Humanities Study of Terrorism in Swedish Politics : An Interdisciplinary Mixed Methods Project on the Configuration of Terrorism in Parliamentary Debates, Legislation, and Policy Networks 1968–2018. I Lecture Notes in Networks and Systems. (s. 435-449). Springer Nature.
[21]
Tånnander, C., House, D., Edlund, J. (2022). Syllable duration as a proxy to latent prosodic features. I Proceedings of Speech Prosody 2022. (s. 220-224). Lisbon, Portugal: International Speech Communication Association.
[22]
Fallgren, P., Edlund, J. (2021). Human-in-the-Loop Efficiency Analysis for Binary Classification in Edyson. I Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. (s. 3685-3689). International Speech Communication Association.
[23]
Tånnander, C., Edlund, J. (2021). Methods of slowing down speech. I Proceedings. 11th ISCA Speech Synthesis Workshop (SSW 11). (s. 43-47).
[24]
Székely, É., Edlund, J., Gustafsson, J. (2020). Augmented Prompt Selection for Evaluation of Spontaneous Speech Synthesis. I Proceedings of The 12th Language Resources and Evaluation Conference. (s. 6368-6374). European Language Resources Association.
[25]
Domeij, R., Edlund, J., Eriksson, G., Fallgren, P., House, D., Lindström, E., Skog, S. N., Öqvist, J. (2020). Exploring the archives for textual entry points to speech - Experiences of interdisciplinary collaboration in making cultural heritage accessible for research. I CEUR Workshop Proceedings. (s. 45-55). CEUR-WS.
[26]
Fallgren, P., Malisz, Z., Edlund, J. (2019). Bringing order to chaos : A non-sequential approach for browsing large sets of found audio data. I Proceedings Of The Eleventh International Conference On Language Resources And Evaluation (LREC 2018). (s. 4307-4311). European Language Resources Association (ELRA).
[27]
Tånnander, C., Edlund, J. (2019). First steps towards text profiling for speech synthesis. I CEUR Workshop Proceedings. (s. 457-468). CEUR-WS.
[28]
Fallgren, P., Malisz, Z., Edlund, J. (2019). How to annotate 100 hours in 45 minutes. I Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. (s. 341-345). ISCA.
[29]
Clark, L., Cowan, B. R., Edwards, J., Munteanu, C., Murad, C., Aylett, M., Moore, R. K., Edlund, J., Székely, É., Healey, P., Harte, N., Torre, I., Doyle, P. (2019). Mapping Theoretical and Methodological Perspectives for Understanding Speech Interface Interactions. I CHI EA '19 EXTENDED ABSTRACTS: EXTENDED ABSTRACTS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS. ASSOC COMPUTING MACHINERY.
[30]
Bystedt, M., Edlund, J. (2019). New applications of gaze tracking in speech science. I CEUR Workshop Proceedings. (s. 73-78). CEUR-WS.
[31]
Tånnander, C., Edlund, J. (2019). Preliminary guidelines for the efficient management of OOV words for spoken text. I Speech Synthesis Workshop (SSW). (s. 137-142).
[32]
Edlund, J. (2019). Shoehorning in the name of science. I Procs. of CUI19. ACM Digital Library.
[33]
Wagner, P., Beskow, J., Betz, S., Edlund, J., Gustafson, J., Henter, G. E., Le Maguer, S., Malisz, Z., Székely, É., Tånnander, C. (2019). Speech Synthesis Evaluation : State-of-the-Art Assessment and Suggestion for a Novel Research Program. I Proceedings of the 10th Speech Synthesis Workshop (SSW10)..
[34]
Tånnander, C., Fallgren, P., Edlund, J., Gustafson, J. (2019). Spot the pleasant people! Navigating the cocktail party buzz. I Proceedings Interspeech 2019, 20th Annual Conference of the International Speech Communication Association. (s. 4220-4224).
[35]
Fallgren, P., Malisz, Z., Edlund, J. (2019). Towards fast browsing of found audio data : 11 presidents. I CEUR Workshop Proceedings. (s. 133-142). CEUR-WS.
[36]
Fallgren, P., Malisz, Z., Edlund, J. (2018). A tool for exploring large amounts of found audio data. I CEUR Workshop Proceedings. (s. 499-503). CEUR-WS.
[37]
Borin, L., Forsberg, M., Edlund, J., Domeij, R. (2018). Språkbanken 2018 : Research resources for text, speech, & society. I CEUR Workshop Proceedings. (s. 504-506). CEUR-WS.
[38]
Strömbergsson, S., Edlund, J., Götze, J., Björkenstam, K. N. (2017). Approximating phonotactic input in children's linguistic environments from orthographic transcripts. I Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. (s. 2213-2217). International Speech Communication Association.
[39]
Edlund, J., Gustafson, J. (2016). Hidden resources - Strategies to acquire and exploit potential spoken language resources in national archives. I Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. (s. 4531-4534). European Language Resources Association (ELRA).
[40]
Edlund, J., Tånnander, C., Gustafson, J. (2015). Audience response system-based assessment for analysis-by-synthesis. I Proc. of ICPhS 2015. ICPhS.
[41]
Włodarczak, M., Heldner, M., Edlund, J. (2015). Communicative needs and respiratory constraints. I Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. (s. 3051-3055). International Speech Communication Association.
[42]
Edlund, J., Heldner, M., Wlodarczak, M. (2014). Catching wind of multiparty conversation. I LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION..
[43]
Edlund, J., Edelstam, F., Gustafson, J. (2014). Human pause and resume behaviours for unobtrusive humanlike in-car spoken dialogue systems. I Proceedings of the of the EACL 2014 Workshop on Dialogue in Motion (DM). (s. 73-77). Gothenburg, Sweden.
[44]
Dalmas, T., Götze, J., Gustafsson, J., Janarthanam, S., Kleindienst, J., Mueller, C., Stent, A., Vlachos, A., Artzi, Y., Benotti, L., Boye, J., Clark, S., Curin, J., Dethlefs, N., Edlund, J., Goldwasser, D., Heeman, P., Jurcicek, F., Kelleher, J., Komatani, K., Kwiatkowski, T., Larsson, S., Lemon, O., Lenke, N., Macek, J., Macek, T., Mooney, R., Ramachandran, D., Rieser, V., Shi, H., Tenbrink, T., Williams, J. (2014). Introduction. I Proceedings 2014 Workshop on Dialogue in Motion, DM 2014. Association for Computational Linguistics (ACL).
[45]
Strömbergsson, S., Tånnander, C., Edlund, J. (2014). Ranking severity of speech errors by their phonological impact in context. I Proceedings of the Annual ConfereProceedings of the Annual Conference of the International Speech Communication Association. (s. 1568-1572).
[46]
Al Moubayed, S., Edlund, J., Gustafson, J. (2013). Analysis of gaze and speech patterns in three-party quiz game interaction. I Interspeech 2013. (s. 1126-1130). The International Speech Communication Association (ISCA).
[47]
Heldner, M., Hjalmarsson, A., Edlund, J. (2013). Backchannel relevance spaces. I Nordic Prosody: Proceedings of the XIth Conference, Tartu 2012. (s. 137-146). Franktfurt am Main, Germany: Peter Lang Publishing Group.
[48]
Edlund, J., Al Moubayed, S., Tånnander, C., Gustafson, J. (2013). Temporal precision and reliability of audience response system based annotation. I Proc. of Multimodal Corpora 2013..
[49]
Oertel, C., Salvi, G., Götze, J., Edlund, J., Gustafson, J., Heldner, M. (2013). The KTH Games Corpora : How to Catch a Werewolf. I IVA 2013 Workshop Multimodal Corpora: Beyond Audio and Video: MMC 2013..
[50]
Strömbergsson, S., Hjalmarsson, A., Edlund, J., House, D. (2013). Timing responses to questions in dialogue. I Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2013. (s. 2583-2587). Lyon, France: International Speech and Communication Association.
[51]
Edlund, J., Alexanderson, S., Beskow, J., Gustavsson, L., Heldner, M., Hjalmarsson, A., Kallionen, P., Marklund, E. (2012). 3rd party observer gaze as a continuous measure of dialogue flow. I Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012. (s. 1354-1358). Istanbul, Turkey: European Language Resources Association.
[52]
Edlund, J., Heldner, M., Hjalmarsson, A. (2012). 3rd party observer gaze during backchannels. I Proc. of the Interspeech 2012 Interdisciplinary Workshop on Feedback Behaviors in Dialog. Skamania Lodge, WA, USA.
[53]
Strömbergsson, S., Edlund, J., House, D. (2012). A study of Swedish questions and their prosodic characteristics. I Proceedings of Workshop on Innovation and Applications in Speech Technology (IAST). (s. 61-64). Dublin, Ireland.
[54]
Oertel, C., Wlodarczak, M., Edlund, J., Wagner, P., Gustafson, J. (2012). Gaze Patterns in Turn-Taking. I 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol 3. (s. 2243-2246). Portland, Oregon, US.
[55]
Edlund, J., Oertel, C., Gustafson, J. (2012). Investigating negotiation for load-time in the GetHomeSafe project. I Proc. of Workshop on Innovation and Applications in Speech Technology (IAST). (s. 45-48). Dublin, Ireland.
[56]
Edlund, J., Hjalmarsson, A. (2012). Is it really worth it? : Cost-based selection of system responses to speech-in-overlap. I Proc. of the IVA 2012 workshop on Realtime Conversational Virtual Agents (RCVA 2012). Santa Crux, CA, USA.
[57]
Laskowski, K., Heldner, M., Edlund, J. (2012). On the dynamics of overlap in multi-party conversation. I 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012. (s. 846-849).
[58]
Edlund, J., Heldner, M., Gustafson, J. (2012). On the effect of the acoustic environment on the accuracy of perception of speaker orientation from auditory cues alone. I 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol 2. (s. 1482-1485).
[59]
Strömbergsson, S., Edlund, J., House, D. (2012). Prosodic measurements and question types in the Spontal corpus of Swedish dialogues. I 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol 1. (s. 838-841).
[60]
Edlund, J., House, D., Strömbergsson, S. (2012). Question types and some prosodic correlates in 600 questions in the Spontal database of Swedish dialogues. I Proceedings Of The 6th International Conference On Speech Prosody, Vols I and  II. (s. 737-740). Shanghai, China: Tongji Univ Press.
[61]
Strömbergsson, S., Edlund, J., House, D. (2012). Question types and some prosodic correlates in the Spontal corpus of Swedish dialogues. I Proceedings of Fonetik 2012. Gothenburg, Sweden.
[62]
Strömbergsson, S., Edlund, J., House, D. (2012). Questions and reported speech in Swedish dialogues. I Nordic Prosody: Proceedings of the XIth Conference, Tartu 2012. Tartu, Estonia.
[63]
Edlund, J., Strömbergsson, S., House, D. (2012). Telling questions from statements in spoken dialogue systems. I Proc. of SLTC 2012. Lund, Sweden.
[64]
Edlund, J., Heldner, M., Gustafson, J. (2012). Who am I speaking at? : perceiving the head orientation of speakers from acoustic cues alone. I Proc. of LREC Workshop on Multimodal Corpora 2012. Istanbul, Turkey.
[65]
Laskowski, K., Edlund, J., Heldner, M. (2011). A single-port non-parametric model of turn-taking in multi-party conversation. I Proc. of ICASSP 2011. (s. 5600-5603). Prague, Czech Republic.
[66]
Al Moubayed, S., Beskow, J., Edlund, J., Granström, B., House, D. (2011). Animated Faces for Robotic Heads : Gaze and Beyond. I Analysis of Verbal and Nonverbal Communication and Enactment: The Processing Issues. (s. 19-35). Springer Berlin/Heidelberg.
[67]
Laskowski, K., Edlund, J., Heldner, M. (2011). Incremental learning and forgetting in incremental stochastic turn-taking models. I Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. (s. 2080-2083). Florence, Italy.
[68]
Beskow, J., Alexanderson, S., Al Moubayed, S., Edlund, J., House, D. (2011). Kinetic Data for Large-Scale Analysis and Modeling of Face-to-Face Conversation. I Proceedings of International Conference on Audio-Visual Speech Processing 2011. (s. 103-106). Stockholm: KTH Royal Institute of Technology.
[69]
Landsiedel, C., Edlund, J., Eyben, F., Neiberg, D., Schuller, B. (2011). Syllabification of conversational speech using bidirectional long-short-term memory neural networks. I Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. (s. 5256-5259). Prague, Czech Republic.
[70]
Edlund, J., Al Moubayed, S., Beskow, J. (2011). The Mona Lisa Gaze Effect as an Objective Metric for Perceived Cospatiality. I Proc. of the Intelligent Virtual Agents 10th International Conference (IVA 2011). (s. 439-440). Springer.
[71]
Heldner, M., Edlund, J., Hjalmarsson, A., Laskowski, K. (2011). Very short utterances and timing in turn-taking. I Proceedings of Interspeech 2011. (s. 2848-2851).
[72]
Laskowski, K., Edlund, J. (2010). A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm. I Proceedings of the International Conference on Language Resources and Evaluation, LREC 2010. (s. 3742-3749). Valetta, Malta: European Language Resources Association.
[73]
Edlund, J., Beskow, J. (2010). Capturing massively multimodal dialogues : affordable synchronization and visualization. I Proc. of Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality (MMC 2010). (s. 160-161).
[74]
Oertel, C., Cummins, F., Campbell, N., Edlund, J., Wagner, P. (2010). D64: A corpus of richly recorded conversational interaction. I Proceedings of LREC 2010 Workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality. (s. 27-30).
[75]
Beskow, J., Edlund, J., Granström, B., Gustafsson, J., House, D. (2010). Face-to-Face Interaction and the KTH Cooking Show. I Development of multimodal interfaces: Active listing and synchrony. (s. 157-168).
[76]
Heldner, M., Edlund, J., Hirschberg, J. (2010). Pitch similarity in the vicinity of backchannels. I Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010. (s. 3054-3057). Makuhari, Japan.
[77]
Laskowski, K., Heldner, M., Edlund, J. (2010). Preliminaries to an account of multi-party conversational turn-taking as an antiferromagnetic spin glass. I Proceedings of NIPS Workshop on Modeling Human Communication Dynamics. Vancouver, B.C., Canada.
[78]
Edlund, J., Beskow, J., Elenius, K., Hellmer, K., Strömbergsson, S., House, D. (2010). Spontal : a Swedish spontaneous dialogue corpus of audio, video and motion capture. I Proc. of the Seventh conference on International Language Resources and Evaluation (LREC'10). (s. 2992-2995).
[79]
Sikveland, R.-O., Öttl, A., Amdal, I., Ernestus, M., Svendsen, T., Edlund, J. (2010). Spontal-N : A Corpus of Interactional Spoken Norwegian. I Proc. of the Seventh conference on International Language Resources and Evaluation (LREC'10). (s. 2986-2991).
[80]
Laskowski, K., Heldner, M., Edlund, J. (2009). A general-purpose 32 ms prosodic vector for Hidden Markov Modeling. I Proceedings of Interspeech 2009. (s. 724-729). Brighton, UK: ISCA.
[81]
Laskowski, K., Heldner, M., Edlund, J. (2009). Exploring the prosody of floor mechanisms in English using the fundamental frequency variation spectrum. I Proceedings of the 2009 European Signal Processing Conference (EUSIPCO-2009). (s. 2539-2543). Glasgow, Scotland.
[82]
Edlund, J., Heldner, M., Hirschberg, J. (2009). Pause and gap length in face-to-face interaction. I INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009. (s. 2779-2782). BAIXAS: ISCA-INST SPEECH COMMUNICATION ASSOC.
[83]
Heldner, M., Edlund, J., Laskowski, K., Pelcé, A. (2009). Prosodic features in the vicinity of pauses, gaps and overlaps. I Nordic Prosody: Proceedings of the Xth Conference. (s. 95-106). Frankfurt am Main: Peter Lang.
[84]
Edlund, J., Heldner, M., Pelcé, A. (2009). Prosodic features of very short utterances in dialogue. I Nordic Prosody: Proceedings of the Xth Conference. (s. 57-68). Frankfurt am Main: Peter Lang.
[85]
Beskow, J., Edlund, J., Granström, B., Gustafson, J., Skantze, G., Tobiasson, H. (2009). The MonAMI Reminder : a spoken dialogue system for face-to-face interaction. I Proceedings of the 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009. (s. 300-303). Brighton, U.K.
[86]
Hincks, R., Edlund, J. (2009). Using speech technology to promote increased pitch variation in oral presentations. I Proc. of SLaTE Workshop on Speech and Language Technology in Education. Wroxall, UK.
[87]
Laskowski, K., Edlund, J., Heldner, M. (2008). An instantaneous vector representation of delta pitch for speaker-change prediction in conversational dialogue systems. I 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING. (s. 5041-5044). New York: IEEE.
[88]
Laskowski, K., Wölfel, M., Heldner, M., Edlund, J. (2008). Computing the fundamental frequency variation spectrum in conversational spoken dialogue systems. I Proceedings of Acoustics'08. (s. 3305-3310). Paris, France.
[89]
Gustafson, J., Edlund, J. (2008). EXPROS : A toolkit for exploratory experimentation with prosody in customized diphone voices. I Perception In Multimodal Dialogue Systems, Proceedings. (s. 293-296).
[90]
Hjalmarsson, A., Edlund, J. (2008). Human-likeness in utterance generation : Effects of variability. I Perception In Multimodal Dialogue Systems, Proceedings. (s. 252-255).
[91]
Beskow, J., Edlund, J., Granström, B., Gustafson, J., Skantze, G. (2008). Innovative interfaces in MonAMI : The Reminder. I Perception In Multimodal Dialogue Systems, Proceedings. (s. 272-275).
[92]
Laskowski, K., Edlund, J., Heldner, M. (2008). Learning prosodic sequences using the fundamental frequency variation spectrum. I Proceedings of the Speech Prosody 2008 Conference. (s. 151-154). Campinas, Brazil: Editora RG/CNPq.
[93]
Gustafson, J., Heldner, M., Edlund, J. (2008). Potential benefits of human-like dialogue behaviour in the call routing domain. I Perception In Multimodal Dialogue Systems, Proceedings. (s. 240-251).
[94]
Edlund, J., Beskow, J. (2007). Pushy versus meek : using avatars to influence turn-taking behaviour. I INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION. (s. 2784-2787). BAIXAS: ISCA-INST SPEECH COMMUNICATION ASSOC.
[95]
Heldner, M., Edlund, J., Carlson, R. (2006). Interruption impossible. I Nordic Prosody: Proceedings of the IXth Conference, Lund 2004. (s. 97-105). Frankfurt am Main, Germany.
[96]
Skantze, G., Edlund, J., Carlson, R. (2006). Talking with Higgins : Research challenges in a spoken dialogue system. I PERCEPTION AND INTERACTIVE TECHNOLOGIES, PROCEEDINGS. (s. 193-196). BERLIN: SPRINGER-VERLAG BERLIN.
[97]
Wallers, Å., Edlund, J., Skantze, G. (2006). The effect of prosodic features on the interpretation of synthesised backchannels. I Perception And Interactive Technologies, Proceedings. (s. 183-187).
[98]
Edlund, J., Heldner, M., Gustafson, J. (2006). Two faces of spoken dialogue systems. I Interspeech 2006 - ICSLP Satellite Workshop Dialogue on Dialogues: Multidisciplinary Evaluation of Advanced Speech-based Interactive Systems. Pittsburgh PA, USA.
[99]
Skantze, G., House, D., Edlund, J. (2006). User Responses to Prosodic Variation in Fragmentary Grounding Utterances in Dialog. I INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING. (s. 2002-2005). BAIXAS: ISCA-INST SPEECH COMMUNICATION ASSOC.
[100]
Edlund, J., Heldner, M. (2006). vertical bar nailon vertical bar : Software for Online Analysis of Prosody. Presenterad vid 9th International Conference on Spoken Language Processing/INTERSPEECH 2006, Pittsburgh, PA, USA, 17-21 September 2006. (s. 2022-2025). BAIXAS: ISCA-INST SPEECH COMMUNICATION ASSOC.
[101]
Edlund, J., Hjalmarsson, A. (2005). Applications of distributed dialogue systems : the KTH Connector. I Proceedings of ISCA Tutorial and Research Workshop on Applied Spoken Language Interaction in Distributed Environments (ASIDE 2005)..
[102]
Edlund, J., House, D., Skantze, G. (2005). The effects of prosodic features on the interpretation of clarification ellipses. I Proceedings of Interspeech 2005: Eurospeech. (s. 2389-2392).
[103]
Heldner, M., Edlund, J., Björkenstam, T. (2004). Automatically extracted F0 features as acoustic correlates of prosodic boundaries. I Fonetik 2004: Proc of The XVIIth Swedish Phonetics Conference. (s. 52-55). Stockholm University.
[104]
Skantze, G., Edlund, J. (2004). Early error detection on word level. I Proceedings of ISCA Tutorial and Research Workshop (ITRW) on Robustness Issues in Conversational Interaction..
[105]
Edlund, J., Skantze, G., Carlson, R. (2004). Higgins : a spoken dialogue system for investigating error handling techniques. I Proceedings of the International Conference on Spoken Language Processing, ICSLP 04. (s. 229-231).
[106]
Skantze, G., Edlund, J. (2004). Robust interpretation in the Higgins spoken dialogue system. I Proceedings of ISCA Tutorial and Research Workshop (ITRW) on Robustness Issues in Conversational Interaction..
[107]
Gustafson, J., Bell, L., Johan, B., Edlund, J., Wirn, M. (2002). Constraint Manipulation and Visualization in a Multimodal Dialogue System. I Proceedings of MultiModal Dialogue in Mobile Environments..

Böcker

[108]
Borin, L., Brandt, M. D., Edlund, J., Lindh, J. & Parkvall, M. (2012). The Swedish Language in the Digital Age/Svenska språket i den digitala tidsåldern. Springer.

Kapitel i böcker

[109]
Edlund, J., Al Moubayed, S. & Beskow, J. (2013). Co-present or Not? : Embodiment, Situatedness and the Mona Lisa Gaze Effect. I Nakano, Yukiko; Conati, Cristina; Bader, Thomas (Red.), Eye gaze in intelligent user interfaces: gaze-based analyses, models and applications (s. 185-203). London: Springer London.
[110]
Edlund, J., House, D. & Beskow, J. (2012). Gesture movement profiles in dialogues from a Swedish multimodal database of spontaneous speech. I Bergmann, Pia; Brenning, Jana; Pfeiffer, Martin C.; Reber, Elisabeth (Red.), Prosodic and Visual Resources in Interactional Grammar. Walter de Gruyter.
[111]
Edlund, J. & Gustafson, J. (2010). Ask the experts : Part II: Analysis. I Juel Henrichsen, Peter (Red.), Linguistic Theory and Raw Sound (s. 183-198). Frederiksberg: Samfundslitteratur.
[112]
Gustafson, J. & Edlund, J. (2010). Ask the experts - Part I: Elicitation. I Juel Henrichsen, Peter (Red.), Linguistic Theory and Raw Sound (s. 169-182). Samfundslitteratur.
[113]
Beskow, J., Carlson, R., Edlund, J., Granström, B., Heldner, M., Hjalmarsson, A. & Skantze, G. (2009). Multimodal Interaction Control. I Waibel, Alexander; Stiefelhagen, Rainer (Red.), Computers in the Human Interaction Loop (s. 143-158). Berlin/Heidelberg: Springer Berlin/Heidelberg.
[114]
Edlund, J. & Heldner, M. (2007). Underpinning /nailon/ - automatic estimation of pitch range and speaker relative pitch. I Müller, C. (Red.), Speaker Classification I: Fundamentals, Features, and Methods. Berlin: Springer.
[115]
Beskow, J., Edlund, J. & Nordstrand, M. (2005). A Model for Multimodal Dialogue System Output Applied to an Animated Talking Head. I Minker, Wolfgang; Bühler, Dirk; Dybkjær, Laila (Red.), SPOKEN MULTIMODAL HUMAN-COMPUTER DIALOGUE IN MOBILE ENVIRONMENTS (s. 93-113). Dordrecht: Springer.
[116]
Edlund, J., Heldner, M. & Gustafson, J. (2005). Utterance segmentation and turn-taking in spoken dialogue systems. I Fisseni, B.; Schmitz, H-C.; Schröder, B.; Wagner, P. (Red.), Computer Studies in Language and Speech (s. 576-587). Frankfurt am Main, Germany: Peter Lang.

Icke refereegranskade

Konferensbidrag

[117]
Tånnander, C., Edlund, J. (2022). Mapping specific characteristics of spoken text to listener ratings. I Proceedings of Fonetik 2022. Stockholm, Sweden.
[118]
Tånnander, C., Edlund, J. (2022). Sardin : speech-oriented text processing. I Proceedings of Fonetik 2022. Stockholm, Sweden.
[119]
Tånnander, C., Edlund, J. (2022). Towards a Swedish test set for speech-oriented text normalisation. Presenterad vid Swedish Language Technology Conference (SLTC),November 18-20 2020, Göteborg. Göteborg: Göteborgs universitet.
[120]
Tånnander, C., Edlund, J. (2021). Self-perceived preferences of voice and speaking style characteristics in spoken text. Presenterad vid Swedish Language Technology Conference (SLTC) 2021.
[121]
Tånnander, C., Edlund, J. (2021). Stress manipulation in text-to-speech synthesis using speaking rate categories. I Proceedings of Fonetik 2021, Centre for Languages and Literature, Lund University. (s. 17-22). Lund.
[122]
Edlund, J., Al Moubayed, S., Tånnander, C., Gustafson, J. (2013). Audience response system based annotation of speech. I Proceedings of Fonetik 2013. (s. 13-16). Linköping: Linköping University.
[123]
Heldner, M., Edlund, J. (2012). Continuer relevance spaces. I Proc. of Nordic Prosody XI. Tartu, Estonia.
[124]
Renklint, E., Cardell, F., Dahlbäck, J., Edlund, J., Heldner, M. (2012). Conversational gaze in light and darkness. I Proc. of Fonetik 2012. (s. 59-60). Gothenburg, Sweden.
[125]
Edlund, J., Hjalmarsson, A., Tånnander, C. (2012). Unconventional methods in perception experiments. I Proc. of Nordic Prosody XI. Tartu, Estonia.
[126]
Edlund, J. (2011). How deeply rooted are the turns we take?. I SemDial 2011: Proceedings of the 15th Workshop on the Semantics and Pragmatics of Dialogue. (s. 196-197).
[127]
Edlund, J., Gustafson, J., Beskow, J. (2010). Cocktail : a demonstration of massively multi-component audio environments for illustration and analysis. I SLTC 2010, The Third Swedish Language Technology Conference (SLTC 2010): Proceedings of the Conference..
[128]
Beskow, J., Edlund, J., Gustafson, J., Heldner, M., Hjalmarsson, A., House, D. (2010). Modelling humanlike conversational behaviour. I SLTC 2010: The Third Swedish Language Technology Conference (SLTC 2010), Proceedings of the Conference. (s. 9-10). Linköping, Sweden.
[129]
Beskow, J., Edlund, J., Gustafson, J., Heldner, M., Hjalmarsson, A., House, D. (2010). Research focus : Interactional aspects of spoken face-to-face communication. I Proceedings from Fonetik, Lund, June 2-4, 2010: . (s. 7-10). Lund, Sweden: Lund University.
[130]
Edlund, J., Heldner, M., Al Moubayed, S., Gravano, A., Hirschberg, J. (2010). Very short utterances in conversation. I Proceedings from Fonetik 2010, Lund, June 2-4, 2010. (s. 11-16). Lund, Sweden: Lund University.
[131]
Beskow, J., Edlund, J., Elenius, K., Hellmer, K., House, D., Strömbergsson, S. (2009). Project presentation: Spontal : multimodal database of spontaneous dialog. I Proceedings of Fonetik 2009: The XXIIth Swedish Phonetics Conference. (s. 190-193). Stockholm: Stockholm University.
[132]
Hincks, R., Edlund, J. (2009). Transient visual feedback on pitch variation for Chinese speakers of English. I Proc. of Fonetik 2009. Stockholm.
[133]
Gustafson, J., Edlund, J. (2008). EXPROS : Tools for exploratory experimentation with prosody. I Proceedings of FONETIK 2008. (s. 17-20). Gothenburg, Sweden.
[134]
Beskow, J., Edlund, J., Granström, B., Gustafson, J., Jonsson, O., Skantze, G. (2008). Speech technology in the European project MonAMI. I Proceedings of FONETIK 2008. (s. 33-36). Gothenburg, Sweden: University of Gothenburg.
[135]
Laskowski, K., Heldner, M., Edlund, J. (2008). The fundamental frequency variation spectrum. I Proceedings of FONETIK 2008. (s. 29-32). Gothenburg, Sweden: Department of Linguistics, University of Gothenburg.
[136]
Edlund, J., Beskow, J., Heldner, M. (2007). MushyPeek : an experiment framework for controlled investigation of human-human interaction control behaviour. I Proceedings of Fonetik 2007. (s. 61-64).
[137]
Edlund, J., Heldner, M. (2006). /nailon/ - online analysis of prosody. I Working Papers 52: Proceedings of Fonetik 2006. (s. 37-40). Lund University, Centre for Languages & Literature, Dept. of Linguistics & Phonetics.
[138]
Skantze, G., House, D., Edlund, J. (2006). Grounding and prosody in dialog. I Working Papers 52: Proceedings of Fonetik 2006. (s. 117-120). Lund, Sweden: Lund University, Centre for Languages & Literature, Dept. of Linguistics & Phonetics.
[139]
Heldner, M., Edlund, J. (2006). Prosodic cues for interaction control in spoken dialogue systems. I Proceedings of Fonetik 2006. (s. 53-56). Lund, Sweden: Lund University, Centre for Languages & Literature, Dept. of Linguistics & Phonetics.
[140]
Carlson, R., Edlund, J., Heldner, M., Hjalmarsson, A., House, D., Skantze, G. (2006). Towards human-like behaviour in spoken dialog systems. I Proceedings of Swedish Language Technology Conference (SLTC 2006). Gothenburg, Sweden.
[141]
Edlund, J., House, D., Skantze, G. (2005). Prosodic Features in the Perception of Clarification Ellipses. I Proceedings of Fonetik 2005: The XVIIIth Swedish Phonetics Conference. (s. 107-110). Gothenburg, Sweden.

Kapitel i böcker

[142]
Borin, L., Domeij, R., Edlund, J. & Forsberg, M. (2023). Language Report Swedish. I Cognitive Technologies (s. 219-222). Springer Nature.

Avhandlingar

[143]
Edlund, J. (2011). In search for the conversational homunculus : serving to understand spoken human face-to-face interaction (Doktorsavhandling , KTH Royal Institute of Technology, Stockholm, Trita-CSC-A 11:03). Hämtad från http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-31172.

Övriga

[145]
Ekström, A. G., Crockford, C., Grawunder, S., Moran, S., Edlund, J. (). Evolution and function of hominid air sacs : A synthesis bearing on vowel production. (Manuskript).
[146]
Ekström, A. G., Holmer, S., Sward, K., Moran, S., Lameira, A. R., Friedrichs, D., Edlund, J. (). Gibbon vowel-like quality is tied to superhuman articulator landmarks. (Manuskript).
[147]
Ekström, A. G., Gannon, C., Edlund, J., Moran, S., Lameira, A. R. (). No neural “missing link” for verbal control in chimpanzees. (Manuskript).
[148]
Ekström, A. G., Gärdenfors, P., Snyder, W., Friedrichs, D., McCarthy, R. C., Tsapos, M., Tennie, C., Strait, D. S., Edlund, J., Moran, S. (). Phonetic correlates of hominin evolution in the late Pliocene and Pleistocene epochs : Becoming pre-adapted for speech. (Manuskript).
[149]
Ekström, A. G., Bortolato, T., Wittig, R. W., Shumaker, R. W., Masi, S., Nellissen, L., Crockford, C., Moran, S., Lameira, A. R., Edlund, J. (). Reverse engineering great ape vocal tract configurations with implications for evolving speech biomechanics. (Manuskript).
Senaste synkning med DiVA:
2024-10-31 00:15:53