Open access peer-reviewed chapter

FAIME: A Framework for AI-Assisted Musical Devices

Written By

Miguel Civit, Luis Muñoz-Saavedra, Francisco Cuadrado, Charles Tijus and María José Escalona

Submitted: 25 October 2022 Reviewed: 07 November 2022 Published: 16 December 2022

DOI: 10.5772/intechopen.108898

From the Edited Volume

Internet of Things - New Trends, Challenges and Hurdles

Edited by Manuel Domínguez-Morales, Ángel Varela-Vaca and Lourdes Miró-Amarante


Abstract

In this paper, we present a novel framework for the study and design of AI-assisted musical devices (AIMEs). Initially, we present a taxonomy of these devices and illustrate it with a set of scenarios and personas. Later, we propose a generic architecture for the implementation of AIMEs and present some examples from the scenarios. We show that the proposed framework and architecture are a valid tool for the study of intelligent musical devices.

Keywords

  • artificial intelligence
  • musical devices
  • internet of musical things

1. Introduction

Advances in technology and computer science have greatly enhanced the possibility of designing, developing, and deploying intelligent musical devices. A typical well-studied subset of these intelligent devices are IoMusTs (Internet of Musical Things). According to [1], an IoMusT is a “computing device capable of sensing and exchanging data to serve a musical purpose.” An IoMusT does not need to be able to produce, select, or modify music; it can be any device that is “music aware” in the sense that its behavior is directly related to music. As an example, PixMob devices [2] have been widely used in musical performances. These devices can be worn (smartbands), thrown (balls), or attached to audience seats, and they produce light patterns synchronized with live performances.

Not all intelligent musical devices are IoMusTs. We can design intelligent devices where the intelligence is embedded in the device, and thus we may say that we have an Intelligent Musical dEvice (IME) that is not part of the Internet of Musical Things. In [3], the evolution of the design of intelligent musical instruments is studied. In most cases, these instruments use artificial intelligence as a tool for user interaction without requiring any connection to public networks or cloud-based services. It is important to consider that machine learning (ML), in most of these cases, cannot be considered an independent agent but mainly one of the possible alternatives for designing the layers of a complete system. These types of devices can also be considered cyber-physical systems, as they clearly require intelligent software systems and dedicated hardware.

In this work, we create a framework that covers all, or at least a very wide range of, intelligent musical devices and helps design, understand, and study them.

The rest of the paper is organized as follows: First, the Materials and Methods section details the taxonomy and related published work, as well as the analysis methodology used to test the different systems. The results obtained for the different systems are then detailed and explained in the Results and Discussion sections. Finally, conclusions are presented.


2. Materials and methods

Artificial-intelligence-assisted musical devices come in a wide variety of forms and potentially have a very wide spectrum of uses. In order to create a framework that covers most of these possibilities, we start by introducing a taxonomy of the different usages of these devices. It should be clear that a device may fall into several categories. As an example, most musical instruments could also be considered educational aids, and some are used predominantly for this purpose. The monochord was used throughout the Middle Ages for educational and scientific purposes [4], and similarly, we can design intelligent instruments that, although usable for performing, are designed with an educational intent.

2.1 Taxonomy

We propose a classification for AI-assisted musical devices (AIMEs). It is clear that this is not the only possible taxonomy, but it is complete, easy to apply, and useful. The classification is shown in Table 1.

1. Musical instruments
  1. AI-assisted instruments

  2. Augmented instruments

2. Music processors
  1. Instrumental modifiers

  2. Voice modifiers

  3. General sound processors

3. Music generators
  1. Instrumental

  2. Voice

  3. Combined

4. Music recommendation devices
  1. Ambient aware recommendation

  2. User aware recommendation

  3. Combined

5. Music-related feedback systems
  1. Personal Feedback

  2. Ambient Feedback

  3. Combined

6. Educational Aids
  1. Music Education

  2. General educational support

  3. Rehabilitation

Table 1.

AI-assisted musical device (AIME) taxonomy.

At the first level, we divide our AIMEs into:

  • Devices that are played by musicians: Musical instruments.

  • Devices designed to modify music: Music Processors.

  • Devices that compose music: Music Generators.

  • Devices that select music: Music Recommenders.

  • Devices that send to the user or the environment information extracted from the music: Feedback systems.

  • Devices designed to be used in an educational process: Educational Devices.

A real device may be included in several categories. As an example, a device could generate a set of music scores and then recommend some of them to a student. In this way, this device could be considered as a generator, a recommender, and an educational system.
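As a small illustration of this multi-category membership, the sketch below encodes the top-level categories of Table 1 as a Python enumeration and tags a hypothetical score-generating tutor with several of them; the names are our own shorthand, not part of the chapter.

```python
# Top-level AIME categories from Table 1, encoded as an enum so a single device
# can be tagged with several of them at once. The example device is hypothetical.
from enum import Enum, auto

class AimeCategory(Enum):
    INSTRUMENT = auto()
    PROCESSOR = auto()
    GENERATOR = auto()
    RECOMMENDER = auto()
    FEEDBACK = auto()
    EDUCATIONAL = auto()

# A device that generates scores and then recommends some of them to a student
# falls into three categories simultaneously.
score_tutor = {AimeCategory.GENERATOR, AimeCategory.RECOMMENDER, AimeCategory.EDUCATIONAL}

print(AimeCategory.GENERATOR in score_tutor)   # True
```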

Each of these main AIME categories can then be divided into subcategories. As an example, a music generator can be instrumental, vocal, or combined. An instrumental music generator usually produces music in symbolic format. The most common symbolic format is the Musical Instrument Digital Interface (MIDI), which encodes the pitch, start time, stop time, and other properties of each individual note rather than the resulting sound. Combined and voice generators have to use a raw audio format and are much more difficult to implement, although their quality has improved significantly in the present decade [5].
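To make the symbolic representation concrete, the following minimal sketch writes a three-note phrase to a standard MIDI file; the choice of the mido library and the note values are our own assumptions for illustration, not part of the cited systems.

```python
# Minimal sketch: writing a short symbolic (MIDI) phrase with the mido library.
# The note and timing values are illustrative only.
from mido import Message, MidiFile, MidiTrack

mid = MidiFile()            # default resolution: 480 ticks per quarter note
track = MidiTrack()
mid.tracks.append(track)

# A generator would supply these (pitch, duration) pairs; here they are hard-coded.
phrase = [(60, 480), (62, 480), (64, 960)]   # C4, D4, E4

for pitch, ticks in phrase:
    track.append(Message('note_on', note=pitch, velocity=64, time=0))
    track.append(Message('note_off', note=pitch, velocity=64, time=ticks))

mid.save('generated_phrase.mid')
```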

As a further example, recommendation devices can recommend music as a function of the environment or as a function of the user's state. Environment-based recommendation is mostly used in social scenarios, e.g., when the system selects music for a shopping mall or an elevator. Personal music recommendation devices are used mostly when recommending for a single user. As an example, we could estimate the user's emotional state from data from a wearable device [6] and select the music accordingly. It is also possible to use the data acquired by an AIME personal recommender to try to modify some aspects of user behavior. An interesting possibility would be to train the user, through music, to reduce his or her stress level. In this way, the device could also be considered part of the Internet of Behavior (IoB) [7].
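A hypothetical sketch of the user-aware selection logic described above is shown below; the mood labels, playlist names, and stress-reduction rule are illustrative assumptions rather than part of any cited system.

```python
# Hypothetical user-aware recommender: map an estimated emotional state to a
# playlist and optionally bias the choice toward calmer material, in the spirit
# of the IoB example above. Mood labels and playlist names are illustrative.
PLAYLIST_BY_MOOD = {
    "stressed": "ambient",
    "sad": "acoustic",
    "neutral": "indie-mix",
    "happy": "upbeat-pop",
}

def recommend(mood: str, reduce_stress: bool = False) -> str:
    """Return a playlist name for the estimated mood."""
    playlist = PLAYLIST_BY_MOOD.get(mood, "neutral-mix")
    if reduce_stress and mood in ("stressed", "sad"):
        # Behavioral nudge: prefer calming material when stress reduction is the goal.
        playlist = "ambient"
    return playlist

print(recommend("happy"))                         # upbeat-pop
print(recommend("stressed", reduce_stress=True))  # ambient
```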

2.2 Intelligent instrument scenarios

The area of intelligent musical instruments [8] includes an important subset of musical devices and has a wide range of applications that we will present in four example scenarios.

2.2.1 Able instrument scenario

Mike had an accident that led to a problem that prevents him from playing with his right hand. However, he would like to continue playing bass in a small blues band. Mike thought he would not be able to play again as a bass player, as most instruments require significant ability with both hands. There are several alternatives to adapt the instrument to his physical capabilities [9], but he finally settled on a small robotic mechanism that can detect which string he is fretting with his left hand and pluck it. This device can hear what other members of the band are playing and dynamically adapt to the tempo and genre of the song by varying the rhythms and patterns it plays.

Although the results do not match his earlier performances, Mike is still able to play well enough and have fun with his friends’ band.

2.2.2 Drum stroke scenario

Toby recently had a stroke that left him with reduced mobility in his right hand. In his rehabilitation clinic, they proposed that he follow complementary music-supported therapy (MST) in which he controls a set of MIDI drums through hand gestures [10] detected through electromyography (EMG) signals. The drums can play almost autonomously at the beginning of therapy and allow control of an increasing number of variables as Toby progresses in his recovery.

The rehabilitation device keeps track of Toby’s progress and periodically sends reports to his therapist. When Toby goes to the clinic for an in-person session, the therapist will discuss his progress and adapt the MST accordingly.

2.2.3 Teach and play scenario

Mary wants to start playing the concertina and is following a well-known book and taking some lessons online. However, she does not like the sound she currently produces with the instrument and refuses to play it anywhere. A friend tells her about Inteltina, an intelligent didactical concertina that augments Mary's abilities and helps her produce a nice sound. The instrument's assistance dynamically decreases as Mary's playing capabilities improve.

Although Mary plays reasonably well with Inteltina, her online teacher warns her that this type of instrument sometimes backfires as the student becomes lazy and her abilities stagnate [8].

2.2.4 TherAImin

Sara is a computer scientist who plays piano as a hobby. Recently, she has become fascinated by her discovery of the Theremin [11]. Figure 1 shows an early implementation of the Theremin. Being an AI specialist, she believes that the design can be clearly improved with the help of AI. Thus, she decides to become a “digital luthier” and create a new instrument that is faithful to the original Theremin concept. The TherAImin keeps the pitch and volume antennas of the original instrument but includes an AI-based gesture recognizer to change the timbre of the instrument [12] according to hand gestures.

Figure 1.

Alexandra Stepanoff playing the theremin, 1930.

This scenario reflects the creation of new digital AI-supported musical instruments. Several interesting reflections on this topic can be found in [3].

This type of instrument is fun to build and play, but it can be difficult to create a community of users around it.

2.3 Audio processing scenarios

This area includes instrument processors, voice processors, and generic audio processors.

2.3.1 Boogie boogie scenario

Saul is a professional guitar player. He would love to have a Mesa Boogie Mark V amplifier, but the price is too high for him. Saul knows that there are emulations of this amp for several Digital Audio Workstations (DAWs), including Cubase, which he regularly uses. However, Saul would like to have the emulation as a pedal he can easily carry. He has several friends who work at a small start-up company that designs embedded deep learning devices, and he learns from them that the Boogie can be emulated by an AI system [13] that can be implemented using a Coral Edge TPU accelerator [14].

Within a few months, Saul has tested the device, and the company is starting to sell the BoogieBoogie Pedal.

2.3.2 DeepTuner Scenario

Sara is a singer who regularly uses a pitch-correction voice processor for her performances. Currently, she uses an AI-enhanced version of Antares Auto-Tune [15] on an Avid Carbon device. She is satisfied with the natural feeling and virtually unnoticeable delay that this hardware/software implementation brings to her performances. Nevertheless, she would love a similar pitch-correction implementation in a smaller and cheaper device [16].

2.3.3 DeepAFx scenario

Kyra is a production engineer. Since she discovered the deep-learning-based LV2 DeepAFx plug-in framework [17], she has used it regularly to control her DAW and to introduce several effects. Although she always fine-tunes the work manually, the use of the framework has clearly improved her schedule. Kyra would love to have a device with an embedded version of these plug-ins for live performances.

2.4 Music generator scenarios

In this subsection, we present two scenarios that rely on the use of different AI-based music generators.

2.4.1 On hold scenario

Peter has a small online retail business with a telephone customer service line. He wants some copyright-free music to play for customers on hold until an agent can handle their call. He wants the music to change according to the expected waiting time, the time of day, and other circumstances.

Peter has heard about AI-based music generation technology [5] and, after searching online, decides to select some compositions made using AIVA and Computoser [18]. Peter consults his guitar player friend Saul to help him decide which parameters would be best for the different music fragments that he wants for the customer service line. An automated controller dynamically changes the generator parameters to create the desired result.

Peter would like to be able to estimate the emotional state of the client [6] and change the music accordingly; however, this is not possible in a standard phone call. When clients use the customer service app, the music changes according to their comments [19]. All the generators in this scenario produce symbolic music in MIDI format. This format is suitable for instrumental music and produces results of a quality that can be adequate for the proposed scenario.

2.4.2 Singing elevator scenario

Mia is a Design Engineer for a large elevator company. In their latest models, the elevators are fitted with a screen that mainly provides news and weather information. Mia wants to have copyright-free background songs while the elevator is in use.

After studying several alternatives, Mia decides to generate the songs dynamically based on the characteristics of the building (residential, commercial, neighborhood, etc.). To generate the songs, she uses the OpenAI Jukebox generator [20] and updates the songs on a regular basis. The entire selection of songs according to the different situations is performed by the elevator media controller, which can also be considered a musical thing.

This scenario uses a nonsymbolic, direct-audio music generator. This type of generator is much less common than the symbolic alternatives, but its results have become increasingly acceptable to end users in recent years.

2.5 Music recommendation device scenarios

2.5.1 Emotiwatch scenario

Sam is a sports and music fan. Every morning he runs for an hour. While running, Sam likes to listen to music. His musical choices clearly depend on his mood. For years, Sam has selected his songs directly, but he would prefer, at least sometimes, that his smartwatch make the selection for him. It is well known [21, 22] that emotional states and stress can be predicted with AI technology from physiological indicators, mainly electrodermal activity (EDA), heart rate variability (HRV), and, to a smaller extent, peripheral oxygen saturation (SpO2). Several wearable devices, including smartwatches such as the Fitbit Charge 2 or Sense [22] or the research-oriented Empatica E4 wristband, are capable of measuring at least a subset of these parameters.

Sam finds an app for his watch [23] that selects music based on his mood. The watch, which was already a musical thing, becomes an AI-assisted musical device and lets Sam keep his mind on running.
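As a hedged illustration of the kind of processing such an app might perform, the sketch below computes a standard HRV feature (RMSSD) from RR intervals and applies a placeholder rule in place of a trained classifier; the thresholds and sensor values are illustrative only.

```python
import numpy as np

def rmssd(rr_intervals_ms: np.ndarray) -> float:
    """Root mean square of successive RR-interval differences, a common HRV feature."""
    diffs = np.diff(rr_intervals_ms)
    return float(np.sqrt(np.mean(diffs ** 2)))

def estimate_mood(rr_intervals_ms, eda_microsiemens) -> str:
    """Placeholder for a trained classifier: low HRV plus high EDA is read as 'stressed'.
    The thresholds are illustrative, not validated values."""
    hrv = rmssd(np.asarray(rr_intervals_ms, dtype=float))
    eda = float(np.mean(eda_microsiemens))
    if hrv < 20.0 and eda > 5.0:
        return "stressed"
    return "calm"

# Illustrative sensor windows (not real measurements)
print(estimate_mood([810, 795, 800, 805, 798], [6.1, 6.4, 6.2]))   # stressed
```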

2.5.2 iClock scenario

Jane, like a great part of the population in many countries, has had sleep problems for a long time. The relationships between sleep disorders and anxiety, depression, overweight, and diabetes are well known to the medical community [24]. As part of her treatment, her psychologist tells Jane that some new devices could help restore her sleep quality. Among these devices, Jane finds iClock, a new device that monitors her sleep using Jane's smartwatch and modifies her wake-up routines taking into account her schedule needs, the sleep monitoring data, and an estimation of her emotional state. Among the different aspects that iClock controls is the selection and modification of melodies according to the chosen wake-up routine. Thus, iClock is, among other things, an AI-assisted musical device.

Following her therapist's recommendations, including the use of iClock, Jane's sleep patterns improve, which in turn is clearly reflected in an improvement in her quality of life.

2.6 Feedback device scenarios

2.6.1 RumbleRumble scenario

Gina has a moderate hearing problem. She likes to go to concerts with friends. However, she feels that she is losing an important part of the information. Recently, she learned about the Subpac backpack [25], which uses haptics, interoception, and bone conduction to deliver bass sensation even to profoundly deaf users. Although the current version of the device requires an external computer to run the software, Gina is using an experimental version that runs on an embedded controller, thus making the Subpac a personal feedback AIME.

2.6.2 MagicShoes scenario

Peter has a problem with his weight. He has tried several solutions, but none seem to work well for him. He has even tried game-based approaches [26] with little success. Peter is very fond of music, and he hears from a musician friend about a wearable device that uses sound to promote sports activity and to change one's own body perception. He starts using MagicShoes [27] and finally finds a way to reduce weight in a fun way that adequately fits his tastes and habits. A future update includes machine learning capabilities so that the device selects music based on the user's preferences.

2.6.3 Let there be light scenario

Nico really likes going to rock music performances. He especially loves when people start following the music with their lighters. In some recent concerts, this has even improved thanks to new musical device technology. When Nico went to his last concert, he was given a PixMob LED wristband. These devices have a set of preprogrammed effects that are usually triggered by a human operator. Nevertheless, an AI-based controller that decides which effect to apply according to both the concert and the wearer's circumstances is currently perfectly feasible. In this way, the wristband would become an AIME (Figure 2).

Figure 2.

MagicShoes prototype.

2.7 Educational scenarios

2.7.1 Teach and play scenario: again

The Teach and Play scenario presented in Section 2.2.3 is also clearly an educational musical device scenario and could have been presented in this subsection as well.

2.7.2 The magiFlute Scenario

John is 13 years old and has a moderate learning disability. His music teacher recommends that he use a new accessible digital musical instrument (ADMI) [28] known as the magiFlute. This instrument is an Electronic Wind Instrument (EWI) [26], which is similar to a recorder but does not produce sound directly. It has sensors for wind and touch pressure and controls a synthesizer through an embedded deep learning system. It also uses John's iPad to help him remember what to play and how to play it. It can even automatically correct what John is playing when configured to do so. With the magiFlute, John participates in the school band and is becoming a better standard recorder player every day (Figure 3).

Figure 3.

Magic Flute and a typical EWI.

2.7.3 Magic flute scenario

In the previous scenario, we used the term magiFlute for our proposed instrument because its near-namesake, the “Magic Flute,” is a completely different existing ADMI: an EWI that is controlled by very small head movements [27]. This instrument is played by Ellen, who has a spinal cord injury as a result of a motorcycle accident.

2.8 Processing architecture

In this work, we propose a generic architecture for the design of musical devices. This architecture is based on a multilayered approach. The proposed layers, sketched in code after Figure 4, are structured as follows:

  • User stimuli capture and processing layer

  • Embedded learning layer

  • Music adaptation layer

  • Music production layer

  • User feedback layer

The block diagram for this proposed architecture is shown in Figure 4.

Figure 4.

Generic AIME Block diagram.
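To complement Figure 4, the following minimal Python sketch shows how the five layers could be chained; the class and method names are our own illustration, not a prescribed API.

```python
# Minimal sketch of the five-layer AIME pipeline (our own illustration): each
# layer exposes a process() method and the output of one layer feeds the next,
# mirroring the block diagram in Figure 4.

class StimulusCapture:
    """User stimuli capture and processing layer: sensors, cameras, antennas."""
    def process(self, raw):
        return {"features": raw}

class EmbeddedLearning:
    """Embedded learning layer: classify or estimate from the captured features."""
    def process(self, data):
        data["decision"] = "gesture_class_0"   # placeholder for a model inference
        return data

class MusicAdaptation:
    """Music adaptation layer: map decisions to musical parameters."""
    def process(self, data):
        data["params"] = {"timbre": data["decision"], "volume": 0.8}
        return data

class MusicProduction:
    """Music production layer: synthesis, playback, or effect rendering."""
    def process(self, data):
        return f"audio rendered with {data['params']}"

class UserFeedback:
    """User feedback layer: visual, haptic, or report output."""
    def process(self, audio):
        return f"feedback shown for: {audio}"

def run_aime(raw_input, layers):
    """Push one input sample through the layer chain."""
    data = raw_input
    for layer in layers:
        data = layer.process(data)
    return data

pipeline = [StimulusCapture(), EmbeddedLearning(), MusicAdaptation(),
            MusicProduction(), UserFeedback()]
print(run_aime("camera frame + antenna readings", pipeline))
```

A concrete AIME would replace each placeholder with real sensor drivers, trained models, and synthesis back ends.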

Having presented the methodology used to examine the ideas discussed in the Introduction, we detail the results in the next section.


3. Results

In this section, we present a possible implementation for some of the devices proposed in the scenarios. In this way, we will verify the suitability of the proposed generic architecture and, thus, the usefulness of the framework presented.


4. Scenario implementation

We will briefly describe a possible implementation of the TherAImin. This implementation is presented to show that the proposed framework provides a usable foundation for building AI-assisted devices and describing them in a systematic manner.

We think that even though we do not present a device for each of the possible categories, the differences between the selected AIMEs are wide enough to show that, in principle, any AIME can be implemented using the framework.

4.1 TherAImin

As discussed in Section 2.2.4, the Theremin is an instrument with two antennas that is controlled by the player without touch interaction. The block diagram of the Theremin is shown in Figure 5. The TherAImin is an AI-assisted variation of the original instrument, in which hand gestures are used to control the timbre.

Figure 5.

Block diagram of the Theremin and the TherAImin.

Although we could have implemented the TherAImin without antennas using, e.g., MediaPipe Handpose [29], we have decided to stay more faithful to the original instrument and thus use the openTheremin [30], which provides a versatile Theremin implementation with pitch and volume outputs.

Thus, the openTheremin antennas act as part of the user-stimulus capture layer. The other part of this layer is a camera used to capture the user's hand gestures.

We interface the openTheremin using a Raspberry Pi board with an RPI-GP90 pulse signal IO hat. This is part of the stimulus adaptation layer. The other part of this layer is made up of the video interface already available on the Raspberry Pi.

The embedded learning layer is built using Google's Teachable Machine [31] accelerated with a Coral Edge TPU accelerator. The approach is very similar to [32], where a teachable machine is used to recognize objects. With this approach, the accelerated embedded system classifies the gestures into the trained classes. It is important to keep the gesture classes clearly distinct, and it is also essential to include in the background class a wide variety of gestures and other images that the camera may see [33]. An advantage of the TherAImin is that when the AI system makes a wrong decision, this affects the timbre and the effects, but not the volume and the pitch.
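As a hedged sketch of how the embedded learning layer could invoke such a classifier, the code below loads an image model exported from Teachable Machine and compiled for the Edge TPU using the tflite_runtime package; the model file name is a placeholder.

```python
# Hedged sketch of the embedded learning layer: running a Teachable Machine image
# model compiled for the Coral Edge TPU with tflite_runtime. The model file name
# and input preprocessing are placeholder assumptions for illustration.
import numpy as np
from tflite_runtime.interpreter import Interpreter, load_delegate

interpreter = Interpreter(
    model_path="gestures_edgetpu.tflite",                 # hypothetical exported model
    experimental_delegates=[load_delegate("libedgetpu.so.1")])
interpreter.allocate_tensors()
inp = interpreter.get_input_details()[0]
out = interpreter.get_output_details()[0]

def classify_gesture(frame: np.ndarray) -> int:
    """Return the index of the most likely gesture class for one camera frame."""
    batch = np.expand_dims(frame, axis=0).astype(inp["dtype"])
    interpreter.set_tensor(inp["index"], batch)
    interpreter.invoke()
    scores = interpreter.get_tensor(out["index"])[0]
    return int(np.argmax(scores))
```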

The sound production layer is implemented on the Raspberry Pi using Sonic Pi [34]. The selection of sound pitch and volume is done by a small Processing program that produces OSC messages [35]. Open Sound Control (OSC) is a protocol for connecting sound synthesizers, computers, and other multimedia devices for purposes such as musical performance or show control. Many music-related software tools, including Sonic Pi, support the OSC protocol. OSC uses UDP (or TCP) packets and can thus run either in a single embedded system or be distributed over a network.
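For illustration only (the TherAImin uses a small Processing program for this step), the snippet below shows how equivalent OSC messages could be sent from Python with the python-osc package; the OSC address, argument layout, and port are our own assumptions.

```python
# Illustration only: sending OSC messages to Sonic Pi from Python with python-osc.
# The OSC address, argument layout, and port are assumptions, not part of the chapter.
from pythonosc.udp_client import SimpleUDPClient

SONIC_PI_HOST = "127.0.0.1"
SONIC_PI_OSC_PORT = 4560          # Sonic Pi's default incoming OSC port in recent versions

client = SimpleUDPClient(SONIC_PI_HOST, SONIC_PI_OSC_PORT)

# Pitch and volume come from the antennas; the timbre index comes from the gesture
# classifier in the learning layer.
client.send_message("/theraimin/note", [440.0, 0.7, 2])   # frequency (Hz), amplitude, timbre class
```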

The selection of timbre and effects is done by the learning layer as a function of the gestures and sent to the generation layer using OSC. In this way, TherAImin produces audio as a function of the hand positions captured by the antennas and the gestures captured by the camera and recognized by the teachable machine.

The TherAImin, as an extension of the Theremin, can be considered in the augmented musical instrument class in the FAIME taxonomy.


5. Discussion

The approach used to implement the TherAImin could be used for both simpler and more complex devices. As an example, the “Singing Elevator” can use presence, temperature, humidity, and noise sensors, together with additional information from the Internet, as user stimuli. Using aggregation of estimators, this process can be further improved [36]. Using a simple learning layer (local or not), it decides from which category it should retrieve the generated music. The production layer would be a simple player with possible adaptations to handle noise in the cabin or other issues.

As a further example, the Emotiwatch uses the wearable device's sensors as the stimuli capture layer, preprocesses their signals in the stimuli adaptation layer, estimates the user's emotional state and selects the music in the learning layer, and outputs the music through a player in the production layer. The same approach can be used to start the design of any AIME.

It is clear that many other approaches could have been proposed, but FAIME is simple and gives clear insights into the musical device design process.


6. Conclusions

In this work, we have presented a useful framework for the classification, understanding, and design of AI-assisted musical devices.

We have shown that a very wide range of devices can fall into this category, including such different things as accessible instruments for disabled musicians and alarm clocks that help people with sleeping disorders.

We have presented a fairly detailed implementation of a variation of a successful musical instrument designed in the 1920s, the Theremin. Our augmentation allows the player to select timbre and effects in real time through hand gestures while keeping the look and feel of the original instrument when it is played with open hands.

We also included a short description of the design of other AIMEs to show the usefulness of the framework.

In future work, we will evaluate the user experience of TherAImin with musicians and study possible modifications for performers with disabilities using the powerful embedded intelligent system.


Acknowledgments

This work was supported by the NICO project (PID2019-105455GB-C31) from Ministerio de Ciencia, Innovación y Universidades (Spanish Government) and by DAFNE (US-1381619) Consejería de Economía y Conocimiento (Junta de Andalucia).

References

  1. Turchet L, Fischione C, Essl G, Keller D, Barthet M. Internet of musical things: Vision and challenges. IEEE Access. 2018;6:61994-62017
  2. Clark D, Westin F, Girouard A. iSNoW: User perceptions of an interactive social novelty wearable. In: Adjunct Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2019 ACM International Symposium on Wearable Computers. 2019. pp. 268-271
  3. Fiebrink R, Sonami L. Reflections on Eight Years of Instrument Creation with Machine Learning. Goldsmiths, University of London; 2020
  4. Buehler-McWilliams K, Murray RE. The monochord in the medieval and modern classrooms. Journal of Music History Pedagogy. 2013;3:151-172
  5. Briot JP, Hadjeres G, Pachet FD. Deep Learning Techniques for Music Generation. Springer; 2020
  6. Muñoz-Saavedra L, Luna-Perejón F, Civit-Masot J, Miró-Amarante L, Civit A, Domínguez-Morales M. Affective state assistant for helping users with cognition disabilities using neural networks. Electronics. 2020;9:1843
  7. Rahaman T. Smart things are getting smarter: An introduction to the internet of behavior. Medical Reference Services Quarterly. 2022;41:110-116
  8. Jordà S. Instruments and players: Some thoughts on digital lutherie. Journal of New Music Research. 2004;33:321-341
  9. Harrison J. Instruments and Access: The Role of Instruments in Music and Disability [Ph.D. dissertation]. Queen Mary University of London; 2020
  10. Dieckmann M. EMG/Motion Capture-Based Accessible Music Interfaces for Rehabilitation. 2020
  11. Theremin LS, Petrishev O. The design of a musical instrument based on cathode relays. Leonardo Music Journal. 1996;6:49-50
  12. McAdams S, Giordano BL. The perception of musical timbre. In: The Oxford Handbook of Music Psychology. Oxford Academic; 2009. pp. 72-80
  13. Wright A, Damskägg EP, Juvela L, Välimäki V. Real-time guitar amplifier emulation with deep learning. Applied Sciences. 2020;10:766
  14. Civit-Masot J, Luna-Perejón F, Corral JMR, Domínguez-Morales M, Morgado-Estévez A, Civit A. A study on the use of Edge TPUs for eye fundus image segmentation. Engineering Applications of Artificial Intelligence. 2021;104:104384
  15. Mårtensson B. The Timbral and Quality Affect from Pitch Correction Software on a Recorded Vocal Performance [Dissertation]. 2022. Retrieved from: http://urn.kb.se/resolve?urn=urn:nbn:se:ltu:diva-90744
  16. Wager S, Tzanetakis G, Wang CI, Kim M. Deep autotuner: A pitch correcting network for singing performances. In: ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2020. pp. 246-250
  17. Martinez Ramirez MA, Wang O, Smaragdis P, Bryan NJ. Differentiable signal processing with black-box audio effects. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2021
  18. Bozhanov B. Computoser-Rule-Based, Probability-Driven Algorithmic Music Composition. arXiv preprint arXiv:1412.3079. 2014
  19. Salas J. Generating music from literature using topic extraction and sentiment analysis. IEEE Potentials. 2018;37:15-18
  20. Dhariwal P, Jun H, Payne C, Kim JW, Radford A, Sutskever I. Jukebox: A Generative Model for Music. arXiv preprint arXiv:2005.00341. 2020
  21. Assabumrungrat R et al. Ubiquitous affective computing: A review. IEEE Sensors Journal. 1 Feb 2022;22(3):1867-1881. DOI: 10.1109/JSEN.2021.3138269
  22. Williams SH. A Validation Study: Fitbit Charge 2 Heart Rate Measurement at Rest and During Cognitive-Emotional Stressors. 2021
  23. Linger O. Designing a User-Centered Music Experience for the Smartwatch [Dissertation]. 2018. Retrieved from: http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-231061
  24. Staner L. Sleep and anxiety disorders. Dialogues in Clinical Neuroscience. 2003;5(3):249-258. DOI: 10.31887/DCNS.2003.5.3/lstaner
  25. Schmitz A, Holloway C, Cho Y. Hearing through vibrations: Perception of musical emotions by profoundly deaf people. arXiv preprint arXiv:2012.13265. 2020
  26. Snyder J. The birl: Adventures in the development of an electronic wind instrument. In: Musical Instruments in the 21st Century. Springer; 2017. pp. 181-205
  27. Davanzo N, Avanzini F. Experimental evaluation of three interaction channels for accessible digital musical instruments. In: International Conference on Computers Helping People with Special Needs. 2020. pp. 437-445
  28. Frid E. Accessible digital musical instruments—a review of musical interfaces in inclusive music practice. Multimodal Technologies and Interaction (MDPI). 2019;3
  29. Sung G, Sokal K, Uboweja E, Bazarevsky V, Baccash J, Bazavan EG, et al. On-device Real-time Hand Gesture Recognition. arXiv preprint arXiv:2111.00038. 2021
  30. GaudiLabs. Open Theremin - Open Source Hardware Project. 2022. Available from: https://github.com/GaudiLabs/OpenTheremin_V3
  31. Carney M, Webster B, Alvarado I, Phillips K, Howell N, Griffith J, et al. Teachable machine: Approachable Web-based tool for exploring machine learning classification. In: Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems. 2020. pp. 1-8
  32. Tyka M. Embedded Teachable Machine. April 2019. Available from: https://teachablemachine.withgoogle.com/
  33. Muñoz-Saavedra L, Civit-Masot J, Luna-Perejón F, Domínguez-Morales M, Civit A. Does two-class training extract real features? A COVID-19 case study. Applied Sciences. 2021;11:1424
  34. Aaron S, Blackwell AF, Burnard P. The development of Sonic Pi and its use in educational partnerships: Co-creating pedagogies for learning computer programming. Journal of Music, Technology & Education. 2016;9:75-94
  35. Wright M. OpenSound Control Specification. UC Berkeley: Center for New Music and Audio Technologies; 2002
  36. El Ghali K, El Ghali A, Tijus C. Multimodal Automatic Tagging of Music Titles using Aggregation of Estimators. MediaEval; 2012
