Sunday, May 2, 2010

On Music and Language - Part 1

Most if not all of us are familiar with the oft-romanticized idiom "music is a universal language". Far be it from me to indulge in hyperbole, but for us Computational Creativitist [you heard it here first, folks], the hypothesis may contain some validity...some substance. Fully validating or refuting the hypothesis is far beyond the scope of any blog post. But I intend on this article being the first in a series, each exploring a different dimension of this deceptively simple idiom.

Before we tackle the formidable "universal" qualifier, let us try and conceptualize what it means for something to be a language. How does one define a Language? The linguists among us would have undoubtedly belted out that Language can be defined at varying levels of granularity. Perfect! Let us explore this very granularity. It should follow intuitively that each level of granularity might reveal a proportionally granular analogy for music.

So...as all scientists must, lets begin with an absolute minimal construct for a Language. Lets for the moment, describe a Language as:

A system of symbols that encode/decode information. [1]

Thus, a Language is a Collection of Symbols which when chained together temporally, gains a sort of semantic "meaning" - the meaning, of course, being the Information that we wish to decode/encode. Furthermore, the Encoding Process can be abstracted to the action of translating “mental” information into a chain of symbols, and consequently the Decoding Process translates the chain of symbols into the original information. Do note that we are not restricting our definition of Language to being either written or spoken. The definition is extensible to both.

For the moment let us also assume that we expect translation to be a perfectly symmetric function i.e. no ambiguity in a particular concept being encoded to some chain of symbols, and then decoded back to a different concept.

Is this a reasonable enough definition of Language? For the moment, I will leave the educated reader to make his/her assessment. Lets move on.

Could this definition of Language possibly offer us an analogy for music? After all, it is so simple...so primitive in its articulation. Lets give it a shot:

  • Collection of Symbols -> All notes in the chromatic scale [note that I am making no attempt to restrict this to the Western Tempered 12-note scale. There isn't any need to]

  • Information -> People, Places, Events, Cultural and Social Archetypes, and perhaps the entire human Emotional Spectrum. Its a free-for-all.

  • Encoding Process -> Converting “mental” information into a sequence of notes

  • Decoding Process -> Converting the sequence of notes back into the original information.

As we did for our definition of language, we are not restricting this definition to written [staff notation] music or performed [audible] music. It should be evident that the definition is extensible to both for music as well.

Is this a reasonable enough definition of music? If not, where are the points of contention? If I were a betting man, I would bet that most of us had a uneasy reaction to that definition at the very least. The uneasiness is starring us all right in the face! The answer is in the encoding/decoding process. Can ANY concept be encoded into a string of notes, transcribed and then played/read by another to decode the original concept? The answer is a resounding HELL NO. Quite obviously, we have violated the symmetry assumption we made with our definition of language. So:

Knee-jerk reaction: This level of granularity in defining a language does not provide a reasonably analogous definition of music

And such a reaction would be completely valid. But I believe we have overlooked a rather crucial detail, one that I have purposely omitted. Is our definition of language complete? After all, the definition appeared on Wikipedia. The ardent linguists will have undoubtedly beaten me to the punch, but for the rest of us mere mortals, note that our definition of language said NOTHING about syntax, formal grammar, structure or anything of the like. It is because of our innate predisposition toward language that we assumed its existence!

Again, as scientists, unless explicitly specified, we cannot assume the existence of structure. Now, given our recent revelation can we encode and decode information effectively in language? Consider the following chain of letters from the alphabet: Addaminustoall.

What can one decode from this sentence?

  • "Add a minus to all" - Perhaps an instruction for adding a minus sign to all numbers in a mathematical expression

  • "Adda minus Toall" - Perhaps Adda and Toall are a couple, and only Adda is expected to attend a party?

  • ...

My profuse apologies for the completely contrived [and moronic] example. But I think it illustrates my point. And as such we come to:

Embarrassing conclusion: Without structure, any string of symbols is ultimately doomed to ambiguous interpretation. As such, the above definition of language suffers the same fate as its analogy for music. Stated another way, both language and music require additional structure to encode information effectively - structure that our above definition doesn't explicitly provide.

Adding the necessary structure to our definition of language follows intuitively. But that's a topic I'll tackle in the future. For now lets go back to our reactions to defining Language and music as we did above. Did you have an adverse reaction to Language being defined in the above construct before I pointed it out? If so, how would you compare that, in degree and kind, to the adverse reaction you had to the analogous construct for music? Again, tantalizing my gambling-addiction tendencies, I would say that a fair majority would have a stronger reaction to music's definition.

Here's the kicker. We are all, in a loose sense, experts at language: we can communicate our thoughts with a high level of proficiency and application. Comparatively speaking, most of us novices at best at music; even those of us with extensive musical training. But its definition above set our spidey senses tingling...

As I said, there are some rather large fundamental concepts on both Language and music at play here, and that fully addressing them is outside the scope of this article. The intention of the article is to make the reader question his/her own assumptions and predisposition about music and language. But I will leave you with a teaser. One of the greatest linguists of our time, Noam Chomsky, popularised a concept called Universal Grammar[2]. This draws on Nativist Philosophical thought that maintains that principles of grammar, shared by all languages, are innate to human beings. That is to say, that the skills necessary to learn and understand language are a function of Nature rather than Nurture.

At this time, I will defer commenting on the Universal Grammar concept itself. I will however pose the question:

Given the symmetry we saw in the construct for Language and the analogous construct for music (i.e. both succeeded and failed for the same parameters) and our knee-jerk reaction to the analogous definition, does the concept of a Universal Grammar extend to music?

This is a question we will keep coming back to in these articles. And instead of making another...knee-jerk reaction, let us arrive at the conclusion by exploring the connection between Language and music more profoundly. It should be quite obvious that exploring Chomsky's Universal Grammar's applicability to music is intrinsically tied to exploring the validity of our initial hypothesis – is music a universal language?

Until next time...

----

Bibliography

  • http://en.wikipedia.org/wiki/Language [1]

  • http://en.wikipedia.org/wiki/Universal_grammar

Tuesday, April 20, 2010

A quick introduction to Modelling

I am in the midst of writing quite a few articles on the subject of modelling the computational aspects of creativity – specifically in music. And, I'm trying to decide on the best way to present the material such that my audience sees the themes I'm alluding to. However, I just realized that not all of you out there may have a quantitative background. This may (or perhaps not) have been implicitly stated, but I'd like my articles to be accessible to everybody, not just my peers. So here is a VERY brief introduction to modelling.

Before we begin, we must familiarize ourselves with the vocabulary “necessary”[note the quotation marks] for modelling. I have always believed that there is a fine line between vocabulary and jargon...and that line is necessity. One needs a vocabulary to describe the world...and jargon to appear poetic (read self-indulgent) about his description. Far too many people indulge in jargon while explaining; we will NOT.

Here is our vocabulary for modelling:
  • Object: An Object is any observable item/thing/abstract entity in our world. For example, an alarm clock, a car, a batter in a baseball game are all objects.

  • Event: An Event is an observation about how an Object works in our world. For example an alarm clock ringing, a car stopping at a traffic light, a batter hitting a home run are all examples of events. In essence, anything that falls within a cause-and-effect framework is an Event. An Event is often comprised of smaller Events on smaller Objects. The complexity of the Event is directly proportional to the number of sub-events chosen to model that event. Need an Event be as complex as possible? Absolutely not. The situation should determine how fine-grained our event should be and for the most part, the simpler the better.

  • State: The condition or value of an Object during the timespan of the Event. For example, the timer in the alarm goes to 7:00am and this fires an internal signal that switched the buzzer from off to on. In this case, the global Object – alarm clock – has the state 'On', and the sub-objects – the timer and the buzzer – have the states '7:00am' and 'On' respectively.
Now that we have the vocabulary, lets define a model. Simply put, a model is a mathematical, logical, algorithmic and/or graphical approximation of an Event; it captures all the different objects and how they interact to completely describe the Event.

Lets jump right in and create a [trivial] model. In this case, our Event is “Married Couple's date night”. This Event is easy enough to describe in plain English. If the husband, the wife and the babysitter are all free, the couple can have their date night while the babysitter takes care of their kids. If the husband, wife or babysitter are not free then tough noogies.

And [drum roll]....here's the model:












[Click graph to magnify]

Recall that we said a model can be a graphical approximation of an Event. All we have done here is created a pictorial representation of what we have described in plain English above. In scientific terms, this is called a Graph. The ardent reader is referred to Graph and Graph Theory for more information on the subject :

In this model, the global Object is an abstract entity - “date night” but is clearly observable and hence qualifies to be modelled as an Event. The States of the global Object are 'Bummer' and 'Oh Yeah'. The sub-objects are the husband's schedule, the wife's schedule and the babysitters schedule, all of which are abstract objects. Each of the objects has 2 states associated with them – 'Yes' and 'No'. From the diagram we observe that for the state 'Oh Yeah!' to be activated, all three sub-objects must be in the 'Yes' state. Otherwise, the 'Bummer' event is activated.

Note that for every graph, there is a corresponding mathematical/algorithmic model, and that is simply stating the graph as a function:


result = 'Oh Yeah!' iff (isHusbandFree='Yes' AND isWifeFree='Yes')
AND isBabySitterFree='Yes'

= 'Bummer' otherwise

Again, the ardent reader is referred to Boolean Algebra for more information.

Now lets imagine a more complex event – identifying a song, based on the first 5 seconds of a music clip. How would one graphically [or mathematically or algorithmically] model such an Event? How simple or complex does the model have to be to reasonably capture the phenomenon?

These questions form the foundation of Information Processing in Music, and we will explore them in subsequent articles. Stay tuned.

Resurgence of faith

Ever so often, we are fortunate enough to experience something truly life-changing; the kind of change that epiphanies are made of. This past week, I had the absolute honor of attending Victor Wooten's Bass [Guitar] Camp. It goes without saying that the amount of theory and technique all of us picked up quickly propelled each of us exponentially in our skill and understanding of the instrument. However what I think was infinitely more valuable was observing the dignity, integrity and humility with which the Wooten family welcomed us into their home. I had said I wouldn't make a habit of writing mushy nothings and I intend to keep that promise. So instead of a philosophical writ, I'll leave you with one of many interactions with Victor:

[I walk up to Victor with my bass and sharpie in hand]
Adi: Hi Victor! Could you possibly sign my bass for me?
Victor: Sure! No problem.
Adi: Thank you so much!
Victor: No, thank you for asking. Its my honor
[All without the vaguest hint of insincerity. Believe me, I would have known]

And most importantly, I'd like to leave you with a quote Victor shared with us at the very end of camp. I can't think of a more appropriate substantiation for my dedication to this cause - my Blog.

"[All] my discovers are the result of musical perceptions" - Albert Einstein.

Indulging in a bit of colloquialism: "`Nuf said..."

---

[P.S. I am working on a digital diary of sorts of my experiences at camp. I'm not sure whether I should post it on this blog since it is quite obviously thematically incoherent. However if enough of my readers are interested, I'll make an exception]

Monday, April 12, 2010

Paying homage to a Champion...

I'd like to begin this blog with a brief anecdote about one of Machine Learning's stalwarts. Last week, I had the utmost pleasure of meeting one of my heroes, Dr. Tom Mitchell. For those of you not familiar with that name, he is a well-respected authority on Machine Learning and for quite some time, his book was the premier text book for Machine Learning courses across the country:

Link to book on Amazon

[Exeunt: It was this book, that sparked my passion for the field, and I haven't looked back ever since. For those of you even remotely interested in the field, you would be hard-pressed to find a book with as much clarity, detail and ease-of-comprehension factor. For what its worth, I highly recommend this book.]

Unbeknownst to my hero, I was more than a little star-struck as I heard him give a talk on the potential of leveraging Natural Language Processing (NLP) on the web and learning Concepts. I will try and address the talk itself and its relevance to my research in latter articles, but for the eager, Tom has quite a few of his publications on his website and they're easy to read and comprehend:

Articles on his CMU website

After the talk I had a chance to engage him in conversation about the potential for techniques in NLP being leveraged in Music Analysis [article on it coming shortly]. Tom actually stayed 20 minutes after the talk to express his ideas on the topic. Do note that my query wasn't directly related to his talk or anything he was working on...and that, comparatively speaking, he was addressing a complete ignoramus. I was completely taken aback by the level of genuine investment in research – be it his own or someone else's - that Tom has. Everything he said was founded firmly in his own research and knowledge, and he produced deep and meaningful analogies on the subject – far from the superficial hand-wave of a response I usually get from other researchers in Machine Learning. Even more compelling than his intellect and commitment to research was his humility in addressing all of those who spoke to him, including myself. He entertained all of our ideas, which I can only imagine were bordering on absurd, with such genuine enthusiasm that those in attendance must have been forced to acknowledge an intellectual and emotional mettle that transcends that of us mere mortals.

As such, I want to start this blog by offering a very humble tip of the hat to one of the Champions of research. Its people like Dr. Tom Mitchell that reinforce my belief in self-actualization being achieved through the pursuit of research...which I imagine isn't terribly different than pursuing Art. Here's to you Tom...