<?xml version="1.0" encoding="UTF-8"?>
	<rss version="2.0"
		xmlns:content="http://purl.org/rss/1.0/modules/content/"
		xmlns:wfw="http://wellformedweb.org/CommentAPI/"
		xmlns:dc="http://purl.org/dc/elements/1.1/"
		xmlns:atom="http://www.w3.org/2005/Atom"

			>

	<channel>
		<title>Cycling 74  &#187;  Topic: scrambled hackz idea&#8230;</title>
		<atom:link href="http://cycling74.com/forums/topic/scrambled-hackz-idea/feed" rel="self" type="application/rss+xml" />
		<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/feed</link>
		<description></description>
		<pubDate>Wed, 19 Jun 2013 06:02:17 +0000</pubDate>
		<generator>http://bbpress.org/?v=2.2.4</generator>
		<language></language>

		
														
					
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-25209</guid>
					<title><![CDATA[scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-25209</link>
					<pubDate>Mon, 03 Apr 2006 13:27:01 +0000</pubDate>
					<dc:creator>Deklin</dc:creator>

					<description>
						<![CDATA[
						<p><a href="http://youtube.com/watch?v=eRlhKaxcKpA" rel="nofollow">http://youtube.com/watch?v=eRlhKaxcKpA</a></p>
<p>for those of you who dont know what it is, jsut have a look at the link above.<br />
I have a question that might be a bit over my head, but I am trying to create a similar patch in Max for a school project. im going to leave out the video part of it, but how would I go about &#8216;searching the database&#8217; for similar sounds? <br />
im think I have an idea of how to do the rest, but scanning the buffer in real time for a similar waveform/spectrum is the place where I am stumped. </p>
<p>any suggestions?</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73942</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73942</link>
					<pubDate>Mon, 03 Apr 2006 14:16:41 +0000</pubDate>
					<dc:creator>Jean-Francois Charles</dc:creator>

					<description>
						<![CDATA[
						<p></p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73943</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73943</link>
					<pubDate>Mon, 03 Apr 2006 14:49:19 +0000</pubDate>
					<dc:creator>Matthew Aidekman</dc:creator>

					<description>
						<![CDATA[
						<p>I&#8217;ll quickly note that I have a patch which analyzes for beats which  <br />
I gave up on because once I got down to a 20 ms window, I couldn&#8217;t  <br />
get it to pick a specific sample which was correct.  if you actually  <br />
succeed in doing this please shoot me an email.  (in fact I feel like  <br />
this video beat me to a punch)</p>
<p>(sorry to thread jack)</p>
<p>
.    .    .    .    .    .    .    .    .    .    .    .<br />
Http://www.EstateSound.com<br />
Http://ideasforstuff.blogspot.com<br />
.    .    .    .    .    .    .    .    .    .    .    .</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73944</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73944</link>
					<pubDate>Mon, 03 Apr 2006 15:00:18 +0000</pubDate>
					<dc:creator>Jean-Francois Charles</dc:creator>

					<description>
						<![CDATA[
						<p></p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73945</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73945</link>
					<pubDate>Mon, 03 Apr 2006 15:30:01 +0000</pubDate>
					<dc:creator>Anthony Palomba</dc:creator>

					<description>
						<![CDATA[
						<p>I have been trying to figure this out as well.<br />
See the thread Spectral Recontextualization (was very interesting <br />
video&#8230;).<br />
I have the saving of FFT data working. The real trick is doing <br />
the comparison. If I has a window size of 512, I would have 512 FFT<br />
data values to look at each time. If I could calculate some kind of<br />
distribution over those values and save the resultant value, I could<br />
then order those values and save them in a coll. This would speed<br />
up the lookup part of things. </p>
<p>So the steps would be&#8230;<br />
 1. Process source soundfile and save FFT data using sfrecord~.<br />
 2. Load source FFT data in buffer~. Use index~ to access it. <br />
    Calculate FFT distributions and build data structure for fast <br />
    access of FFT data. Store this data structure in a coll. You <br />
    would also save the index of index~. These values could then <br />
    be ordered for faster searching.<br />
 4. Analyze incoming sound, calculate FFT distribution, lookup <br />
    closest value in Col.  </p>
<p>I am trying to keep this as simple as possible, no jitter, no DB.<br />
This method would consume a lot of RAM though. Step 2 is the step <br />
that needs to be figured out. When I say distribution, I mean some<br />
equation that takes a set of values and then calculates some <br />
identifying value. I am not a mathematician, I was wondering if<br />
someone out there could offer some suggestions. Any ideas?</p>
<p>Anthony</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73946</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73946</link>
					<pubDate>Mon, 03 Apr 2006 15:58:52 +0000</pubDate>
					<dc:creator>Matthew Aidekman</dc:creator>

					<description>
						<![CDATA[
						<p>now I&#8217;m excited!</p>
<p>first of all, he makes it sound like he&#8217;s matching the full samples  <br />
live.  this is impossible because of the age old &#8220;you dont know the  <br />
future rule&#8221;  in other words, he cant say CHAAH and expect to get a  <br />
CHAAAH live because at the instant it selects the slice, all its  <br />
heard is CH.</p>
<p>also, I&#8217;ve thrown this structure our in an earlier thread but it  <br />
seems applicable. forgive the repetition.  you&#8217;ve rekindled my  <br />
interest in it.</p>
<p>STORE<br />
A: analyze each slice with as much as much metadata as possible&#8230;   <br />
brightness~,fiddle~, loudness~, try and find some formants, max  <br />
pitch, min pitch, etc&#8230;<br />
B: save an xml file for each transient (or in this case song with  <br />
multiple transient descriptions)<br />
C: upon load, go find all the xml files and load them into a database.</p>
<p>now rather than 512 fft bands which is unorganized data, you have 30  <br />
pieces of useful information.</p>
<p>so to search you might analyze the input and find the closest match.   <br />
ie average difference of values is closest to zero.</p>
<p>this is just what I&#8217;m working on, I don&#8217;t know if its how he&#8217;s doing it.<br />
-matt</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73947</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73947</link>
					<pubDate>Mon, 03 Apr 2006 16:06:30 +0000</pubDate>
					<dc:creator>Anthony Palomba</dc:creator>

					<description>
						<![CDATA[
						<p>I think it is easier to work directly with the<br />
FFT data because that is what you are trying to <br />
match. You trying to find the closest set of FFT<br />
values that matches the incoming one, regardless <br />
of loudess, briteness, or pitch.</p>
<p>Anthony</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73948</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73948</link>
					<pubDate>Mon, 03 Apr 2006 16:41:46 +0000</pubDate>
					<dc:creator>Matthew Aidekman</dc:creator>

					<description>
						<![CDATA[
						<p>On second watch. this looks to be more probable.  I was under the  <br />
impression that it reacted to a full syllable.  it seems it only  <br />
takes the first fft frame of the input and matches it to an ?average?  <br />
of the recorded slice.  if no inputs received it just continues  <br />
playing.  brute force.</p>
<p>sorry.  in creativity world I&#8217;m a lonely guy, I get excited when  <br />
people even approach being in phase with me.<br />
-matt</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73949</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73949</link>
					<pubDate>Mon, 03 Apr 2006 17:34:57 +0000</pubDate>
					<dc:creator>Deklin</dc:creator>

					<description>
						<![CDATA[
						<p>thanks a lot for the help so far, im still thinking in thoery right now, as I think I&#8217;ll drive myself crazy if im trial and erroring without knowing the outcome. <br />
FFT and bonk~ definitely seem to be the way forward, considering the short amount of time i have to do it. but i would appreciate the help!</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73950</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73950</link>
					<pubDate>Mon, 03 Apr 2006 18:34:55 +0000</pubDate>
					<dc:creator>Jean-Francois Charles</dc:creator>

					<description>
						<![CDATA[
						<p></p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73951</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73951</link>
					<pubDate>Mon, 03 Apr 2006 18:55:56 +0000</pubDate>
					<dc:creator>Anthony Palomba</dc:creator>

					<description>
						<![CDATA[
						<p>Yes I understand that some frequency bins are <br />
not as relevant. What is the point in trying to <br />
extract all this extra information when all your<br />
are trying to do is find the closest matching spectrum.<br />
The whole point is to keep things simple. By treating<br />
the FFT values as points on a line you could calculate <br />
the distibution and save that for later comparison. <br />
Looking at the Bark~ documentation shows that its <br />
output is at the control rate. I am not sure it would <br />
be able to have the time resolution needed to do spectral <br />
remapping. For this to work you need to be going at the<br />
audio rate.</p>
<p>Anthony</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73952</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73952</link>
					<pubDate>Mon, 03 Apr 2006 19:22:51 +0000</pubDate>
					<dc:creator>Jean-Francois Charles</dc:creator>

					<description>
						<![CDATA[
						<p></p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73953</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73953</link>
					<pubDate>Mon, 03 Apr 2006 20:32:22 +0000</pubDate>
					<dc:creator>Matthew Aidekman</dc:creator>

					<description>
						<![CDATA[
						<p>the first thing that led me to metadata (aside from the fact that I  <br />
work on it)  is the speech.  he could conjure speech pretty easily  <br />
which means that somewhere it knew that there was a focused fft bin  <br />
which implies that its not just the first fft frame. because if you  <br />
say CHAAH your analyzing CH.</p>
<p>it could just rigged like a trade show computer to perform well under  <br />
x circumstances.</p>
<p>I draw no conclusions.I&#8217;m leaning toward fft for him, but its still  <br />
metadata for me.<br />
you guys rock.<br />
-matt</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73954</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73954</link>
					<pubDate>Wed, 05 Apr 2006 19:42:56 +0000</pubDate>
					<dc:creator>Anthony Palomba</dc:creator>

					<description>
						<![CDATA[
						<p>I have hit a roadblock in solving this problem. So far,<br />
have a way to save all my FFT windows. What is needed <br />
now is a way to represent these FFT windows and <br />
compare them with the FFT windows of an incoming <br />
signal. My first thought would be to do a histogram<br />
on the FFT windows? I am not sure if that would be <br />
best thing to do. Does anyone have any suggestions<br />
as to what method I could use that would accurately <br />
give me a way to compare spectra?</p>
<p>Anthony</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73955</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73955</link>
					<pubDate>Thu, 06 Apr 2006 16:13:13 +0000</pubDate>
					<dc:creator>Anthony Palomba</dc:creator>

					<description>
						<![CDATA[
						<p>I have been looking at the following site&#8230;<br />
  <a href="http://www.mat.ucsb.edu/~b.sturm/sand/VLDCMCaR/VLDCMCaR.html" rel="nofollow">http://www.mat.ucsb.edu/~b.sturm/sand/VLDCMCaR/VLDCMCaR.html</a><br />
This is a pretty serious implemetation of spectral recontextualization.<br />
The author states that the following sound qualities are <br />
considered when comparing spectra:</p>
<p>Number of Zero Crossings &#8211; General noisiness, existence of transients <br />
Root Mean Square &#8211; (RMS) Mean acoustic energy (loudness) <br />
Spectral Centroid &#8211; Mean frequency of total spectral energy <br />
Spectra Dropoff &#8211; Frequency below which 85% of energy exists <br />
Harmonicity Deviation &#8211; from a harmonic spectra <br />
Pitch &#8211; Estimate of fundamental frequency </p>
<p>I doubt I could do all these in real time on my laptop. But I <br />
might be able to get good results if I choose one. Which one <br />
would be the best one to use? Does any one know of an external <br />
that will do any of these? If not could someone please point <br />
me in the direction of what algorithm to use.</p>
<p>Anthony</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73956</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73956</link>
					<pubDate>Thu, 06 Apr 2006 17:02:51 +0000</pubDate>
					<dc:creator>Peter McCulloch</dc:creator>

					<description>
						<![CDATA[
						<p>Actually, you probably could get most of these in realtime using the <br />
Tristan Jehan objects.  The catch with analysis is generally not doing <br />
the analysis, but having overhead to do other things at the same <br />
time&#8230;  (He says wistfully from a 3 1/2 year powerbook&#8230;)  Analyzer~ <br />
will output rather a lot of useful information.  For instance, you <br />
could use the bark output of that with the MLP object if you wanted to <br />
do neural net matching&#8230;</p>
<p>Peter McCulloch</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73957</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73957</link>
					<pubDate>Thu, 06 Apr 2006 18:14:14 +0000</pubDate>
					<dc:creator>Mark Pauley</dc:creator>

					<description>
						<![CDATA[
						<p>So it looks like the author of scrambled hackz stored each chunk in a  <br />
vector-space.  If you&#8217;ve got k bins per window and j windows per  <br />
chunk, then you&#8217;re talking k * j dimensions, which is huge.<br />
Then it sounds like he stores a reference to the sample in this k * j  <br />
space and the &#8220;closeness&#8221; is really just the scalar distance.  Scalar  <br />
distance over greater than 3-space isn&#8217;t simple pythagoras, I forget  <br />
how to do it but I think a good linear algebra textbook would have the  <br />
answer.</p>
<p>So the storage would need to be sparse in k * j space, I would  <br />
recommend a list to start.  Maybe make a buffer for the imported sound  <br />
file and address a chunk by the beginning and ending sample.  The  <br />
lookup will be faster if you restrict the database size, you can  <br />
optimize later.  So imagine you&#8217;ve got 32 chunks, for each incoming  <br />
chunk (that you&#8217;re mapping to a sample in the database), you just  <br />
iterate through the list and find the chunk that is of least scalar  <br />
distance.</p>
<p>To make this simple: imagine that you only have fft bins per fft  <br />
window, and 3 windows per chunk, and 8 chunks per bar and one bar of  <br />
input sample.  Each sample would have 3 * 3 = 9 coordinates per  <br />
chunk.  So each saved chunk would be an entry in a list that looks  <br />
like [1.1 1.2 1.3 2.1 2.2 2.3 3.1 3.2 3.3 begin_sample end_sample].   <br />
Then you need to write an abstraction that takes a 9 element list (the  <br />
incoming chunk&#8217;s coord&#8217;s) and produces a 2 element list (the outgoing  <br />
chunk&#8217;s position in the saved buffer).</p>
<p>This entire process would of course be optimized later, but you could  <br />
probably get away with only using a small number of fft bins.</p>
<p>Does this make any sense?  Or am I being stupid?</p>
<p>_Mark</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73958</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73958</link>
					<pubDate>Thu, 06 Apr 2006 18:38:13 +0000</pubDate>
					<dc:creator>Anthony Palomba</dc:creator>

					<description>
						<![CDATA[
						<p>The problem is I am already doing the FFT analysis and<br />
saving that data in a file. I then load that data in <br />
a buffer~, i then take the incoming signal and find the <br />
closest matching FFT window in my source buffer~ (which<br />
again is holding FFT data NOT samples). I then replace incoming FFT <br />
window with the data of the source FFT window. This has <br />
to be done at the audio rate. The problem with using Bark~ is <br />
that it is doing its own FFT, so I lose the correlation between <br />
a specific FFT window and the analysis produced by bark~. <br />
Trying to tie my own FFT data to bark~ output would be really <br />
hard. </p>
<p>Anthony</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73959</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73959</link>
					<pubDate>Thu, 06 Apr 2006 19:03:22 +0000</pubDate>
					<dc:creator>Peter McCulloch</dc:creator>

					<description>
						<![CDATA[
						<p>You could save the FFT data in the original soundfile on some <br />
additional channels, so that you can access either.  (if you&#8217;re doing <br />
2x overlap, you&#8217;ll need two extra channels)<br />
There&#8217;s all sorts of data striping, etc. that you could do with it, but <br />
it sounds like an SDIF file may be the direction that you&#8217;re headed in, <br />
and they are at a higher level of analysis than what you get from an <br />
FFT.  I guess it just depends on how much off-line processing you&#8217;re <br />
looking to do.  Spear will save Sdif files, IIRC, and there&#8217;s a set of <br />
them for playing back in Max.</p>
<p>You could probably deal with the bark~ latency by compensating with <br />
delay~, and use the scheduler in audio interrupt (and using fftout~ to <br />
convert your buffers back into real-world audio), though.  How are you <br />
doing the correlation?  (looking at bin delta?)  (and is this in <br />
real-time, or NRT?)</p>
<p>Peter McCulloch</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73960</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73960</link>
					<pubDate>Thu, 06 Apr 2006 19:11:46 +0000</pubDate>
					<dc:creator>Anthony Palomba</dc:creator>

					<description>
						<![CDATA[
						<p>Mark, </p>
<p>I think he is doing something a little more complicated <br />
when it comes to comparing the spectra, I don&#65533;t think scalar <br />
distance is a meaningful comparison. For now I am trying to <br />
keep things simple. So I have k bins per window and j windows <br />
per chunk. Lets say K is a window size of 1024, j would vary depending <br />
on <br />
the size of the original file. I am not using a DB so this could <br />
potentially<br />
use a lot of RAM. I would run my source sound file through fft~ and <br />
save the FFT data to disk. I then load this FFT file into buffer~, and <br />
use index~ to get to a specific window FFT data. Lets just say for now <br />
I am going <br />
to do a linear search (later on I know I will have to optimize this). <br />
What I am missing is the piece that takes as input an FFT window and <br />
spits out some meaningful value that I can use to compare. Like wise<br />
I would do this same process on the FFT window of the incoming signal.<br />
I then compare values and keep repeating this process until I find the <br />
closest match. I then insert the matching FFT window into the output <br />
audio stream. Perhaps there is some easy way to compare things that <br />
I am just not thinking of. But it sounds like I need something more <br />
specific.<br />
Is there any source out there that I could perhaps use to calculate <br />
Pitch, <br />
Loudness, Brightness given FFT input?</p>
<p>Anthony</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73961</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73961</link>
					<pubDate>Thu, 06 Apr 2006 20:47:30 +0000</pubDate>
					<dc:creator>Anthony Palomba</dc:creator>

					<description>
						<![CDATA[
						<p>I had not thought about a multichannel file. That might be <br />
doable although more complex than I wanted. </p>
<p>The SDIF file sounds fascinating, does Max support SDIF files?<br />
I was looking to do this in real-time. So that is why I am trying<br />
to use existing Max objects like sfrecord~, fft~, buffer~, etc&#8230;<br />
I also need to easily do an ifft~ to produce the output audio<br />
stream. So using the native Max representation of fft data is <br />
much easier. </p>
<p>Trying to sync up bark~ with delay~ sounds complicated. If I <br />
use audio interrupt scheduling, will I be assured<br />
that each output from bark~ would correspond to a fft frame?<br />
It would be nice to not have to worry about that. Again I am <br />
trying to keep things simple. But If I could not find<br />
anything else that worked, I could try it. </p>
<p>Anthony</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73962</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73962</link>
					<pubDate>Thu, 06 Apr 2006 23:26:38 +0000</pubDate>
					<dc:creator>Wesley Smith</dc:creator>

					<description>
						<![CDATA[
						<p>If you consider each of these data points as one value of a feature vector:</p>
<p>Number of Zero Crossings &#8211; General noisiness, existence of transients<br />
Root Mean Square &#8211; (RMS) Mean acoustic energy (loudness)<br />
Spectral Centroid &#8211; Mean frequency of total spectral energy<br />
Spectra Dropoff &#8211; Frequency below which 85% of energy exists<br />
Harmonicity Deviation &#8211; from a harmonic spectra<br />
Pitch &#8211; Estimate of fundamental frequency</p>
<p>you will have a vector of length 6.  With a bunch of vectors, you are<br />
basically populating a 6 dimensional space with data.  If you have a<br />
ton of data, this is going to be a lengthy search as that space is<br />
massive compared to how much data you have.  There are a few ways to<br />
do fast searching if you consider your data to be distributed in<br />
specific ways.  For instance, if you assume your data to be gaussian<br />
dsitributed, you could use principle component analysis to compare<br />
things in a smaller dimensional space.  Or, you could use something<br />
like an approximate nearest neighbor search:<br />
<a href="http://www.cs.umd.edu/~mount/ANN/" rel="nofollow">http://www.cs.umd.edu/~mount/ANN/</a> and find your seult very quickly. <br />
That would be a good way to query a database imho.</p>
<p>best,<br />
wes</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73963</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73963</link>
					<pubDate>Fri, 07 Apr 2006 19:15:46 +0000</pubDate>
					<dc:creator>Anthony Palomba</dc:creator>

					<description>
						<![CDATA[
						<p>I am not trying to do all of these I am just trying to <br />
do at least one of them. Just to get things working. But <br />
I seem to be running into problems figuring out how to <br />
do the analysis of the FFT windows. There seems to be no <br />
convenient externals that take FFT input and calculate <br />
meaningful things like pitch, loudness, brightness.</p>
<p>Anthony</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73964</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73964</link>
					<pubDate>Fri, 07 Apr 2006 19:57:31 +0000</pubDate>
					<dc:creator>Peter McCulloch</dc:creator>

					<description>
						<![CDATA[
						<p>There aren&#8217;t any that I know of.  If you store the FFT data in extra <br />
channels on the soundfile, you can just send the original sound into <br />
the objects that will calculate all of these things.  It may take a <br />
slight amount of extra fanangling, but it&#8217;s definitely the way to go.  <br />
(basically just test it out, see what the lag is, and then adjust for <br />
it with delay~)  Most of these analysis objects are quite optimized, so <br />
the performance saving isn&#8217;t necessarily that significant.</p>
<p>Peter McCulloch</p>
						]]>
					</description>

					
					
				</item>

			
				<item>
					<guid>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73965</guid>
					<title><![CDATA[Re: scrambled hackz idea&#8230;]]></title>
					<link>http://cycling74.com/forums/topic/scrambled-hackz-idea/#post-73965</link>
					<pubDate>Fri, 07 Apr 2006 20:06:21 +0000</pubDate>
					<dc:creator>Anthony Palomba</dc:creator>

					<description>
						<![CDATA[
						<p>It is certainly worth a try. What do I use to create <br />
a multichannel file?</p>
						]]>
					</description>

					
					
				</item>

					
		
	</channel>
	</rss>

