Can It Be Done?

M

Mod

I would like to start a project but honestly do not think that I have
the time to create and sort out what I need. I'm hoping that there is
some program(s) out there that might be able to shorten the process
dramatically. This is what I would like to do. I would like to create
comic audio segments that are made from fragments of other audio
files, 1,513 files to be exact. To make quality audio pieces I will
need to have a list of what is said in each individual file. This
alone will take me forever but I think I can manage that without it
taking months on end. Here is where it gets really tough. I would like
to be able to have a list of every word said throughout all 1,513
files. So when I go to make a new piece and need to know what files
contain the word "we" in them, all I have to do is look at the list,
find the word, and the actual file numbers it appears in will be
listed along with that word. Is there anything out there that can look
at a long list of file names and words and organize them like this?
Perhaps a program that can look at 1,513 different text files and be
able to create an organized listing that way? Anybody know of
anything? Please I am desperate.
 
L

lilburne

Mod said:
Is there anything out there that can look
at a long list of file names and words and organize them like this?
Perhaps a program that can look at 1,513 different text files and be
able to create an organized listing that way? Anybody know of
anything? Please I am desperate.

You could do this with a combination of pipeline of shell commands:

combine all files together: cat
split each word onto a seperate line: tr
sort result: sort
only keep the unique ones: uniq
foreach line in the above result
foreach file check whether word occurs : grep -il

perl would be quicker though.
 
T

Thomas Matthews

Mod said:
I would like to start a project but honestly do not think that I have
the time to create and sort out what I need. I'm hoping that there is
some program(s) out there that might be able to shorten the process
dramatically. This is what I would like to do. I would like to create
comic audio segments that are made from fragments of other audio
files, 1,513 files to be exact. To make quality audio pieces I will
need to have a list of what is said in each individual file. This
alone will take me forever but I think I can manage that without it
taking months on end. Here is where it gets really tough. I would like
to be able to have a list of every word said throughout all 1,513
files. So when I go to make a new piece and need to know what files
contain the word "we" in them, all I have to do is look at the list,
find the word, and the actual file numbers it appears in will be
listed along with that word. Is there anything out there that can look
at a long list of file names and words and organize them like this?
Perhaps a program that can look at 1,513 different text files and be
able to create an organized listing that way? Anybody know of
anything? Please I am desperate.

If you really want to get analytical (sp?), you will need
a database. For each file you will need to have the following
information:
file name, phrase, phrase position within file, phrase duration.
You can augment these attributes with person name and other stuff.

Once you build the database, you can then search it and find the
phrases (words) that you need. Place the result records into a
a file.

Use your favorite language (scripting ones included) and parse
the result file, concatenating all the phrase data into one
file.

Then shove the file into your favorite audio application.

By the way, this has nothing to do with the C++ language and
is best discussed in Follow-ups set.

--
Thomas Matthews

C++ newsgroup welcome message:
http://www.slack.net/~shiva/welcome.txt
C++ Faq: http://www.parashift.com/c++-faq-lite
C Faq: http://www.eskimo.com/~scs/c-faq/top.html
alt.comp.lang.learn.c-c++ faq:
http://www.raos.demon.uk/acllc-c++/faq.html
Other sites:
http://www.josuttis.com -- C++ STL Library book
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
474,161
Messages
2,570,892
Members
47,428
Latest member
RosalieQui

Latest Threads

Top