E
ElderUberGeek
Appologies for the basic XML question...
I have some sales transaction data that is being generated from
various sources. This is aggregated in a file every day and uploaded
to a server. Then, the data is supposed to be fetched from the
individual files and dumped in to a database (in to a single table).
The data in the files is uniform for every record (i.e. date, time,
price, item, code, etc.). Nothing fancy.
I have the following questions regarding use of XML for these files:
1. Basically this data could as easily be stored in a delimited flat-
file (as it is now). So what is the advantage in using XML in this
case? (I know the advantages of XML in general, but in this case where
the data never changes, I am not sure of the advantage). Actually, the
XML markup bloats the file making transfer time more...
2. Should a DTD be used? A XML Schema? Or no need for this. I am
asking this because suppose in the future some new information is
added to the records, I think the processing script would need to know
what version of the XML file is being used and process accordingly?
3. Should every record in the data be stores as a single node with
data as attributes? (see example below). I think this is an age old
dilemma in XML but not sure of the answer...
4. Does the use of XML make the task of dumping the records in to the
database easier? (I think that either using existing classes or
available utilities there is no real effort in doing this?)
5. Should any type of XML transformation be considered? (or when...)
Sample flat file data:
TNo, Date, Time, Price, Qty
100, 010107, 1020, 3.2, 7
Should XML look like this?
<record TNo="100" Date="010107" Time="1020" Price="3.2" Qty="7"></
record>
Or like this?
<record>
<Tno>100</TNo>
<Date>010107</Date>
<Time>1020</Time>
<Price>3.2</Price>
<Qty>7</Qty>
</record>
I have some sales transaction data that is being generated from
various sources. This is aggregated in a file every day and uploaded
to a server. Then, the data is supposed to be fetched from the
individual files and dumped in to a database (in to a single table).
The data in the files is uniform for every record (i.e. date, time,
price, item, code, etc.). Nothing fancy.
I have the following questions regarding use of XML for these files:
1. Basically this data could as easily be stored in a delimited flat-
file (as it is now). So what is the advantage in using XML in this
case? (I know the advantages of XML in general, but in this case where
the data never changes, I am not sure of the advantage). Actually, the
XML markup bloats the file making transfer time more...
2. Should a DTD be used? A XML Schema? Or no need for this. I am
asking this because suppose in the future some new information is
added to the records, I think the processing script would need to know
what version of the XML file is being used and process accordingly?
3. Should every record in the data be stores as a single node with
data as attributes? (see example below). I think this is an age old
dilemma in XML but not sure of the answer...
4. Does the use of XML make the task of dumping the records in to the
database easier? (I think that either using existing classes or
available utilities there is no real effort in doing this?)
5. Should any type of XML transformation be considered? (or when...)
Sample flat file data:
TNo, Date, Time, Price, Qty
100, 010107, 1020, 3.2, 7
Should XML look like this?
<record TNo="100" Date="010107" Time="1020" Price="3.2" Qty="7"></
record>
Or like this?
<record>
<Tno>100</TNo>
<Date>010107</Date>
<Time>1020</Time>
<Price>3.2</Price>
<Qty>7</Qty>
</record>