Openlink and SAP Informations
Sunday, November 23, 2014
Saturday, November 22, 2014
Should I use Elements or Attributes in XML?
I am learning about XML Attributes from W3Schools.
The author mentions the following (emphasis mine):
So is the view of the author a famous one, or is this the best practice in XML?
Should Attributes in XML be avoided?
W3Schools also mentioned the following (emphasis mine):
| |||
add a comment
|
Usage of attributes or elements is usually decided by the data you are trying to model.
For instance, if a certain entity is PART of the data, then it is advisable to make it an element. For example the name of the employee is an essential part of the employee data.
Now if you want to convey METADATA about data (something that provides additional information about the data) but is not really part of the data, then it is better to make it an attribute. For instance, lets say each employee has a GUID needed for back end processing, then making it an attribute is better.(GUID is not something that conveys really useful information to someone looking at the xml, but might be necessary for other purposes)
There is no rule as such that says something should be an attribute or a element.
Its not necessary to AVOID attributes at all costs..Sometimes they are easier to model, than elements. It really depends on the data you are trying to represent.
| |||
Not least important is that putting things in attributes makes for less verbose XML.
Compare
Against
Yes, that was a little biased and exaggerated, but you get the point
| |||||||||||||
|
Attributes model mapping. A set of attributes on an element isomorphizes directly onto a name/value map in which the values are text or any serializable value type. In C#, for instance, any
Dictionary<string, string> object can be represented as an XML attribute list, and vice versa.
This is emphatically not the case with elements. While you can always transform a name/value map into a set of elements, the reverse is not the case, e.g.:
If you transform this into a map, you'll lose two things: the multiple values associated with
key1, and the fact that key1 appears before key2.
The significance of this becomes a lot clearer if you look at DOM code that's used to update information in a format like this. For instance, it's trivial to write this:
That code is concise and unambiguous. Contrast it with, say:
| |||
You can't put a CDATA in an attribute. In my experience, sooner or later you are going to want to put single quotes, double quotes and/or entire XML documents into a "member", and if it's an attribute you're going to be cursing at the person who used attributes instead of elements.
Note: my experience with XML mainly involved cleaning up other peoples'. These people seemed to follow the old adage "XML is like violence. If using it hasn't solved your problem, then you haven't used enough."
| |||||
|
It all depends on what XML is used for. When it's mostly interop between software and machines - such as Web services it's easier to go all-elements if only for the sake of consistency (and also some frameworks prefer it that way, e.g. WCF). If it is targeted for human consumption - i.e. primarily created and/or read by people - then judicious use of attributes can improve readability quite a lot; XHTML is a reasonable example of that, and also XSLT and XML Schema.
| |||
I usually work on the basis that attributes are metadata - that is, data about the data. One thing I do avoid is putting lists in attributes. e.g.
Otherwise you have an extra level of parsing to extract each element. If XML provides the structure and tools for lists, then why impose another yourself.
One scenario where you may want to code in preference for attributes is for processing speed via a SAX parser. Using a SAX parser you will get an element call back containing the element name and the list of attributes. If you had used multiple elements instead then you'll get multiple callbacks (one for each element). How much of a burden/timesink this is is up for debate of course, but perhaps worth considering.
| |||||||||
|
I've used Google to search for the exact question. First I landed on this article,http://www.ibm.com/developerworks/library/x-eleatt/index.html. Though, it felt too long for a simple question as such. Anyhow, I've read through all the answers on this topic and didn't find a satisfactory summary. As such, I went back to the latter article. Here is a summary:
When do I use elements and when do I use attributes for presenting bits of information?
Principle of core content
Principle of structured information
Principle of readability
Principle of element/attribute binding
This is a short summary of the important bits from the article. If you wish to see examples and full description of every case, then refer to the original article.
| |||
The author's points are correct (except that attributes may contain a list of values). The question is whether or not you care about his points.
It's up to you.
| |||||
|
This is an example where attributes are data about data.
Databases are named by their ID attribute.
The "type" attribute of the database denotes what is expected to be found inside the database tag.
|
Subscribe to:
Comments (Atom)