this post was submitted on 08 Sep 2024
1221 points (98.3% liked)

Programmer Humor

32874 readers
599 users here now

Post funny things about programming here! (Or just rant about your favourite programming language.)

Rules:

founded 5 years ago
MODERATORS
 
(page 2) 50 comments
sorted by: hot top controversial new old
[–] [email protected] 32 points 4 months ago
[–] [email protected] 178 points 4 months ago (8 children)

Some data formats are easy for humans to read but difficult for computers to efficiently parse. Others, like packed binary data, are dead simple for computers to parse but borderline impossible for a human to read.

XML bucks this trend and bravely proves that data formats do not have to be one or the other by somehow managing to be bad at both.

[–] [email protected] 27 points 4 months ago (7 children)

Strong competition from yaml and json on this point however

[–] [email protected] 33 points 4 months ago (11 children)

Alright, the YAML spec is a dang mess, that I'll grant you, but it seems pretty easy for my human eyes to read and write. As for JSON -- seriously? That's probably the easiest to parse human-readable structured data format there is!

[–] [email protected] 9 points 4 months ago (3 children)

it is anything but easy to read if your entire file does not fit on a single screen.

load more comments (3 replies)
[–] [email protected] 3 points 4 months ago

We're we are going we don't need any comments.

[–] [email protected] 2 points 4 months ago (2 children)

I don't know much apart from the basics of YAML, what makes it complicated for computers to parse?

[–] [email protected] 14 points 4 months ago* (last edited 4 months ago) (1 children)

the spec is 10 chapters. everything is unquoted by default, so parsers must be able to guess the data type of every value, and will silently convert them if they are, but leave them alone otherwise. there are 63 possible combinations of string type. "no" and "on" are both valid booleans. it supports sexagesimal numbers for some reason, using the colon as a separator just like for objects. other things of this nature.

[–] [email protected] 1 points 4 months ago

Yes, the classic "no" problem of YAML. But the addition of the comments is very nice.

[–] [email protected] 9 points 4 months ago

Sometimes it's a space, sometimes its a tab, and sometimes it's two spaces which might also be a tab but sometimes it's 4 spaces which means 2 spaces are just whack And sometimes we want two and four spaces because people can't agree.

But do we want quotes or is it actually a variable? Equals or colon? Porque no los dos?

load more comments (8 replies)
load more comments (6 replies)
[–] [email protected] 52 points 4 months ago (1 children)

The thing is, it was never really intended as a storage format for plain data. It's a markup language, so you're supposed to use it for describing complex documents, like it's used in HTML for example. It was just readily available as a library in many programming languages when not much else was, so it got abused for data storage a lot.

load more comments (1 replies)
load more comments (6 replies)
[–] [email protected] 70 points 4 months ago (2 children)
[–] [email protected] 2 points 4 months ago

XML is fine. Namespaces have a special place in hell though

[–] [email protected] 15 points 4 months ago

Except for obvious typos

[–] [email protected] 5 points 4 months ago

XML has its strengths as a markdown format. My own formatted text format ETML is based on XML, as I could recycle old HTML conventions (still has stylesheet as an option), and I can store multiple text blocks in an XML file. It's not something my main choice of human readable format SDL excels at, which itself has its own issues (I'm writing my own extensions/refinements for it by the name XDL, with hexadecimal numbers, ISO dates, etc.).

load more comments
view more: ‹ prev next ›