YAML, SQL, or something else? Looking for recommendations for making a database of stories.

Bubs@lemm.ee · 9 months ago

YAML, SQL, or something else? Looking for recommendations for making a database of stories.

canpolat · 9 months ago

I would stay away from YAML (almost at all costs).

Bubs@lemm.ee · 9 months ago

What’s your reasoning for that?

At this point, I think I’ll only use yaml as the scraper output and then create a database tool to convert that into whatever data format I end up using.

Kissaki · 9 months ago

https://ruudvanasseldonk.com/2023/01/11/the-yaml-document-from-hell

JSON is a much simpler (and consequently safer) format. It’s also more universally supported.

YAML (or TOML) is decent for a manually read and written configuration. But for a scraper output for storage and follow-up workflows being through code parsing anyway, I would go for JSON.

Bubs@lemm.ee · 9 months ago

That’s an interesting read. I’ll definitely give json a try too.

logging_strict · 9 months ago

Very wise idea. And if you want to up your game, can validate the yaml against a schema.

Check out strictyaml

The author is ahead of his time. Uses validated yaml to build stories and weave those into web sites.

Unfortunately the author also does the same with strictyaml tests. Can get frustrating cause the tests are too simple.

Bubs@lemm.ee · 9 months ago

Gonna be honest, I’ll need to research a bit more what validating against a schema is, but I get the general idea, and I like it.

For initial testing and prototypes, I probably won’t worry about validation, but once I get to the point of refining the system, validation like that would be a good idea.

logging_strict · 9 months ago

Curious to hear your reasoning as to why yaml is less desirable? Would think the opposite.

Surprised me with your strong opinion.

Maybe if you would allow, and have a few shot glasses handy, could take a stab at changing your mind.

But first list all your reservations concerning yaml

Relevent packages I wrote that rely on yaml

pytest-logging-strict
sphinx-external-toc-strict