Hey all. If you take a database backed site like Wordpress or Drupal and archive it to "flat" HTML, you take a one way trip to losing all metadata.
What I mean by that is, information about posts and pages like date created, author, tags, categories, etc.
Especially for large archives, it means you can't easily remix the site content again.
I've been using Jekyll, a static site generator, for this same purpose. Exporting to Jekyll means individual posts or pages are exported into HTML / markdown, with YAML front matter that contains this metadata.
That last bit was a bit gibberish if you haven't played with Jekyll yet. There is a block of text at the top of each text file that has author, tags, etc.
The downside to exporting to Jekyll is that it doesn't preserve the theme (because it's saving the content, not the presentation layer). And, that it's learning a little bit of Jekyll.
There is a WP plugin for Jekyll exports: https://en-ca.wordpress.org/plugins/jekyll-exporter/
Haven't tried it. Here at Reclaim, you might run a global instance of Jekyll in order to generate the flat HTML.
More complicated? Yes. I'm a big fan of GitHub Pages, where every site automatically runs Jekyll and does free hosting including domain names.
Hope that helps explain what I mean.