Is there some *easy* way to batch convert wordpress blog entries into plain HTML that keeps the same URLs?
-
Is there some *easy* way to batch convert wordpress blog entries into plain HTML that keeps the same URLs? I don’t want to break people’s links.
I can handle plain HTML. I have BBEdit and I know how to use it mostly (my grep is pretty minimal). (My tags and categories are already messed up, so they’d need editing anyway.)
Failing that, is there some way to import them into some other site management system?
2/2
-
@gannet do any of these helps?
https://wcanvas.com/blog/6-wordpress-to-html-converter-tools/
I've personally been looking to convert to markdown. But I'm not quite there yet energy wise.
-
@gannet I do something like this occasionally for people with old/defunct WP sites where they want to keep the content alive, but don't want to have to deal with keeping WP and all its requirements.
The tool I use is HTTrack. You can point it at any live web site, and it will create a completely static mirror of that site on your hard drive. It preserves things like directory structures, so if you use "pretty permalinks" for your WP post URLs, those URLs should stay intact. Once you have the static mirror, you can throw it up on any old static file hosting, or modify the HTML files by hand if you want to.
-
@jalefkowit @gannet omg this looks so useful
-
Jason Lefkowitzreplied to Jason Lefkowitz last edited by
@gannet If you want to migrate the content into another CMS, WP provides a way to do that, kinda. There's a function in the "Tools" menu called "Export" that will dump all your site content into a file using an XML-based format WordPress calls WXR. So when you have that, the question to ask the other CMS is if it has a way to handle WXR imports.
WXR is poorly documented, which sucks. But it's at least human-readable, and if a CMS these days is going to support content import from anything, it should be from WordPress.
-
Jason Lefkowitzreplied to metasilk last edited by [email protected]
@metasilk @gannet It's great. You can take a whole WP site and turn it into a static archive you can host from an S3 bucket for pennies. Terrific for older sites that aren't getting active updates anymore.
The only caveat is that features that rely on server-side functionality (like search, for instance) won't work. But you can solve that by running a regex against the static files to remove the search box