Hello,
I am using the REST API to retrieve the content of a page on Confluence. I am trying to parse the HTML content of the page in the reply but my parser (lxml, python) keeps throwing away all confluence-specific HTML tags.
Examples of such tags are:
<ac:structured-macro ac:name=\"toc\" /> <ac:structured-macro ac:name=\"tip\"> <ac:parameter ac:name=\"title\">Don't store information on this page!</ac:parameter> <ac:rich-text-body> <p>Do not store any information on this page as it might be lost while altering the page programmatically</p> </ac:rich-text-body> </ac:structured-macro> <ac:structured-macro ac:name=\"htmlcomment\"> <ac:parameter ac:name=\"hidden\">true</ac:parameter> <ac:parameter ac:name=\"atlassian-macro-output-type\">INLINE</ac:parameter> <ac:rich-text-body> <p>This should not display on the page! If you can see this, let me know!</p> </ac:rich-text-body> </ac:structured-macro>
Is there anywhere I can get the namespace from so that I can properly parse the custom HTML representing the page content? Many thanks in advance!
Community moderators have prevented the ability to post new answers.
Hi, Try this article here you will mayby find something interresting there.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.