baynezy / html2markdown Goto Github PK
View Code? Open in Web Editor NEWA library for converting HTML to markdown syntax in C#
License: Apache License 2.0
A library for converting HTML to markdown syntax in C#
License: Apache License 2.0
Try and reduce all the extra whitespace that gets generated
The Contributing Markdown is broken.
Just link to in it in the README
https://help.github.com/articles/helping-people-contribute-to-your-project/
Things like <!DOCTYPE HTML PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
should be removed
<blockquote>
<p class="right" align="right">
<em>“Qualquer coisa que possas fazer ou sonhar, podes começá-la. A ousadia encerra em si mesma genialidade, poder e magia.<br />
Ouse fazer, e o poder lhe será dado!”</em><br />
<strong>— Johann Wolfgang von Goethe</strong>
</p>
</blockquote>
Creates
>
*“Qualquer coisa que possas fazer ou sonhar, podes começá-la. A ousadia encerra em si mesma genialidade, poder e magia.
Ouse fazer, e o poder lhe será dado!”*
**— Johann Wolfgang von Goethe**
Should be
> *“Qualquer coisa que possas fazer ou sonhar, podes começá-la. A ousadia encerra em si mesma genialidade, poder e magia.
> Ouse fazer, e o poder lhe será dado!”*
> **— Johann Wolfgang von Goethe**
Currently all <code>
blocks as a single line. Should update this to handle multi-line code blocks.
Getting this error after updating to 1.0.3
System.MissingMethodException : Method not found: 'Void LinqExtensions.EachExtension.Each(System.Collections.Generic.IEnumerable`1<!!0>, System.Action`1<!!0>)'.
Updating LinqExtensions
to 0.0.1.14396 fixes it.
<pre><code>Install-Package Html2Markdown
</code></pre>
Does not convert properly
<pre><code>var converter = new Converter();
var result = converter.Convert(html);
</code></pre>
Does not convert properly
Update README to show a breakdown of the build status to show both
Please make methods Convert and ConvertFile static. I see no point, why the Converter object has to be created.
Instead of using the GUI for configuration utilise the .yml config
In html line breaks, spaces, and tabs don't make sense and should be replaced by a single space in markdown.
Html:
<p>
text
text
text
</p>
Markdown:
text text text\r\n\r\n
thaks
Change over the codebase to convert HTML by walking the graph, using depth first search. This will underpin subsequent features.
Update the project to be a portable class library
HTML block level tags div
and table
should not get converted. Any HTML inside these tags should also not get converted to Markdown.
Html like this <a name="curio"></a>
should be removed
I have some HTML like this:
<i>Some text</br></i>
It's a strange way of doing it, but I have no control over the HTML. The problem is that this causes the converter to incorrectly render the markdown. It would be nice if elements in between the </i>
didn't break it.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.