html2skim.1.html

<!DOCTYPE html>
<html lang="en">
<head>
  <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
  <meta name="language" content="EN">
  <title>R. S. Doiel, Software Engineer/Analyst - html2skim.1</title>

  <link rel="stylesheet" type="text/css"  href="/printfonts/print.css" media="print" />
  <link rel="stylesheet" type="text/css"  href="/webfonts/fonts.css" media="screen" />
  <link rel="stylesheet" type="text/css"  href="/css/site.css" media="screen" />
  <link title="RSS feed for rsdoiel's blog" rel="alternate" type="application/rss+xml" href="https://rsdoiel.github.io/rss.xml" />
  <link title="markdown source for page" rel="alternative" type="application/markdown" href="html2skim.1.md">
</head>
<body>
<nav>
<ul>
<li><a href="/">Home</a></li>
<li><a href="index.html">README</a></li>
<li><a href="user-manual.html">User Manual</a></li>
<li><a href="LICENSE">LICENSE</a></li>
<li><a href="INSTALL.html">Install</a></li>
<li><a href="search.html">Project Search</a></li>
<li><a href="ideas.html">Someday, Maybe</a></li>
<li><a href="https://github.com/rsdoiel/stngo">GitHub</a></li>
<li><a href="about.html">About</a></li>
</ul>
</nav>

<section>
<!--  <h1>html2skim.1</h1> -->

<h1 id="name">NAME</h1>
<p>html2skim</p>
<h1 id="synopsis">SYNOPSIS</h1>
<p>html2skim <a href="#options">OPTIONS</a> SKIM_DB_FILE URL
[CSS_SELECTOR]</p>
<h1 id="description">DESCRIPTION</h1>
<p>html2skim provides a way to treat a webpage containing links as a
feed storing the results in a skimmer database. A SKIM_DB_FILE will be
created if it does not already exist. Feed channel information can be
specified via html2skim options. The URL should be for an HTML page that
html2skim will scrape for links. An optional CSS_SELECTOR can be
included to filter a specific section of the HTML document for links. If
none is provided then the selector <code>a[href]</code> will be
used.</p>
<h1 id="options">OPTIONS</h1>
<dl>
<dt>-help</dt>
<dd>
display help
</dd>
<dt>-version</dt>
<dd>
display version info
</dd>
<dt>-license</dt>
<dd>
display license
</dd>
<dt>-title</dt>
<dd>
this is the channel title to use for the links scraped in the page
</dd>
<dt>-description</dt>
<dd>
this is the channel description to use for links scraped in the page
</dd>
<dt>-link</dt>
<dd>
this is the link to associated with the channel. You’d set this if you
were going to take the scaped links and turn them into RSS 2.0
documents.
</dd>
</dl>
<h1 id="example">EXAMPLE</h1>
<p>This is an example of scaping a web page identified as
“https://example.edu/” filter the links found in the “.college-news”
element.</p>
<pre><code>html2skim  \
     myfeeds.skim \
     https://example.edu/ \
     &#39;.college-news &gt; a[href]&#39;</code></pre>
</section>

<footer>
</footer>

</body>
</html>