diff --git a/DESCRIPTION b/DESCRIPTION
index 9ef8c69..3ab20ae 100644
--- a/DESCRIPTION
+++ b/DESCRIPTION
@@ -1,5 +1,5 @@
 Package: quanteda.textstats
-Version: 0.96.6
+Version: 0.97
 Title: Textual Statistics for the Quantitative Analysis of Textual Data
 Description: Textual statistics functions formerly in the 'quanteda' package.
     Textual statistics for characterizing and comparing textual data. Includes
diff --git a/README.md b/README.md
index 7cf2658..e812b3c 100644
--- a/README.md
+++ b/README.md
@@ -5,7 +5,7 @@
 [![CRAN Version](https://www.r-pkg.org/badges/version/quanteda.textstats)](https://CRAN.R-project.org/package=quanteda.textstats)
-[![](https://img.shields.io/badge/devel%20version-0.96.5-royalblue.svg)](https://github.com/quanteda/quanteda.textstats)
+[![](https://img.shields.io/badge/devel%20version-0.97-royalblue.svg)](https://github.com/quanteda/quanteda.textstats)
 [![Downloads](https://cranlogs.r-pkg.org/badges/quanteda.textstats)](https://CRAN.R-project.org/package=quanteda.textstats)
 [![Total Downloads](https://cranlogs.r-pkg.org/badges/grand-total/quanteda.textstats?color=orange)](https://CRAN.R-project.org/package=quanteda.textstats)
diff --git a/cran-comments.md b/cran-comments.md
index da6bdd8..96a515e 100644
--- a/cran-comments.md
+++ b/cran-comments.md
@@ -2,11 +2,12 @@
 Purpose:
 
-* To update the C++ code to better call the tbb library for parallel computing.
+* To fix issues related to the quanteda v4.0 release and its move to a version of TBB different from the one provided by RcppParallel.
 
 ## Test environments
 
-* local macOS 13.6, R 4.3.1
+* local macOS 14.4.1, R 4.3.3
+* macOS release via devtools::check_mac_release()
 * Windows release via devtools::check_win_release()
 * Windows devel via devtools::check_win_devel()
 * Windows old-release via devtools::check_win_oldrelease()
diff --git a/docs/404.html b/docs/404.html
new file mode 100644
index 0000000..8355fd5
--- /dev/null
+++ b/docs/404.html
@@ -0,0 +1,103 @@
Contributor Code of Conduct
+We are committed to making participation in this project a harassment-free experience for everyone, regardless of level of experience, gender, gender identity and expression, sexual orientation, disability, personal appearance, body size, race, ethnicity, age, religion, or choice of text editor.
+Examples of unacceptable behavior by participants include the use of sexual language or imagery, derogatory comments or personal attacks, trolling, public or private harassment, insults, or other unprofessional conduct.
+Project maintainers have the right and responsibility to remove, edit, or reject comments, commits, code, wiki edits, issues, and other contributions that are not aligned to this Code of Conduct. Project maintainers who do not follow the Code of Conduct may be removed from the project team.
+Instances of abusive, harassing, or otherwise unacceptable behavior may be reported by opening an issue or contacting one or more of the project maintainers.
+This Code of Conduct is adapted from the Contributor Covenant (http://contributor-covenant.org), version 1.0.0, available at http://contributor-covenant.org/version/1/0/0/
+GNU GENERAL PUBLIC LICENSE + Version 3, 29 June 2007 + + Copyright (C) 2007 Free Software Foundation, Inc. <https://fsf.org/> + Everyone is permitted to copy and distribute verbatim copies + of this license document, but changing it is not allowed. + + Preamble + + The GNU General Public License is a free, copyleft license for +software and other kinds of works. + + The licenses for most software and other practical works are designed +to take away your freedom to share and change the works. By contrast, +the GNU General Public License is intended to guarantee your freedom to +share and change all versions of a program--to make sure it remains free +software for all its users. We, the Free Software Foundation, use the +GNU General Public License for most of our software; it applies also to +any other work released this way by its authors. You can apply it to +your programs, too. + + When we speak of free software, we are referring to freedom, not +price. Our General Public Licenses are designed to make sure that you +have the freedom to distribute copies of free software (and charge for +them if you wish), that you receive source code or can get it if you +want it, that you can change the software or use pieces of it in new +free programs, and that you know you can do these things. + + To protect your rights, we need to prevent others from denying you +these rights or asking you to surrender the rights. Therefore, you have +certain responsibilities if you distribute copies of the software, or if +you modify it: responsibilities to respect the freedom of others. + + For example, if you distribute copies of such a program, whether +gratis or for a fee, you must pass on to the recipients the same +freedoms that you received. You must make sure that they, too, receive +or can get the source code. And you must show them these terms so they +know their rights. + + Developers that use the GNU GPL protect your rights with two steps: +(1) assert copyright on the software, and (2) offer you this License +giving you legal permission to copy, distribute and/or modify it. + + For the developers' and authors' protection, the GPL clearly explains +that there is no warranty for this free software. For both users' and +authors' sake, the GPL requires that modified versions be marked as +changed, so that their problems will not be attributed erroneously to +authors of previous versions. + + Some devices are designed to deny users access to install or run +modified versions of the software inside them, although the manufacturer +can do so. This is fundamentally incompatible with the aim of +protecting users' freedom to change the software. The systematic +pattern of such abuse occurs in the area of products for individuals to +use, which is precisely where it is most unacceptable. Therefore, we +have designed this version of the GPL to prohibit the practice for those +products. If such problems arise substantially in other domains, we +stand ready to extend this provision to those domains in future versions +of the GPL, as needed to protect the freedom of users. + + Finally, every program is threatened constantly by software patents. +States should not allow patents to restrict development and use of +software on general-purpose computers, but in those that do, we wish to +avoid the special danger that patents applied to a free program could +make it effectively proprietary. To prevent this, the GPL assures that +patents cannot be used to render the program non-free. 
+ + The precise terms and conditions for copying, distribution and +modification follow. + + TERMS AND CONDITIONS + + 0. Definitions. + + "This License" refers to version 3 of the GNU General Public License. + + "Copyright" also means copyright-like laws that apply to other kinds of +works, such as semiconductor masks. + + "The Program" refers to any copyrightable work licensed under this +License. Each licensee is addressed as "you". "Licensees" and +"recipients" may be individuals or organizations. + + To "modify" a work means to copy from or adapt all or part of the work +in a fashion requiring copyright permission, other than the making of an +exact copy. The resulting work is called a "modified version" of the +earlier work or a work "based on" the earlier work. + + A "covered work" means either the unmodified Program or a work based +on the Program. + + To "propagate" a work means to do anything with it that, without +permission, would make you directly or secondarily liable for +infringement under applicable copyright law, except executing it on a +computer or modifying a private copy. Propagation includes copying, +distribution (with or without modification), making available to the +public, and in some countries other activities as well. + + To "convey" a work means any kind of propagation that enables other +parties to make or receive copies. Mere interaction with a user through +a computer network, with no transfer of a copy, is not conveying. + + An interactive user interface displays "Appropriate Legal Notices" +to the extent that it includes a convenient and prominently visible +feature that (1) displays an appropriate copyright notice, and (2) +tells the user that there is no warranty for the work (except to the +extent that warranties are provided), that licensees may convey the +work under this License, and how to view a copy of this License. If +the interface presents a list of user commands or options, such as a +menu, a prominent item in the list meets this criterion. + + 1. Source Code. + + The "source code" for a work means the preferred form of the work +for making modifications to it. "Object code" means any non-source +form of a work. + + A "Standard Interface" means an interface that either is an official +standard defined by a recognized standards body, or, in the case of +interfaces specified for a particular programming language, one that +is widely used among developers working in that language. + + The "System Libraries" of an executable work include anything, other +than the work as a whole, that (a) is included in the normal form of +packaging a Major Component, but which is not part of that Major +Component, and (b) serves only to enable use of the work with that +Major Component, or to implement a Standard Interface for which an +implementation is available to the public in source code form. A +"Major Component", in this context, means a major essential component +(kernel, window system, and so on) of the specific operating system +(if any) on which the executable work runs, or a compiler used to +produce the work, or an object code interpreter used to run it. + + The "Corresponding Source" for a work in object code form means all +the source code needed to generate, install, and (for an executable +work) run the object code and to modify the work, including scripts to +control those activities. 
However, it does not include the work's +System Libraries, or general-purpose tools or generally available free +programs which are used unmodified in performing those activities but +which are not part of the work. For example, Corresponding Source +includes interface definition files associated with source files for +the work, and the source code for shared libraries and dynamically +linked subprograms that the work is specifically designed to require, +such as by intimate data communication or control flow between those +subprograms and other parts of the work. + + The Corresponding Source need not include anything that users +can regenerate automatically from other parts of the Corresponding +Source. + + The Corresponding Source for a work in source code form is that +same work. + + 2. Basic Permissions. + + All rights granted under this License are granted for the term of +copyright on the Program, and are irrevocable provided the stated +conditions are met. This License explicitly affirms your unlimited +permission to run the unmodified Program. The output from running a +covered work is covered by this License only if the output, given its +content, constitutes a covered work. This License acknowledges your +rights of fair use or other equivalent, as provided by copyright law. + + You may make, run and propagate covered works that you do not +convey, without conditions so long as your license otherwise remains +in force. You may convey covered works to others for the sole purpose +of having them make modifications exclusively for you, or provide you +with facilities for running those works, provided that you comply with +the terms of this License in conveying all material for which you do +not control copyright. Those thus making or running the covered works +for you must do so exclusively on your behalf, under your direction +and control, on terms that prohibit them from making any copies of +your copyrighted material outside their relationship with you. + + Conveying under any other circumstances is permitted solely under +the conditions stated below. Sublicensing is not allowed; section 10 +makes it unnecessary. + + 3. Protecting Users' Legal Rights From Anti-Circumvention Law. + + No covered work shall be deemed part of an effective technological +measure under any applicable law fulfilling obligations under article +11 of the WIPO copyright treaty adopted on 20 December 1996, or +similar laws prohibiting or restricting circumvention of such +measures. + + When you convey a covered work, you waive any legal power to forbid +circumvention of technological measures to the extent such circumvention +is effected by exercising rights under this License with respect to +the covered work, and you disclaim any intention to limit operation or +modification of the work as a means of enforcing, against the work's +users, your or third parties' legal rights to forbid circumvention of +technological measures. + + 4. Conveying Verbatim Copies. + + You may convey verbatim copies of the Program's source code as you +receive it, in any medium, provided that you conspicuously and +appropriately publish on each copy an appropriate copyright notice; +keep intact all notices stating that this License and any +non-permissive terms added in accord with section 7 apply to the code; +keep intact all notices of the absence of any warranty; and give all +recipients a copy of this License along with the Program. 
+ + You may charge any price or no price for each copy that you convey, +and you may offer support or warranty protection for a fee. + + 5. Conveying Modified Source Versions. + + You may convey a work based on the Program, or the modifications to +produce it from the Program, in the form of source code under the +terms of section 4, provided that you also meet all of these conditions: + + a) The work must carry prominent notices stating that you modified + it, and giving a relevant date. + + b) The work must carry prominent notices stating that it is + released under this License and any conditions added under section + 7. This requirement modifies the requirement in section 4 to + "keep intact all notices". + + c) You must license the entire work, as a whole, under this + License to anyone who comes into possession of a copy. This + License will therefore apply, along with any applicable section 7 + additional terms, to the whole of the work, and all its parts, + regardless of how they are packaged. This License gives no + permission to license the work in any other way, but it does not + invalidate such permission if you have separately received it. + + d) If the work has interactive user interfaces, each must display + Appropriate Legal Notices; however, if the Program has interactive + interfaces that do not display Appropriate Legal Notices, your + work need not make them do so. + + A compilation of a covered work with other separate and independent +works, which are not by their nature extensions of the covered work, +and which are not combined with it such as to form a larger program, +in or on a volume of a storage or distribution medium, is called an +"aggregate" if the compilation and its resulting copyright are not +used to limit the access or legal rights of the compilation's users +beyond what the individual works permit. Inclusion of a covered work +in an aggregate does not cause this License to apply to the other +parts of the aggregate. + + 6. Conveying Non-Source Forms. + + You may convey a covered work in object code form under the terms +of sections 4 and 5, provided that you also convey the +machine-readable Corresponding Source under the terms of this License, +in one of these ways: + + a) Convey the object code in, or embodied in, a physical product + (including a physical distribution medium), accompanied by the + Corresponding Source fixed on a durable physical medium + customarily used for software interchange. + + b) Convey the object code in, or embodied in, a physical product + (including a physical distribution medium), accompanied by a + written offer, valid for at least three years and valid for as + long as you offer spare parts or customer support for that product + model, to give anyone who possesses the object code either (1) a + copy of the Corresponding Source for all the software in the + product that is covered by this License, on a durable physical + medium customarily used for software interchange, for a price no + more than your reasonable cost of physically performing this + conveying of source, or (2) access to copy the + Corresponding Source from a network server at no charge. + + c) Convey individual copies of the object code with a copy of the + written offer to provide the Corresponding Source. This + alternative is allowed only occasionally and noncommercially, and + only if you received the object code with such an offer, in accord + with subsection 6b. 
+ + d) Convey the object code by offering access from a designated + place (gratis or for a charge), and offer equivalent access to the + Corresponding Source in the same way through the same place at no + further charge. You need not require recipients to copy the + Corresponding Source along with the object code. If the place to + copy the object code is a network server, the Corresponding Source + may be on a different server (operated by you or a third party) + that supports equivalent copying facilities, provided you maintain + clear directions next to the object code saying where to find the + Corresponding Source. Regardless of what server hosts the + Corresponding Source, you remain obligated to ensure that it is + available for as long as needed to satisfy these requirements. + + e) Convey the object code using peer-to-peer transmission, provided + you inform other peers where the object code and Corresponding + Source of the work are being offered to the general public at no + charge under subsection 6d. + + A separable portion of the object code, whose source code is excluded +from the Corresponding Source as a System Library, need not be +included in conveying the object code work. + + A "User Product" is either (1) a "consumer product", which means any +tangible personal property which is normally used for personal, family, +or household purposes, or (2) anything designed or sold for incorporation +into a dwelling. In determining whether a product is a consumer product, +doubtful cases shall be resolved in favor of coverage. For a particular +product received by a particular user, "normally used" refers to a +typical or common use of that class of product, regardless of the status +of the particular user or of the way in which the particular user +actually uses, or expects or is expected to use, the product. A product +is a consumer product regardless of whether the product has substantial +commercial, industrial or non-consumer uses, unless such uses represent +the only significant mode of use of the product. + + "Installation Information" for a User Product means any methods, +procedures, authorization keys, or other information required to install +and execute modified versions of a covered work in that User Product from +a modified version of its Corresponding Source. The information must +suffice to ensure that the continued functioning of the modified object +code is in no case prevented or interfered with solely because +modification has been made. + + If you convey an object code work under this section in, or with, or +specifically for use in, a User Product, and the conveying occurs as +part of a transaction in which the right of possession and use of the +User Product is transferred to the recipient in perpetuity or for a +fixed term (regardless of how the transaction is characterized), the +Corresponding Source conveyed under this section must be accompanied +by the Installation Information. But this requirement does not apply +if neither you nor any third party retains the ability to install +modified object code on the User Product (for example, the work has +been installed in ROM). + + The requirement to provide Installation Information does not include a +requirement to continue to provide support service, warranty, or updates +for a work that has been modified or installed by the recipient, or for +the User Product in which it has been modified or installed. 
Access to a +network may be denied when the modification itself materially and +adversely affects the operation of the network or violates the rules and +protocols for communication across the network. + + Corresponding Source conveyed, and Installation Information provided, +in accord with this section must be in a format that is publicly +documented (and with an implementation available to the public in +source code form), and must require no special password or key for +unpacking, reading or copying. + + 7. Additional Terms. + + "Additional permissions" are terms that supplement the terms of this +License by making exceptions from one or more of its conditions. +Additional permissions that are applicable to the entire Program shall +be treated as though they were included in this License, to the extent +that they are valid under applicable law. If additional permissions +apply only to part of the Program, that part may be used separately +under those permissions, but the entire Program remains governed by +this License without regard to the additional permissions. + + When you convey a copy of a covered work, you may at your option +remove any additional permissions from that copy, or from any part of +it. (Additional permissions may be written to require their own +removal in certain cases when you modify the work.) You may place +additional permissions on material, added by you to a covered work, +for which you have or can give appropriate copyright permission. + + Notwithstanding any other provision of this License, for material you +add to a covered work, you may (if authorized by the copyright holders of +that material) supplement the terms of this License with terms: + + a) Disclaiming warranty or limiting liability differently from the + terms of sections 15 and 16 of this License; or + + b) Requiring preservation of specified reasonable legal notices or + author attributions in that material or in the Appropriate Legal + Notices displayed by works containing it; or + + c) Prohibiting misrepresentation of the origin of that material, or + requiring that modified versions of such material be marked in + reasonable ways as different from the original version; or + + d) Limiting the use for publicity purposes of names of licensors or + authors of the material; or + + e) Declining to grant rights under trademark law for use of some + trade names, trademarks, or service marks; or + + f) Requiring indemnification of licensors and authors of that + material by anyone who conveys the material (or modified versions of + it) with contractual assumptions of liability to the recipient, for + any liability that these contractual assumptions directly impose on + those licensors and authors. + + All other non-permissive additional terms are considered "further +restrictions" within the meaning of section 10. If the Program as you +received it, or any part of it, contains a notice stating that it is +governed by this License along with a term that is a further +restriction, you may remove that term. If a license document contains +a further restriction but permits relicensing or conveying under this +License, you may add to a covered work material governed by the terms +of that license document, provided that the further restriction does +not survive such relicensing or conveying. 
+ + If you add terms to a covered work in accord with this section, you +must place, in the relevant source files, a statement of the +additional terms that apply to those files, or a notice indicating +where to find the applicable terms. + + Additional terms, permissive or non-permissive, may be stated in the +form of a separately written license, or stated as exceptions; +the above requirements apply either way. + + 8. Termination. + + You may not propagate or modify a covered work except as expressly +provided under this License. Any attempt otherwise to propagate or +modify it is void, and will automatically terminate your rights under +this License (including any patent licenses granted under the third +paragraph of section 11). + + However, if you cease all violation of this License, then your +license from a particular copyright holder is reinstated (a) +provisionally, unless and until the copyright holder explicitly and +finally terminates your license, and (b) permanently, if the copyright +holder fails to notify you of the violation by some reasonable means +prior to 60 days after the cessation. + + Moreover, your license from a particular copyright holder is +reinstated permanently if the copyright holder notifies you of the +violation by some reasonable means, this is the first time you have +received notice of violation of this License (for any work) from that +copyright holder, and you cure the violation prior to 30 days after +your receipt of the notice. + + Termination of your rights under this section does not terminate the +licenses of parties who have received copies or rights from you under +this License. If your rights have been terminated and not permanently +reinstated, you do not qualify to receive new licenses for the same +material under section 10. + + 9. Acceptance Not Required for Having Copies. + + You are not required to accept this License in order to receive or +run a copy of the Program. Ancillary propagation of a covered work +occurring solely as a consequence of using peer-to-peer transmission +to receive a copy likewise does not require acceptance. However, +nothing other than this License grants you permission to propagate or +modify any covered work. These actions infringe copyright if you do +not accept this License. Therefore, by modifying or propagating a +covered work, you indicate your acceptance of this License to do so. + + 10. Automatic Licensing of Downstream Recipients. + + Each time you convey a covered work, the recipient automatically +receives a license from the original licensors, to run, modify and +propagate that work, subject to this License. You are not responsible +for enforcing compliance by third parties with this License. + + An "entity transaction" is a transaction transferring control of an +organization, or substantially all assets of one, or subdividing an +organization, or merging organizations. If propagation of a covered +work results from an entity transaction, each party to that +transaction who receives a copy of the work also receives whatever +licenses to the work the party's predecessor in interest had or could +give under the previous paragraph, plus a right to possession of the +Corresponding Source of the work from the predecessor in interest, if +the predecessor has it or can get it with reasonable efforts. + + You may not impose any further restrictions on the exercise of the +rights granted or affirmed under this License. 
For example, you may +not impose a license fee, royalty, or other charge for exercise of +rights granted under this License, and you may not initiate litigation +(including a cross-claim or counterclaim in a lawsuit) alleging that +any patent claim is infringed by making, using, selling, offering for +sale, or importing the Program or any portion of it. + + 11. Patents. + + A "contributor" is a copyright holder who authorizes use under this +License of the Program or a work on which the Program is based. The +work thus licensed is called the contributor's "contributor version". + + A contributor's "essential patent claims" are all patent claims +owned or controlled by the contributor, whether already acquired or +hereafter acquired, that would be infringed by some manner, permitted +by this License, of making, using, or selling its contributor version, +but do not include claims that would be infringed only as a +consequence of further modification of the contributor version. For +purposes of this definition, "control" includes the right to grant +patent sublicenses in a manner consistent with the requirements of +this License. + + Each contributor grants you a non-exclusive, worldwide, royalty-free +patent license under the contributor's essential patent claims, to +make, use, sell, offer for sale, import and otherwise run, modify and +propagate the contents of its contributor version. + + In the following three paragraphs, a "patent license" is any express +agreement or commitment, however denominated, not to enforce a patent +(such as an express permission to practice a patent or covenant not to +sue for patent infringement). To "grant" such a patent license to a +party means to make such an agreement or commitment not to enforce a +patent against the party. + + If you convey a covered work, knowingly relying on a patent license, +and the Corresponding Source of the work is not available for anyone +to copy, free of charge and under the terms of this License, through a +publicly available network server or other readily accessible means, +then you must either (1) cause the Corresponding Source to be so +available, or (2) arrange to deprive yourself of the benefit of the +patent license for this particular work, or (3) arrange, in a manner +consistent with the requirements of this License, to extend the patent +license to downstream recipients. "Knowingly relying" means you have +actual knowledge that, but for the patent license, your conveying the +covered work in a country, or your recipient's use of the covered work +in a country, would infringe one or more identifiable patents in that +country that you have reason to believe are valid. + + If, pursuant to or in connection with a single transaction or +arrangement, you convey, or propagate by procuring conveyance of, a +covered work, and grant a patent license to some of the parties +receiving the covered work authorizing them to use, propagate, modify +or convey a specific copy of the covered work, then the patent license +you grant is automatically extended to all recipients of the covered +work and works based on it. + + A patent license is "discriminatory" if it does not include within +the scope of its coverage, prohibits the exercise of, or is +conditioned on the non-exercise of one or more of the rights that are +specifically granted under this License. 
You may not convey a covered +work if you are a party to an arrangement with a third party that is +in the business of distributing software, under which you make payment +to the third party based on the extent of your activity of conveying +the work, and under which the third party grants, to any of the +parties who would receive the covered work from you, a discriminatory +patent license (a) in connection with copies of the covered work +conveyed by you (or copies made from those copies), or (b) primarily +for and in connection with specific products or compilations that +contain the covered work, unless you entered into that arrangement, +or that patent license was granted, prior to 28 March 2007. + + Nothing in this License shall be construed as excluding or limiting +any implied license or other defenses to infringement that may +otherwise be available to you under applicable patent law. + + 12. No Surrender of Others' Freedom. + + If conditions are imposed on you (whether by court order, agreement or +otherwise) that contradict the conditions of this License, they do not +excuse you from the conditions of this License. If you cannot convey a +covered work so as to satisfy simultaneously your obligations under this +License and any other pertinent obligations, then as a consequence you may +not convey it at all. For example, if you agree to terms that obligate you +to collect a royalty for further conveying from those to whom you convey +the Program, the only way you could satisfy both those terms and this +License would be to refrain entirely from conveying the Program. + + 13. Use with the GNU Affero General Public License. + + Notwithstanding any other provision of this License, you have +permission to link or combine any covered work with a work licensed +under version 3 of the GNU Affero General Public License into a single +combined work, and to convey the resulting work. The terms of this +License will continue to apply to the part which is the covered work, +but the special requirements of the GNU Affero General Public License, +section 13, concerning interaction through a network will apply to the +combination as such. + + 14. Revised Versions of this License. + + The Free Software Foundation may publish revised and/or new versions of +the GNU General Public License from time to time. Such new versions will +be similar in spirit to the present version, but may differ in detail to +address new problems or concerns. + + Each version is given a distinguishing version number. If the +Program specifies that a certain numbered version of the GNU General +Public License "or any later version" applies to it, you have the +option of following the terms and conditions either of that numbered +version or of any later version published by the Free Software +Foundation. If the Program does not specify a version number of the +GNU General Public License, you may choose any version ever published +by the Free Software Foundation. + + If the Program specifies that a proxy can decide which future +versions of the GNU General Public License can be used, that proxy's +public statement of acceptance of a version permanently authorizes you +to choose that version for the Program. + + Later license versions may give you additional or different +permissions. However, no additional obligations are imposed on any +author or copyright holder as a result of your choosing to follow a +later version. + + 15. Disclaimer of Warranty. 
+ + THERE IS NO WARRANTY FOR THE PROGRAM, TO THE EXTENT PERMITTED BY +APPLICABLE LAW. EXCEPT WHEN OTHERWISE STATED IN WRITING THE COPYRIGHT +HOLDERS AND/OR OTHER PARTIES PROVIDE THE PROGRAM "AS IS" WITHOUT WARRANTY +OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, +THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR +PURPOSE. THE ENTIRE RISK AS TO THE QUALITY AND PERFORMANCE OF THE PROGRAM +IS WITH YOU. SHOULD THE PROGRAM PROVE DEFECTIVE, YOU ASSUME THE COST OF +ALL NECESSARY SERVICING, REPAIR OR CORRECTION. + + 16. Limitation of Liability. + + IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING +WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MODIFIES AND/OR CONVEYS +THE PROGRAM AS PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES, INCLUDING ANY +GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING OUT OF THE +USE OR INABILITY TO USE THE PROGRAM (INCLUDING BUT NOT LIMITED TO LOSS OF +DATA OR DATA BEING RENDERED INACCURATE OR LOSSES SUSTAINED BY YOU OR THIRD +PARTIES OR A FAILURE OF THE PROGRAM TO OPERATE WITH ANY OTHER PROGRAMS), +EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE POSSIBILITY OF +SUCH DAMAGES. + + 17. Interpretation of Sections 15 and 16. + + If the disclaimer of warranty and limitation of liability provided +above cannot be given local legal effect according to their terms, +reviewing courts shall apply local law that most closely approximates +an absolute waiver of all civil liability in connection with the +Program, unless a warranty or assumption of liability accompanies a +copy of the Program in return for a fee. + + END OF TERMS AND CONDITIONS + + How to Apply These Terms to Your New Programs + + If you develop a new program, and you want it to be of the greatest +possible use to the public, the best way to achieve this is to make it +free software which everyone can redistribute and change under these terms. + + To do so, attach the following notices to the program. It is safest +to attach them to the start of each source file to most effectively +state the exclusion of warranty; and each file should have at least +the "copyright" line and a pointer to where the full notice is found. + + <one line to give the program's name and a brief idea of what it does.> + Copyright (C) <year> <name of author> + + This program is free software: you can redistribute it and/or modify + it under the terms of the GNU General Public License as published by + the Free Software Foundation, either version 3 of the License, or + (at your option) any later version. + + This program is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the + GNU General Public License for more details. + + You should have received a copy of the GNU General Public License + along with this program. If not, see <https://www.gnu.org/licenses/>. + +Also add information on how to contact you by electronic and paper mail. + + If the program does terminal interaction, make it output a short +notice like this when it starts in an interactive mode: + + <program> Copyright (C) <year> <name of author> + This program comes with ABSOLUTELY NO WARRANTY; for details type `show w'. + This is free software, and you are welcome to redistribute it + under certain conditions; type `show c' for details. + +The hypothetical commands `show w' and `show c' should show the appropriate +parts of the General Public License. 
Of course, your program's commands +might be different; for a GUI interface, you would use an "about box". + + You should also get your employer (if you work as a programmer) or school, +if any, to sign a "copyright disclaimer" for the program, if necessary. +For more information on this, and how to apply and follow the GNU GPL, see +<https://www.gnu.org/licenses/>. + + The GNU General Public License does not permit incorporating your program +into proprietary programs. If your program is a subroutine library, you +may consider it more useful to permit linking proprietary applications with +the library. If this is what you want to do, use the GNU Lesser General +Public License instead of this License. But first, please read +<https://www.gnu.org/licenses/why-not-lgpl.html>. ++ +
Contains the textstat functions formerly in quanteda. For more details, see https://quanteda.io.
+The normal way from CRAN, using your R GUI or
+
+install.packages("quanteda.textstats")
Or for the latest development version:
+
+# remotes package required to install quanteda.textstats from GitHub
+remotes::install_github("quanteda/quanteda.textstats")
Because this compiles some C++ and Fortran source code, you will need to have installed the appropriate compilers.
+If you are using a Windows platform, this means you will need also to install the Rtools software available from CRAN.
+If you are using macOS, you should install the macOS tools, namely the Clang 6.x compiler and the GNU Fortran compiler (as quanteda.textstats requires gfortran to build). If you are still getting errors related to gfortran, follow the fixes here.
+Benoit, Kenneth, Kohei Watanabe, Haiyan Wang, Paul Nulty, Adam Obeng, Stefan Müller, and Akitaka Matsuo. (2018) “quanteda: An R package for the quantitative analysis of textual data”. Journal of Open Source Software. 3(30), 774. https://doi.org/10.21105/joss.00774.
+For a BibTeX entry, use the output from citation(package = "quanteda.textstats").
NEWS.md
* `[` now works for textstat outputs, to fix #50.
* Updated `textstat_simil()` for new proxyC version v0.2.2, which affects how similarities are returned for `NA` values. See #45.
* Updated `textstat_simil()` for new proxyC version v0.2.0.
* Returns `NA`, without failure, for ICU versions older than 9 (#35 and #24).
* Updated `groups` in `textstat_frequency()` to operate as in quanteda v3.
* Removed uses of `stringsAsFactors` in `data.frame()`.
* Added `textstat_summary()` and associated functions and tests.
and
+textstat_dist()
.
# S3 method for textstat_proxy
+as.list(x, sorted = TRUE, n = NULL, diag = FALSE, ...)
+
+# S3 method for textstat_proxy
+as.data.frame(
+ x,
+ row.names = NULL,
+ optional = FALSE,
+ diag = FALSE,
+ upper = FALSE,
+ ...
+)
x: any R object.

sorted: sort results in descending order if TRUE

n: the top n highest-ranking items will be returned. If n is NULL, return all items.

diag: logical; if FALSE, exclude the item's comparison with itself

...: additional arguments to be passed to or from methods.

row.names: NULL or a character vector giving the row names for the data frame. Missing values are not allowed.

optional: logical. If TRUE, setting row names and converting column names (to syntactic names: see make.names) is optional. Note that all of R's base package as.data.frame() methods use optional only for column names treatment, basically with the meaning of data.frame(*, check.names = !optional). See also the make.names argument of the matrix method.

upper: logical; if TRUE, return pairs as both (A, B) and (B, A)
as.list for a textstat_simil or textstat_dist object returns a list equal in length to the columns of the simil or dist object, with the rows and their values as named elements. By default this list excludes same-pair comparisons (when diag = FALSE) and sorts the values in descending order (when sorted = TRUE).

as.data.frame for a textstat_simil or textstat_dist object returns a data.frame of pairwise combinations and their similarity or distance values.
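A short sketch of both coercions (the dfm below is invented for illustration):

library("quanteda")
library("quanteda.textstats")
dfmat <- dfm(tokens(c(d1 = "a b c d", d2 = "a b c e", d3 = "x y z")))
tstat <- textstat_simil(dfmat, method = "cosine")
as.list(tstat, n = 2)               # top 2 most similar documents, per document
as.data.frame(tstat, diag = FALSE)  # long format, same-pair rows excluded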
R/textstat_simil.R
+ as.matrix.textstat_simil_sparse.Rd
as.matrix method for textstat_simil_sparse
x: an object returned by textstat_simil when min_simil > 0

omitted: value that will replace the omitted cells

...: unused

a matrix object
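For instance, a minimal sketch (assuming, per the parameter description above, that the replacement value is supplied as the argument named omitted):

library("quanteda")
library("quanteda.textstats")
dfmat <- dfm(tokens(c(d1 = "a b c d", d2 = "a b c e", d3 = "w x y z")))
tstat <- textstat_simil(dfmat, method = "cosine", min_simil = 0.5)
as.matrix(tstat, omitted = NA)  # cells below min_simil are returned as NA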
+Check arguments passed to other functions via ...
+check_dots(..., method = NULL)
...: dots to check

method: the names of the functions that ... is passed to
R/textstat_lexdiv.R
+ compute_lexdiv_stats.Rd
Internal functions used in textstat_lexdiv()
, for computing
+lexical diversity measures on dfms or tokens objects
compute_lexdiv_dfm_stats(x, measure = NULL, log.base = 10)
+
+compute_lexdiv_tokens_stats(
+ x,
+ measure = c("MATTR", "MSTTR"),
+ MATTR_window,
+ MSTTR_segment
+)
x: a dfm object

measure: a list of lexical diversity measures.

log.base: a numeric value defining the base of the logarithm (for measures using logs)

MATTR_window: a numeric value defining the size of the moving window for computation of the Moving-Average Type-Token Ratio (Covington & McFall, 2010)

MSTTR_segment: a numeric value defining the size of each segment for the computation of the Mean Segmental Type-Token Ratio (Johnson, 1944)
a data.frame
with a document
column containing the
+input document name, followed by columns with the lexical diversity
+statistic, in the order in which they were supplied as the measure
argument.
compute_lexdiv_dfm_stats is an internal function that computes the lexical diversity measures from a dfm input.

compute_lexdiv_tokens_stats is an internal function that computes the lexical diversity measures from a tokens input.
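These helpers are internal; the exported entry point is textstat_lexdiv(). A brief sketch of the public interface (the text is invented for illustration):

library("quanteda")
library("quanteda.textstats")
toks <- tokens("one two three two one four five one six two seven")
textstat_lexdiv(toks, measure = c("MATTR", "MSTTR"),
                MATTR_window = 5, MSTTR_segment = 5)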
R/textstat_lexdiv.R
+ compute_mattr.Rd
From a tokens object, computes the Moving-Average Type-Token Ratio (MATTR)
+from Covington & McFall (2010), averaging all of the sequential moving
+windows of tokens of size MATTR_window
across the text, returning the
+average as the MATTR.
compute_mattr(x, MATTR_window = 100L)
x: a tokens object

MATTR_window: integer; the size of the moving window for computation of TTR, between 1 and the number of tokens of the document
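The computation reduces to averaging the TTRs of all successive windows; a base-R sketch of the idea (the helper name is hypothetical, not the package's internal implementation):

mattr_sketch <- function(toks, window = 100L) {
  window <- min(window, length(toks))
  ttrs <- vapply(seq_len(length(toks) - window + 1), function(i) {
    w <- toks[i:(i + window - 1)]
    length(unique(w)) / window          # TTR of this window
  }, numeric(1))
  mean(ttrs)                            # MATTR = mean of the window TTRs
}
mattr_sketch(c("a", "b", "a", "c", "b", "a"), window = 3)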
R/textstat_lexdiv.R
+ compute_msttr.Rd
Compute the Mean Segmental Type-Token Ratio (Johnson 1944) for a tokens input.
+compute_msttr(x, MSTTR_segment)
x: input tokens

MSTTR_segment: a numeric value defining the size of each segment for the computation of the Mean Segmental Type-Token Ratio (Johnson, 1944)
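Analogously, a base-R sketch of the segmental computation (helper name hypothetical; assumes only complete segments are scored, following Johnson 1944):

msttr_sketch <- function(toks, segment = 100L) {
  n_seg <- length(toks) %/% segment
  ttrs <- vapply(seq_len(n_seg), function(i) {
    w <- toks[((i - 1) * segment + 1):(i * segment)]
    length(unique(w)) / segment         # TTR of this segment
  }, numeric(1))
  mean(ttrs)                            # MSTTR = mean of the segment TTRs
}
msttr_sketch(c("a", "b", "a", "c", "b", "a", "d", "e", "a", "b"), segment = 5)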
R/textstat_readability.R
+ data_char_wordlists.Rd
data_char_wordlists
provides word lists used in some readability indexes;
+it is a named list of character vectors where each list element
+corresponds to a different readability index.
data_char_wordlists
A list of length two:
DaleChall
The long Dale-Chall list of 3,000 familiar (English) +words needed to compute the Dale-Chall Readability Formula.
Spache
The revised Spache word list (see Klare 1975, 73; Spache +1974) needed to compute the Spache Revised Formula of readability (Spache +1953).
Chall, J.S., & Dale, E. (1995). Readability Revisited: The New +Dale-Chall Readability Formula. Brookline Books.
+Dale, E. & Chall, J.S. (1948). A Formula for Predicting +Readability. Educational Research Bulletin, 27(1): 11--20.
+Dale, E. & Chall, J.S. (1948). A Formula for Predicting Readability: +Instructions. Educational Research Bulletin, 27(2): 37--54.
+Klare, G.R. (1975). Assessing Readability. Reading Research Quarterly +10(1), 62--102.
+Spache, G. (1953). A New Readability Formula for Primary-Grade Reading +Materials. The Elementary School Journal, 53, 410--413.
+Spache, G. (1974). Good reading for poor readers. (Rvd. 9th Ed.) +Champaign, Illinois: Garrard, 1974.
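To inspect the lists directly:

library("quanteda.textstats")
names(data_char_wordlists)
head(data_char_wordlists$DaleChall)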
+R/textstat_lexdiv.R
+ dfm_split_hyphenated_features.Rd
Takes a dfm that contains features with hyphenated words, such as "split-second", and turns them into features that split the elements in the same way as tokens(x, remove_hyphens = TRUE) would have done.
dfm_split_hyphenated_features(x)
x: input dfm
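For comparison, the tokens-level behaviour it mirrors (note that in current quanteda releases the tokens() argument is split_hyphens rather than the older remove_hyphens):

library("quanteda")
tokens("a split-second decision", split_hyphens = TRUE)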
R/textstat_simil.R
+ diag2na.Rd
Converts the diagonal, or the same-pair equivalent in an object +where the columns have been selected, to NA.
+diag2na(x)
x: the return from textstat_simil() or textstat_dist()

sparse Matrix format with same-pair values replaced with NA
R/textstat_simil.R
+ head.textstat_proxy.Rd
For a similarity or distance object computed via textstat_simil or
+textstat_dist, returns the first or last n
rows.
x: a textstat_simil/textstat_dist object

n: a single integer. If positive, the size of the resulting object: the number of first/last documents for the dfm. If negative, all but the n last/first number of documents of x.

...: unused
A matrix corresponding to the subset defined by n.
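A brief sketch of its use:

library("quanteda")
library("quanteda.textstats")
tstat <- textstat_simil(dfm(tokens(data_corpus_inaugural[1:5])))
head(tstat, 2)  # similarity rows for the first two documents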
All functions

data_char_wordlists : Word lists for readability statistics
textstat_collocations() : Identify and score multi-word expressions
textstat_entropy() : Compute entropies of documents or features
textstat_frequency() : Tabulate feature frequencies
textstat_keyness() : Calculate keyness statistics
textstat_lexdiv() : Calculate lexical diversity
textstat_readability() : Calculate readability
textstat_simil() textstat_dist() : Similarity and distance computation between documents or features
textstat_summary() : Summarize documents as syntactic and lexical feature counts
Tally the Scrabble letter values of text given a user-supplied function, such +as the sum (default) or mean of the character values.
+nscrabble(x, FUN = sum)
x: a character vector

FUN: function to be applied to the character values in the text; default is sum, but could also be mean or a user-supplied function. Missing values are automatically removed.
a (named) integer vector of Scrabble letter values, computed using
+FUN
, corresponding to the input text(s)
Character values are only defined for non-accented Latin a-z, A-Z +letters. Lower-casing is unnecessary.
+We would be happy to add more languages to this extremely useful +function if you send us the values for your language!
+nscrabble(c("muzjiks", "excellency"))
+#> [1] 29 24
+nscrabble(quanteda::data_corpus_inaugural[1:5], mean)
+#> 1789-Washington 1793-Washington 1797-Adams 1801-Jefferson 1805-Jefferson
+#> 1.706789 1.721875 1.624590 1.678183 1.663654
+
Extends nsyllable()
methods for tokens objects.
# S3 method for tokens
+nsyllable(
+ x,
+ language = "en",
+ syllable_dictionary = nsyllable::data_syllables_en,
+ use.names = FALSE
+)
x: character vector whose syllables will be counted. This will count all syllables in a character vector without regard to separating tokens, so it is recommended that x be individual terms.

language: specify the language for syllable counts by ISO 639-1 code. The default is English, using the data object data_syllables_en, an English pronunciation dictionary from CMU.

syllable_dictionary: optional named integer vector of syllable counts where the names are lower case tokens. When NULL (the default), the language setting is used; if a syllable dictionary is supplied, it overrides the language argument.

use.names: logical; if TRUE, assign the tokens as the names of the syllable count vector
+library("nsyllable")
+library("nsyllable")
+txt <- c(one = "super freakily yes",
+ two = "merrily all go aerodynamic")
+toks <- quanteda::tokens(txt)
+nsyllable(toks)
+#> $one
+#> [1] 2 3 1
+#>
+#> $two
+#> [1] 3 1 1 5
+#>
+
R/quanteda.textstats-package.R
+ quanteda.textstats-package.Rd
Textual statistics functions formerly in the 'quanteda' package. Textual statistics for characterizing and comparing textual data. Includes functions for measuring term and document frequency, the co-occurrence of words, similarity and distance between features and documents, feature entropy, keyword occurrence, readability, and lexical diversity. These functions extend the 'quanteda' package and are specially designed for sparse textual data.
+Useful links:
Report bugs at https://github.com/quanteda/quanteda.textstats/issues
R/textstat_collocations.R
+ textstat_collocations.Rd
Identify and score multi-word expressions, or adjacent fixed-length +collocations, from text.
+textstat_collocations(
+ x,
+ method = "lambda",
+ size = 2,
+ min_count = 2,
+ smoothing = 0.5,
+ tolower = TRUE,
+ ...
+)
x: a character, corpus, or tokens object whose collocations will be scored. The tokens object should include punctuation, and if any words have been removed, these should have been removed with padding = TRUE. While identifying collocations for tokens objects is supported, you will get better results with character or corpus objects due to relatively imperfect detection of sentence boundaries from texts already tokenized.

method: association measure for detecting collocations. Currently this is limited to "lambda". See Details.

size: integer; the length of the collocations to be scored

min_count: numeric; minimum frequency of collocations that will be scored

smoothing: numeric; a smoothing parameter added to the observed counts (default is 0.5)

tolower: logical; if TRUE, form collocations as lower-cased combinations

...: additional arguments passed to tokens()
textstat_collocations
returns a data.frame of collocations and
+their scores and statistics. This consists of the collocations, their
+counts, length, and \(\lambda\) and \(z\) statistics. When size
is a
+vector, then count_nested
counts the lower-order collocations that occur
+within a higher-order collocation (but this does not affect the
+statistics).
Documents are grouped for the purposes of scoring, but collocations will not
+span sentences. If x
is a tokens object and some tokens have been
+removed, this should be done using tokens_remove(x, pattern, padding = TRUE)
so that counts will still be accurate, but the pads will prevent those
+collocations from being scored.
The lambda
computed for a size = \(K\)-word target multi-word expression
+the coefficient for the \(K\)-way interaction parameter in the saturated
+log-linear model fitted to the counts of the terms forming the set of
+eligible multi-word expressions. This is the same as the "lambda" computed in
+Blaheta and Johnson's (2001), where all multi-word expressions are considered
+(rather than just verbs, as in that paper). The z
is the Wald
+\(z\)-statistic computed as the quotient of lambda
and the Wald statistic
+for lambda
as described below.
In detail:
+Consider a \(K\)-word target expression \(x\), and let \(z\) be any
+\(K\)-word expression. Define a comparison function \(c(x,z)=(j_{1},
+\dots, j_{K})=c\) such that the \(k\)th element of \(c\) is 1 if the
+\(k\)th word in \(z\) is equal to the \(k\)th word in \(x\), and 0
+otherwise. Let \(c_{i}=(j_{i1}, \dots, j_{iK})\), \(i=1, \dots,
+2^{K}=M\), be the possible values of \(c(x,z)\), with \(c_{M}=(1,1,
+\dots, 1)\). Consider the set of \(c(x,z_{r})\) across all expressions
+\(z_{r}\) in a corpus of text, and let \(n_{i}\), for \(i=1,\dots,M\),
+denote the number of the \(c(x,z_{r})\) which equal \(c_{i}\), plus the
+smoothing constant smoothing
. The \(n_{i}\) are the counts in a
+\(2^{K}\) contingency table whose dimensions are defined by the
+\(c_{i}\).
\(\lambda\): The \(K\)-way interaction parameter in the saturated +loglinear model fitted to the \(n_{i}\). It can be calculated as
+$$\lambda = \sum_{i=1}^{M} (-1)^{K-b_{i}} \log n_{i}$$
+where \(b_{i}\) is the number of the elements of \(c_{i}\) which are +equal to 1.
+Wald test \(z\)-statistic \(z\) is calculated as:
+$$z = \frac{\lambda}{\left[\sum_{i=1}^{M} n_{i}^{-1}\right]^{1/2}}$$
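As a concrete check, this base-R sketch reproduces, for \(K = 2\) with smoothing 0.5 added to each cell, the lambda and z reported for the bigram "a b" in the final example below:

toks <- c("a", "b", "c", "a", "b", "d", "e", "b", "d", "a", "b")
bigr <- cbind(head(toks, -1), tail(toks, -1))           # all adjacent bigrams
n11 <- sum(bigr[, 1] == "a" & bigr[, 2] == "b") + 0.5   # both positions match
n10 <- sum(bigr[, 1] == "a" & bigr[, 2] != "b") + 0.5   # first position only
n01 <- sum(bigr[, 1] != "a" & bigr[, 2] == "b") + 0.5   # second position only
n00 <- sum(bigr[, 1] != "a" & bigr[, 2] != "b") + 0.5   # neither position
lambda <- log(n11) - log(n10) - log(n01) + log(n00)
z <- lambda / sqrt(1 / n11 + 1 / n10 + 1 / n01 + 1 / n00)
round(c(lambda = lambda, z = z), 6)
#>   lambda        z
#> 3.412247 1.936083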
Blaheta, D. & Johnson, M. (2001). Unsupervised learning of multi-word verbs. Presented at the ACL-EACL Workshop on the Computational Extraction, Analysis and Exploitation of Collocations.
+library("quanteda")
+#> Package version: 4.0.1
+#> Unicode version: 14.0
+#> ICU version: 71.1
+#> Parallel computing: disabled
+#> See https://quanteda.io for tutorials and examples.
+corp <- data_corpus_inaugural[1:2]
+head(cols <- textstat_collocations(corp, size = 2, min_count = 2), 10)
+#> collocation count count_nested length lambda z
+#> 1 have been 5 0 2 5.704259 7.354588
+#> 2 has been 3 0 2 5.565217 6.409333
+#> 3 of the 24 0 2 1.673501 6.382475
+#> 4 i have 5 0 2 3.743580 6.268303
+#> 5 which i 6 0 2 3.172217 6.135144
+#> 6 will be 4 0 2 3.868500 5.930143
+#> 7 less than 2 0 2 6.279494 5.529680
+#> 8 public good 2 0 2 6.279494 5.529680
+#> 9 you will 2 0 2 4.917893 5.431752
+#> 10 may be 3 0 2 4.190711 5.328038
+head(cols <- textstat_collocations(corp, size = 3, min_count = 2), 10)
+#> collocation count count_nested length lambda z
+#> 1 of which the 2 0 3 6.1259648 2.8317522
+#> 2 in which i 3 0 3 2.1689288 1.1741918
+#> 3 i have in 2 0 3 2.3809129 1.0618774
+#> 4 and of the 2 0 3 0.8847383 0.7498730
+#> 5 me by the 2 0 3 1.4726869 0.6560780
+#> 6 to the great 2 0 3 1.2891870 0.5660311
+#> 7 voice of my 2 0 3 1.2270130 0.5298220
+#> 8 which ought to 2 0 3 1.4083232 0.5278314
+#> 9 of the confidence 2 0 3 1.1220858 0.4948962
+#> 10 the united states 2 0 3 1.2597834 0.4272349
+
+# extracting multi-part proper nouns (capitalized terms)
+toks1 <- tokens(data_corpus_inaugural)
+toks2 <- tokens_remove(toks1, pattern = stopwords("english"), padding = TRUE)
+toks3 <- tokens_select(toks2, pattern = "^([A-Z][a-z\\-]{2,})", valuetype = "regex",
+ case_insensitive = FALSE, padding = TRUE)
+tstat <- textstat_collocations(toks3, size = 3, tolower = FALSE)
+head(tstat, 10)
+#> collocation count count_nested length lambda z
+#> 1 United States Congress 2 0 3 -2.174793 -1.025182
+#> 2 Arlington National Cemetery 2 0 3 -6.301876 -2.086677
+#> 3 Chief Justice Roberts 2 0 3 -7.818352 -3.033147
+#> 4 Vice President Bush 2 0 3 -11.741818 -4.537337
+
+# vectorized size
+txt <- c(". . . . a b c . . a b c . . . c d e",
+ "a b . . a b . . a b . . a b . a b",
+ "b c d . . b c . b c . . . b c")
+textstat_collocations(txt, size = 2:3)
+#> collocation count count_nested length lambda z
+#> 1 a b 7 2 2 5.652489e+00 2.745546e+00
+#> 2 b c 6 3 2 5.609472e+00 2.721287e+00
+#> 3 c d 2 2 2 4.976734e+00 2.354187e+00
+#> 4 a b c 2 0 3 -1.110223e-16 -3.103168e-17
+
+# compounding tokens from collocations
+toks <- tokens("This is the European Union.")
+colls <- tokens("The new European Union is not the old European Union.") %>%
+ textstat_collocations(size = 2, min_count = 1, tolower = FALSE)
+colls
+#> collocation count count_nested length lambda z
+#> 1 European Union 2 0 2 4.317488 2.027787
+#> 2 The new 1 0 2 3.931826 1.797564
+#> 3 Union is 1 0 2 3.931826 1.797564
+#> 4 is not 1 0 2 3.931826 1.797564
+#> 5 not the 1 0 2 3.931826 1.797564
+#> 6 the old 1 0 2 3.931826 1.797564
+#> 7 new European 1 0 2 2.708050 1.454456
+#> 8 old European 1 0 2 2.708050 1.454456
+tokens_compound(toks, colls, case_insensitive = FALSE)
+#> Tokens consisting of 1 document.
+#> text1 :
+#> [1] "This" "is" "the" "European_Union"
+#> [5] "."
+#>
+
+# from a collocations object
+(coll <- textstat_collocations(tokens("a b c a b d e b d a b")))
+#> collocation count count_nested length lambda z
+#> 1 a b 3 0 2 3.412247 1.936083
+#> 2 b d 2 0 2 3.218876 1.799406
+phrase(coll)
+#> [[1]]
+#> [1] "a" "b"
+#>
+#> [[2]]
+#> [1] "b" "d"
+#>
+
Compute entropies of documents or features
+textstat_entropy(x, margin = c("documents", "features"), base = 2)
x: a dfm

margin: character indicating for which margin to compute entropy

base: base for logarithm function
a data.frame of entropies for the given document or feature
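A hand check of the definition (a sketch, assuming the entropy is taken over each document's relative term frequencies in the given log base):

p <- c(2, 1, 1) / 4         # relative frequencies of a 4-token document
-sum(p * log(p, base = 2))  # 1.5 bits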
+library("quanteda")
+textstat_entropy(data_dfm_lbgexample)
+#> document entropy
+#> 1 R1 3.386943
+#> 2 R2 3.386943
+#> 3 R3 3.386943
+#> 4 R4 3.386943
+#> 5 R5 3.386943
+#> 6 V1 3.386943
+textstat_entropy(data_dfm_lbgexample, "features")
+#> feature entropy
+#> 1 A 0.0000000
+#> 2 B 0.0000000
+#> 3 C 0.0000000
+#> 4 D 0.0000000
+#> 5 E 0.0000000
+#> 6 F 0.1686609
+#> 7 G 0.1708952
+#> 8 H 0.4371120
+#> 9 I 0.6476138
+#> 10 J 1.0338027
+#> 11 K 1.4131631
+#> 12 L 1.5669101
+#> 13 M 1.5996467
+#> 14 N 1.5656144
+#> 15 O 1.5806321
+#> 16 P 1.6267307
+#> 17 Q 1.6414915
+#> 18 R 1.6034693
+#> 19 S 1.5561626
+#> 20 T 1.5311306
+#> 21 U 1.4979274
+#> 22 V 1.3664642
+#> 23 W 1.1291805
+#> 24 X 1.0439334
+#> 25 Y 1.0338027
+#> 26 Z 1.0726302
+#> 27 ZA 1.0458291
+#> 28 ZB 0.7876499
+#> 29 ZC 0.5357150
+#> 30 ZD 0.3435197
+#> 31 ZE 0.1708952
+#> 32 ZF 0.1686609
+#> 33 ZG 0.0000000
+#> 34 ZH 0.0000000
+#> 35 ZI 0.0000000
+#> 36 ZJ 0.0000000
+#> 37 ZK 0.0000000
+
Produces counts and document frequency summaries of the features in a dfm, optionally grouped by a docvars variable or other supplied grouping variable.
+textstat_frequency(
+ x,
+ n = NULL,
+ groups = NULL,
+ ties_method = c("min", "average", "first", "random", "max", "dense"),
+ ...
+)
x: a dfm object
n: (optional) integer specifying the top n features to be returned, within group if groups is specified
groups: grouping variable for sampling, equal in length to the number of documents. This will be evaluated in the docvars data.frame, so that docvars may be referred to by name without quoting. This also changes previous behaviours for groups. See news(Version >= "3.0", package = "quanteda") for details.
ties_method: character string specifying how ties are treated. See base::rank() for details. Unlike that function, however, the default is "min", so that frequencies of 10, 10, 11 would be ranked 1, 1, 3.
...: additional arguments passed to dfm_group(). This can be useful in passing force = TRUE, for instance, if you are grouping a dfm that has been weighted.
a data.frame containing the following variables:

feature: (character) the feature
frequency: count of the feature
rank: rank of the feature, where 1 indicates the greatest frequency
docfreq: document frequency of the feature, as a count (the number of documents in which this feature occurred at least once)
group: (only if groups is specified) the label of the group. If the features have been grouped, then all counts, ranks, and document frequencies are within group. If groups is not specified, the group column is omitted from the returned data.frame.

textstat_frequency returns a data.frame of features and their term and document frequencies within groups.
library("quanteda")
+set.seed(20)
+dfmat1 <- dfm(tokens(c("a a b b c d", "a d d d", "a a a")))
+
+textstat_frequency(dfmat1)
+#> feature frequency rank docfreq group
+#> 1 a 6 1 3 all
+#> 2 d 4 2 2 all
+#> 3 b 2 3 1 all
+#> 4 c 1 4 1 all
+textstat_frequency(dfmat1, groups = c("one", "two", "one"), ties_method = "first")
+#> feature frequency rank docfreq group
+#> 1 a 5 1 2 one
+#> 2 b 2 2 1 one
+#> 3 c 1 3 1 one
+#> 4 d 1 4 1 one
+#> 5 d 3 1 1 two
+#> 6 a 1 2 1 two
+textstat_frequency(dfmat1, groups = c("one", "two", "one"), ties_method = "average")
+#> feature frequency rank docfreq group
+#> 1 a 5 1.0 2 one
+#> 2 b 2 2.0 1 one
+#> 3 c 1 3.5 1 one
+#> 4 d 1 3.5 1 one
+#> 5 d 3 1.0 1 two
+#> 6 a 1 2.0 1 two
+
+dfmat2 <- corpus_subset(data_corpus_inaugural, President == "Obama") %>%
+ tokens(remove_punct = TRUE) %>%
+ tokens_remove(stopwords("en")) %>%
+ dfm()
+tstat1 <- textstat_frequency(dfmat2)
+head(tstat1, 10)
+#> feature frequency rank docfreq group
+#> 1 us 44 1 2 all
+#> 2 must 25 2 2 all
+#> 3 can 20 3 2 all
+#> 4 nation 18 4 2 all
+#> 5 people 18 4 2 all
+#> 6 new 17 6 2 all
+#> 7 time 16 7 2 all
+#> 8 every 15 8 2 all
+#> 9 america 14 9 2 all
+#> 10 now 11 10 2 all
+
+dfmat3 <- head(data_corpus_inaugural) %>%
+ tokens(remove_punct = TRUE) %>%
+ tokens_remove(stopwords("en")) %>%
+ dfm()
+textstat_frequency(dfmat3, n = 2, groups = President)
+#> feature frequency rank docfreq group
+#> 1 people 20 1 1 Adams
+#> 2 government 16 2 1 Adams
+#> 3 public 18 1 2 Jefferson
+#> 4 may 18 1 2 Jefferson
+#> 5 public 6 1 1 Madison
+#> 6 nations 6 1 1 Madison
+#> 7 can 9 1 1 Washington
+#> 8 every 9 1 1 Washington
+
+
+if (FALSE) {
+# plot 20 most frequent words
+library("ggplot2")
+ggplot(tstat1[1:20, ], aes(x = reorder(feature, frequency), y = frequency)) +
+ geom_point() +
+ coord_flip() +
+ labs(x = NULL, y = "Frequency")
+
+# plot relative frequencies by group
+dfmat3 <- data_corpus_inaugural %>%
+ corpus_subset(Year > 2000) %>%
+ tokens(remove_punct = TRUE) %>%
+ tokens_remove(stopwords("en")) %>%
+ dfm() %>%
+ dfm_group(groups = President) %>%
+ dfm_weight(scheme = "prop")
+
+# calculate relative frequency by president
+tstat2 <- textstat_frequency(dfmat3, n = 10, groups = President)
+
+# plot frequencies
+ggplot(data = tstat2, aes(x = factor(nrow(tstat2):1), y = frequency)) +
+ geom_point() +
+ facet_wrap(~ group, scales = "free") +
+ coord_flip() +
+ scale_x_discrete(breaks = nrow(tstat2):1,
+ labels = tstat2$feature) +
+ labs(x = NULL, y = "Relative frequency")
+}
+
Calculate "keyness", a score for features that occur differentially across +different categories. Here, the categories are defined by reference to a +"target" document index in the dfm, with the reference group +consisting of all other documents.
x: a dfm containing the features to be examined for keyness
target: the document index (numeric, character or logical) identifying the document forming the "target" for computing keyness; all other documents' feature frequencies will be combined for use as a reference
measure: (signed) association measure to be used for computing keyness. Currently available: "chi2"; "exact" (Fisher's exact test); "lr" for the likelihood ratio; "pmi" for pointwise mutual information. Note that the "exact" test is very computationally intensive and therefore much slower than the other methods.
sort: logical; if TRUE, sort features scored in descending order of the measure, otherwise leave in original feature order
correction: if "default", the Yates correction is applied to "chi2"; the Williams correction is applied to "lr"; and no correction is applied for the "exact" and "pmi" measures. Specifying a value other than the default can be used to override the defaults, for instance to apply the Williams correction to the chi2 measure (see the sketch after this list). Specifying a correction for the "exact" and "pmi" measures has no effect and produces a warning.
...: not used
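As referenced above, a hedged sketch of overriding the default correction for the chi2 measure (the correction value "williams" is assumed from the options described in the correction entry):

+library("quanteda")
+dfmat <- dfm(tokens(data_corpus_inaugural))
+# Williams rather than the default Yates correction for chi2
+head(textstat_keyness(dfmat, target = "2017-Trump", measure = "chi2",
+                      correction = "williams"))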
a data.frame of computed statistics and associated p-values, where each row is named by the feature scored, along with the number of occurrences in both the target and reference groups. For measure = "chi2" this is the chi-squared value, signed positively if the observed value in the target exceeds its expected value; for measure = "exact" this is the estimate of the odds ratio; for measure = "lr" this is the likelihood ratio \(G^2\) statistic; for "pmi" this is the pointwise mutual information statistic.
textstat_keyness
returns a data.frame of features and
+their keyness scores and frequency counts.
Bondi, M. & Scott, M. (eds) (2010). Keyness in +Texts. Amsterdam, Philadelphia: John Benjamins.
+Stubbs, M. (2010). Three Concepts of Keywords. In Keyness in +Texts, Bondi, M. & Scott, M. (eds): 1--42. Amsterdam, Philadelphia: +John Benjamins.
+Scott, M. & Tribble, C. (2006). Textual Patterns: Keyword and Corpus +Analysis in Language Education. Amsterdam: Benjamins: 55.
+Dunning, T. (1993). Accurate Methods for the Statistics of Surprise and Coincidence. Computational +Linguistics, 19(1): 61--74.
+library("quanteda")
+
+# compare pre- v. post-war terms using grouping
+period <- ifelse(docvars(data_corpus_inaugural, "Year") < 1945, "pre-war", "post-war")
+dfmat1 <- tokens(data_corpus_inaugural) %>%
+ dfm() %>%
+ dfm_group(groups = period)
+head(dfmat1) # make sure 'post-war' is in the first row
+#> Document-feature matrix of: 2 documents, 9,437 features (34.79% sparse) and 0 docvars.
+#> features
+#> docs fellow-citizens of the senate and house representatives :
+#> post-war 0 1514 2089 2 1552 3 0 115
+#> pre-war 39 5666 8094 13 3854 8 19 29
+#> features
+#> docs among vicissitudes
+#> post-war 22 0
+#> pre-war 86 5
+#> [ reached max_nfeat ... 9,427 more features ]
+head(tstat1 <- textstat_keyness(dfmat1), 10)
+#> feature chi2 p n_target n_reference
+#> 1 we 764.4484 0 1048 779
+#> 2 . 300.1887 0 2014 3141
+#> 3 us 207.5181 0 289 216
+#> 4 - 205.0408 0 242 157
+#> 5 america 200.5544 0 148 54
+#> 6 : 187.9821 0 115 29
+#> 7 our 183.5362 0 917 1307
+#> 8 world 171.9660 0 196 123
+#> 9 americans 163.1532 0 76 7
+#> 10 today 137.7753 0 84 21
+tail(tstat1, 10)
+#> feature chi2 p n_target n_reference
+#> 9428 upon -58.39810 2.142730e-14 39 332
+#> 9429 public -58.87042 1.687539e-14 12 213
+#> 9430 constitution -59.66134 1.121325e-14 9 200
+#> 9431 it -60.68425 6.661338e-15 266 1132
+#> 9432 states -63.87803 1.332268e-15 29 305
+#> 9433 be -72.68161 0.000000e+00 278 1224
+#> 9434 should -88.14817 0.000000e+00 16 309
+#> 9435 which -177.10087 0.000000e+00 96 911
+#> 9436 of -197.08830 0.000000e+00 1514 5666
+#> 9437 the -331.99069 0.000000e+00 2089 8094
+
+# compare pre- v. post-war terms using logical vector
+dfmat2 <- dfm(tokens(data_corpus_inaugural))
+head(textstat_keyness(dfmat2, docvars(data_corpus_inaugural, "Year") >= 1945), 10)
+#> feature chi2 p n_target n_reference
+#> 1 we 764.4484 0 1048 779
+#> 2 . 300.1887 0 2014 3141
+#> 3 us 207.5181 0 289 216
+#> 4 - 205.0408 0 242 157
+#> 5 america 200.5544 0 148 54
+#> 6 : 187.9821 0 115 29
+#> 7 our 183.5362 0 917 1307
+#> 8 world 171.9660 0 196 123
+#> 9 americans 163.1532 0 76 7
+#> 10 today 137.7753 0 84 21
+
+# compare Trump 2017 to other post-war presidents
+dfmat3 <- dfm(tokens(corpus_subset(data_corpus_inaugural, period == "post-war")))
+head(textstat_keyness(dfmat3, target = "2017-Trump"), 10)
+#> feature chi2 p n_target n_reference
+#> 1 protected 81.83024 0.000000e+00 5 1
+#> 2 while 51.79484 6.161738e-13 6 7
+#> 3 obama 51.05861 8.965051e-13 3 0
+#> 4 we've 51.05861 8.965051e-13 3 0
+#> 5 will 48.03251 4.192091e-12 40 332
+#> 6 everyone 29.76164 4.885651e-08 4 5
+#> 7 your 28.60175 8.890179e-08 11 51
+#> 8 america 27.57968 1.507539e-07 18 130
+#> 9 breath 27.27421 1.765507e-07 2 0
+#> 10 exists 27.27421 1.765507e-07 2 0
+
+# using the likelihood ratio method
+head(textstat_keyness(dfm_smooth(dfmat3), measure = "lr", target = "2017-Trump"), 10)
+#> feature G2 p n_target n_reference
+#> 1 will 22.609878 1.984616e-06 41 351
+#> 2 america 12.306921 4.512817e-04 19 149
+#> 3 your 10.868622 9.780727e-04 12 70
+#> 4 while 9.707425 1.835249e-03 7 26
+#> 5 again 9.345219 2.235679e-03 10 56
+#> 6 protected 8.909125 2.837491e-03 6 20
+#> 7 american 7.996610 4.686501e-03 12 86
+#> 8 back 7.113978 7.648521e-03 7 35
+#> 9 dreams 5.908744 1.506591e-02 6 30
+#> 10 country 5.725811 1.671732e-02 10 77
+
Calculate the lexical diversity of text(s).
+textstat_lexdiv(
+ x,
+ measure = c("TTR", "C", "R", "CTTR", "U", "S", "K", "I", "D", "Vm", "Maas", "MATTR",
+ "MSTTR", "all"),
+ remove_numbers = TRUE,
+ remove_punct = TRUE,
+ remove_symbols = TRUE,
+ remove_hyphens = FALSE,
+ log.base = 10,
+ MATTR_window = 100L,
+ MSTTR_segment = 100L,
+ ...
+)
x: a dfm or tokens input object for whose documents lexical diversity will be computed
measure: a character vector defining the measure to compute
remove_numbers: logical; if TRUE, remove features or tokens that consist only of numerals (the Unicode "Number" [N] class)
remove_punct: logical; if TRUE, remove all features or tokens that consist only of the Unicode "Punctuation" [P] class
remove_symbols: logical; if TRUE, remove all features or tokens that consist only of the Unicode "Symbol" [S] class
remove_hyphens: logical; if TRUE, split words that are connected by hyphenation and hyphenation-like characters in between words, e.g. "self-storage" becomes the two features or tokens "self" and "storage". Default is FALSE, to preserve such words as is, with the hyphens.
log.base: a numeric value defining the base of the logarithm (for measures using logarithms)
MATTR_window: a numeric value defining the size of the moving window for computation of the Moving-Average Type-Token Ratio (Covington & McFall, 2010)
MSTTR_segment: a numeric value defining the size of each segment for the computation of the Mean Segmental Type-Token Ratio (Johnson, 1944)
...: not used directly
A data.frame of documents and their lexical diversity scores.
+textstat_lexdiv
calculates the lexical diversity of documents
+using a variety of indices.
In the following formulas, \(N\) refers to the total number of +tokens, \(V\) to the number of types, and \(f_v(i, N)\) to the numbers +of types occurring \(i\) times in a sample of length \(N\).
"TTR"
:The ordinary Type-Token Ratio: $$TTR = + \frac{V}{N}$$
"C"
:Herdan's C (Herdan, 1960, as cited in Tweedie & +Baayen, 1998; sometimes referred to as LogTTR): $$C = + \frac{\log{V}}{\log{N}}$$
"R"
:Guiraud's Root TTR (Guiraud, 1954, as cited in +Tweedie & Baayen, 1998): $$R = \frac{V}{\sqrt{N}}$$
"CTTR"
:Carroll's Corrected TTR: $$CTTR = + \frac{V}{\sqrt{2N}}$$
"U"
:Dugast's Uber Index (Dugast, 1978, as cited in +Tweedie & Baayen, 1998): $$U = \frac{(\log{N})^2}{\log{N} - \log{V}}$$
"S"
:Summer's index: $$S = + \frac{\log{\log{V}}}{\log{\log{N}}}$$
"K"
:Yule's K (Yule, 1944, as presented in Tweedie & +Baayen, 1998, Eq. 16) is calculated by: $$K = 10^4 \times + \left[ -\frac{1}{N} + \sum_{i=1}^{V} f_v(i, N) \left( \frac{i}{N} \right)^2 \right] $$
"I"
:Yule's I (Yule, 1944) is calculated by: $$I = \frac{V^2}{M_2 - V}$$ +$$M_2 = \sum_{i=1}^{V} i^2 * f_v(i, N)$$
"D"
:Simpson's D (Simpson 1949, as presented in +Tweedie & Baayen, 1998, Eq. 17) is calculated by: +$$D = \sum_{i=1}^{V} f_v(i, N) \frac{i}{N} \frac{i-1}{N-1}$$
"Vm"
:Herdan's \(V_m\) (Herdan 1955, as presented in Tweedie & Baayen, 1998, Eq. 18) is calculated by: $$V_m = \sqrt{ \sum_{i=1}^{V} f_v(i, N) (i/N)^2 - \frac{1}{V} }$$
"Maas"
:Maas' indices (\(a\), \(\log{V_0}\) & \(\log{}_{e}{V_0}\)): $$a^2 = \frac{\log{N} - \log{V}}{(\log{N})^2}$$ $$\log{V_0} = \frac{\log{V}}{\sqrt{1 - \left(\frac{\log{V}}{\log{N}}\right)^2}}$$ The measure was derived from a formula by Mueller (1969, as cited in Maas, 1972). \(\log{}_{e}{V_0}\) is equivalent to \(\log{V_0}\), only with \(e\) as the base for the logarithms. Also calculated are \(a\), \(\log{V_0}\) (both not the same as before) and \(V'\) as measures of relative vocabulary growth while the text progresses. To calculate these measures, the first half of the text and the full text will be examined (see Maas, 1972, p. 67 ff. for details). Note: for the current method (for a dfm) there is no computation on separate halves of the text.
"MATTR"
:The Moving-Average Type-Token Ratio (Covington & +McFall, 2010) calculates TTRs for a moving window of tokens from the first +to the last token, computing a TTR for each window. The MATTR is the mean +of the TTRs of each window.
"MSTTR"
:Mean Segmental Type-Token Ratio (sometimes referred +to as Split TTR) splits the tokens into segments of the given size, +TTR for each segment is calculated and the mean of these values returned. +When this value is < 1.0, it splits the tokens into equal, non-overlapping +sections of that size. When this value is > 1, it defines the segments as +windows of that size. Tokens at the end which do not make a full segment +are ignored.
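A minimal hedged sketch of the segment-size argument (the toy text and segment size are illustrative assumptions, not package defaults):

+library("quanteda")
+# MSTTR with non-overlapping 10-token segments; the trailing partial
+# segment is ignored, per the description above
+tokens("one two three four two one five six two seven eight nine") %>%
+    textstat_lexdiv(measure = "MSTTR", MSTTR_segment = 10)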
Covington, M.A. & McFall, J.D. (2010). Cutting the Gordian Knot: The Moving-Average Type-Token Ratio (MATTR). Journal of Quantitative Linguistics, 17(2), 94--100. doi:10.1080/09296171003643098
+Herdan, G. (1955). A New Derivation and Interpretation of Yule's 'Characteristic' K. Zeitschrift +für angewandte Mathematik und Physik, 6(4): 332--334.
+Maas, H.D. (1972). Über den Zusammenhang zwischen Wortschatzumfang und +Länge eines Textes. Zeitschrift für Literaturwissenschaft und Linguistik, +2(8), 73--96.
+McCarthy, P.M. & Jarvis, S. (2007). vocd: A Theoretical and Empirical +Evaluation. Language Testing, 24(4), 459--488. +doi:10.1177/0265532207080767
+McCarthy, P.M. & Jarvis, S. (2010). MTLD, vocd-D, and HD-D: A Validation Study of Sophisticated Approaches to Lexical Diversity Assessment. +Behaviour Research Methods, 42(2), 381--392.
+Michalke, M. (2014). koRpus: An R Package for Text Analysis (Version +0.05-4). Available from https://reaktanz.de/?c=hacking&s=koRpus.
+Simpson, E.H. (1949). Measurement of Diversity. Nature, 163: 688. +doi:10.1038/163688a0
+Tweedie. F.J. and Baayen, R.H. (1998). How Variable May a Constant Be? +Measures of Lexical Richness in Perspective. Computers and the +Humanities, 32(5), 323--352. doi:10.1023/A:1001749303137
+Yule, G. U. (1944) The Statistical Study of Literary Vocabulary. +Cambridge: Cambridge University Press.
+library("quanteda")
+
+txt <- c("Anyway, like I was sayin', shrimp is the fruit of the sea. You can
+ barbecue it, boil it, broil it, bake it, saute it.",
+ "There's shrimp-kabobs,
+ shrimp creole, shrimp gumbo. Pan fried, deep fried, stir-fried. There's
+ pineapple shrimp, lemon shrimp, coconut shrimp, pepper shrimp, shrimp soup,
+ shrimp stew, shrimp salad, shrimp and potatoes, shrimp burger, shrimp
+ sandwich.")
+tokens(txt) %>%
+ textstat_lexdiv(measure = c("TTR", "CTTR", "K"))
+#> document TTR CTTR K
+#> 1 text1 0.7916667 2.742414 381.9444
+#> 2 text2 0.6060606 2.461830 1248.8522
+dfm(tokens(txt)) %>%
+ textstat_lexdiv(measure = c("TTR", "CTTR", "K"))
+#> document TTR CTTR K
+#> 1 text1 0.7916667 2.742414 381.9444
+#> 2 text2 0.6060606 2.461830 1248.8522
+
+toks <- tokens(corpus_subset(data_corpus_inaugural, Year > 2000))
+textstat_lexdiv(toks, c("CTTR", "TTR", "MATTR"), MATTR_window = 100)
+#> document CTTR TTR MATTR
+#> 1 2001-Bush 10.37904 0.3689198 0.6885984
+#> 2 2005-Bush 11.26505 0.3500724 0.6781998
+#> 3 2009-Obama 12.91628 0.3736402 0.7070275
+#> 4 2013-Obama 11.99681 0.3709369 0.7029654
+#> 5 2017-Trump 10.01461 0.3728344 0.6670238
+#> 6 2021-Biden 10.59754 0.3081150 0.6816012
+
Sparse classes for similarity and distance matrices created by textstat_simil() and textstat_dist().
Print/show method for objects created by textstat_simil
and
+textstat_dist
.
validate_min_simil(object)
+
+# S4 method for textstat_proxy
+show(object)
object: the textstat_proxy object to be printed

Slots:

.Data: a sparse Matrix object, symmetric if selection is NULL
method: the method used for computing similarity or distance
min_simil: numeric; a threshold for the similarity values, below which similarity values are not computed
margin: identifies the margin of the dfm on which similarity or difference was computed: "documents" for documents or "features" for word/term features
type: either "textstat_simil" or "textstat_dist"
selection: target units, if any
This is an underlying function for textstat_dist and textstat_simil, but it returns a TsparseMatrix.
textstat_proxy(
+ x,
+ y = NULL,
+ margin = c("documents", "features"),
+ method = c("cosine", "correlation", "jaccard", "ejaccard", "dice", "edice", "hamann",
+ "simple matching", "euclidean", "chisquared", "hamming", "kullback", "manhattan",
+ "maximum", "canberra", "minkowski"),
+ p = 2,
+ min_proxy = NULL,
+ rank = NULL,
+ use_na = FALSE
+)
x, y: if a dfm object is provided, proximity between documents or features in x and y is computed
margin: identifies the margin of the dfm on which similarity or difference will be computed: "documents" for documents or "features" for word/term features
method: character; the method identifying the similarity or distance measure to be used; see Details
p: the power of the Minkowski distance
min_proxy: the minimum proximity value to be recorded
rank: an integer value specifying the top-n highest proximity values to be recorded
use_na: if TRUE, return NA for proximity to empty vectors. Note that use of NA makes the proximity matrices denser.
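No examples accompany this page, so here is a minimal hedged sketch (toy documents; as.matrix() is for display only):

+library("quanteda")
+dfmat <- dfm(tokens(c(d1 = "a b c d", d2 = "a a b e", d3 = "b e f")))
+# pairwise document cosine proximities, returned as a TsparseMatrix
+prox <- textstat_proxy(dfmat, margin = "documents", method = "cosine")
+as.matrix(prox)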
Calculate the readability of text(s) using one of a variety of computed +indexes.
+textstat_readability(
+ x,
+ measure = "Flesch",
+ remove_hyphens = TRUE,
+ min_sentence_length = 1,
+ max_sentence_length = 10000,
+ intermediate = FALSE,
+ ...
+)
x: a character or corpus object containing the texts
measure: character vector defining the readability measure to calculate. Matches are case-insensitive. See other valid measures under Details.
remove_hyphens: if TRUE, treat constituent words in hyphenated words as separate terms, for purposes of computing word lengths, e.g. "decision-making" as two terms of lengths 8 and 6 characters respectively, rather than as a single word of 15 characters
min_sentence_length, max_sentence_length: set the minimum and maximum sentence lengths (in tokens, excluding punctuation) to include in the computation of readability. This makes it easy to exclude "sentences" that may not really be sentences, such as section titles, table elements, and other cruft that might be in the texts following conversion (see the sketch after this list). For finer-grained control, consider filtering sentences first, including through pattern-matching, using corpus_trim().
intermediate: if TRUE, include intermediate quantities in the output
...: not used
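As referenced in the sentence-length entry above, a hedged sketch of excluding short pseudo-sentences such as headings (the threshold of 3 tokens and the example text are arbitrary assumptions):

+library("quanteda")
+txt <- "SECTION ONE. This sentence is long enough to count toward readability."
+# drop "sentences" of fewer than 3 tokens, e.g. the heading "SECTION ONE."
+textstat_readability(txt, measure = "Flesch", min_sentence_length = 3)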
textstat_readability
returns a data.frame of documents and
+their readability scores.
The following readability formulas have been implemented, where
Nw = \(n_{w}\) = number of words
Nc = \(n_{c}\) = number of characters
Nst = \(n_{st}\) = number of sentences
Nsy = \(n_{sy}\) = number of syllables
Nwf = \(n_{wf}\) = number of words matching the Dale-Chall List +of 3000 "familiar words"
ASL = Average Sentence Length: number of words / number of sentences
AWL = Average Word Length: number of characters / number of words
AFW = Average Familiar Words: count of words matching the Dale-Chall +list of 3000 "familiar words" / number of all words
Nwd = \(n_{wd}\) = number of "difficult" words not matching the +Dale-Chall list of "familiar" words
"ARI"
:Automated Readability Index (Senter and Smith 1967) +$$0.5 ASL + 4.71 AWL - 21.34$$
"ARI.Simple"
:A simplified version of Senter and Smith's (1967) Automated Readability Index. +$$ASL + 9 AWL$$
"Bormuth.MC"
:Bormuth's (1969) Mean Cloze Formula. +$$0.886593 - 0.03640 \times AWL + 0.161911 \times AFW - 0.21401 \times + ASL - 0.000577 \times ASL^2 - 0.000005 \times ASL^3$$
"Bormuth.GP"
:Bormuth's (1969) Grade Placement score.
+$$4.275 + 12.881M - 34.934M^2 + 20.388 M^3 + 26.194 CCS -
+ 2.046 CCS^2 - 11.767 CCS^3 - 42.285(M \times CCS) + 97.620(M \times CCS)^2 -
+ 59.538(M \times CCS)^3$$
+where \(M\) is the Bormuth Mean Cloze Formula as in "Bormuth.MC" above, and \(CCS\) is the Cloze Criterion Score (Bormuth, 1968).
"Coleman"
:Coleman's (1971) Readability Formula 1. +$$1.29 \times \frac{100 \times n_{wsy=1}}{n_{w}} - 38.45$$
+where \(n_{wsy=1}\) = Nwsy1 = the number of one-syllable words. The +scaling by 100 in this and the other Coleman-derived measures arises +because the Coleman measures are calculated on a per 100 words basis.
"Coleman.C2"
:Coleman's (1971) Readability Formula 2. $$1.16 \times \frac{100 \times n_{wsy=1}}{n_{w}} + 1.48 \times \frac{100 \times n_{st}}{n_{w}} - 37.95$$
"Coleman.Liau.ECP"
:Coleman-Liau Estimated Cloze Percent +(ECP) (Coleman and Liau 1975). +$$141.8401 - 0.214590 \times 100 + \times AWL + 1.079812 \times \frac{n_{st} \times 100}{n_{w}}$$
"Coleman.Liau.grade"
:Coleman-Liau Grade Level (Coleman and Liau 1975). $$-27.4004 \times \mathtt{Coleman.Liau.ECP} / 100 + 23.06395$$
"Coleman.Liau.short"
:Coleman-Liau Index (Coleman and Liau 1975). +$$5.88 \times AWL + 29.6 \times \frac{n_{st}}{n_{w}} - 15.8$$
"Dale.Chall"
:The New Dale-Chall Readability formula (Chall +and Dale 1995). +$$64 - (0.95 \times 100 \times \frac{n_{wd}}{n_{w}}) - (0.69 \times ASL)$$
"Dale.Chall.Old"
:The original Dale-Chall Readability formula +(Dale and Chall (1948). +$$0.1579 \times 100 \times \frac{n_{wd}}{n_{w}} + 0.0496 \times ASL [+ 3.6365]$$
+The additional constant 3.6365 is only added if (Nwd / Nw) > 0.05.
"Dale.Chall.PSK"
:The Powers-Sumner-Kearl Variation of the Dale and Chall Readability formula (Powers, Sumner and Kearl, 1958). $$(0.1155 \times 100 \times \frac{n_{wd}}{n_{w}}) + (0.0596 \times ASL) + 3.2672$$
"Danielson.Bryan"
:Danielson-Bryan's (1963) Readability Measure 1. $$ + (1.0364 \times \frac{n_{c}}{n_{blank}}) + + (0.0194 \times \frac{n_{c}}{n_{st}}) - + 0.6059$$
+where \(n_{blank}\) = Nblank = the number of blanks.
"Danielson.Bryan2"
:Danielson-Bryan's (1963) Readability Measure 2. $$131.059 - (10.364 \times \frac{n_{c}}{n_{blank}}) + (0.0194 \times \frac{n_{c}}{n_{st}})$$
+where \(n_{blank}\) = Nblank = the number of blanks.
"Dickes.Steiwer"
:Dickes-Steiwer Index (Dicks and Steiwer 1977). $$ + 235.95993 - (7.3021 \times AWL) - (12.56438 \times ASL) - + (50.03293 \times TTR)$$
+where TTR is the Type-Token Ratio (see textstat_lexdiv()
)
"DRP"
:Degrees of Reading Power. $$(1 - Bormuth.MC) * + 100$$
+where Bormuth.MC refers to Bormuth's (1969) Mean Cloze Formula (documented above)
"ELF"
:Easy Listening Formula (Fang 1966): $$\frac{n_{wsy>=2}}{n_{st}}$$
+where \(n_{wsy>=2}\) = Nwmin2sy = the number of words with 2 syllables or more.
"Farr.Jenkins.Paterson"
:Farr-Jenkins-Paterson's +Simplification of Flesch's Reading Ease Score (Farr, Jenkins and Paterson 1951). $$ + -31.517 - (1.015 \times ASL) + (1.599 \times + \frac{n_{wsy=1}}{n_{w}})$$
+where \(n_{wsy=1}\) = Nwsy1 = the number of one-syllable words.
"Flesch"
:Flesch's Reading Ease Score (Flesch 1948). +$$206.835 - (1.015 \times ASL) - (84.6 \times \frac{n_{sy}}{n_{w}})$$
"Flesch.PSK"
:The Powers-Sumner-Kearl's Variation of Flesch Reading Ease Score +(Powers, Sumner and Kearl, 1958). $$ (0.0778 \times + ASL) + (4.55 \times \frac{n_{sy}}{n_{w}}) - + 2.2029$$
"Flesch.Kincaid"
:Flesch-Kincaid Readability Score (Flesch and Kincaid 1975). $$ + 0.39 \times ASL + 11.8 \times \frac{n_{sy}}{n_{w}} - + 15.59$$
"FOG"
:Gunning's Fog Index (Gunning 1952). $$0.4 + \times (ASL + 100 \times \frac{n_{wsy>=3}}{n_{w}})$$
+where \(n_{wsy>=3}\) = Nwmin3sy = the number of words with 3 syllables or more.
+The scaling by 100 arises because the original FOG index is based on
+just a sample of 100 words.
"FOG.PSK"
:The Powers-Sumner-Kearl Variation of Gunning's +Fog Index (Powers, Sumner and Kearl, 1958). $$3.0680 \times + (0.0877 \times ASL) +(0.0984 \times 100 \times \frac{n_{wsy>=3}}{n_{w}})$$
+where \(n_{wsy>=3}\) = Nwmin3sy = the number of words with 3 syllables or more.
+The scaling by 100 arises because the original FOG index is based on
+just a sample of 100 words.
"FOG.NRI"
:The Navy's Adaptation of Gunning's Fog Index (Kincaid, Fishburne, Rogers and Chissom 1975). $$\left( \frac{n_{wsy<3} + 3 \times n_{wsy=3}}{100 \times \frac{n_{st}}{n_{w}}} - 3 \right) / 2$$
+where \(n_{wsy<3}\) = Nwless3sy = the number of words with less than 3 syllables, and \(n_{wsy=3}\) = Nw3sy = the number of 3-syllable words. The scaling by 100 arises because the original FOG index is based on just a sample of 100 words.
"FORCAST"
:FORCAST (Simplified Version of FORCAST.RGL) (Caylor and Sticht 1973). $$20 - \frac{n_{wsy=1} \times 150}{n_{w} \times 10}$$
+where \(n_{wsy=1}\) = Nwsy1 = the number of one-syllable words. The scaling by 150 +arises because the original FORCAST index is based on just a sample of 150 words.
"FORCAST.RGL"
:FORCAST.RGL (Caylor and Sticht 1973). $$20.43 - 0.11 \times \frac{n_{wsy=1} \times 150}{n_{w} \times 10}$$
+where \(n_{wsy=1}\) = Nwsy1 = the number of one-syllable words. The scaling by 150 arises +because the original FORCAST index is based on just a sample of 150 words.
"Fucks"
:Fucks' (1955) Stilcharakteristik (Style Characteristic). $$AWL \times ASL$$
"Linsear.Write"
:Linsear Write (Klare 1975). +$$\frac{[(100 - (\frac{100 \times n_{wsy<3}}{n_{w}})) + + (3 \times \frac{100 \times n_{wsy>=3}}{n_{w}})]}{(100 \times + \frac{n_{st}}{n_{w}})}$$
+where \(n_{wsy<3}\) = Nwless3sy = the number of words with less than 3 syllables, and
+\(n_{wsy>=3}\) = Nwmin3sy = the number of words with 3 syllables or more. The scaling
+by 100 arises because the original Linsear.Write measure is based on just a sample of 100 words.
"LIW"
:Björnsson's (1968) Läsbarhetsindex (For Swedish +Texts). $$ASL + \frac{100 \times n_{wsy>=7}}{n_{w}}$$
+where \(n_{wsy>=7}\) = Nwmin7sy = the number of words with 7 syllables or more. The scaling
+by 100 arises because the Läsbarhetsindex is based on just a sample of 100 words.
"nWS"
:Neue Wiener Sachtextformeln 1 (Bamberger and Vanecek 1984). $$19.35 \times \frac{n_{wsy>=3}}{n_{w}} + 0.1672 \times ASL + 12.97 \times \frac{n_{wchar>=6}}{n_{w}} - 3.27 \times \frac{n_{wsy=1}}{n_{w}} - 0.875$$
+where \(n_{wsy>=3}\) = Nwmin3sy = the number of words with 3 syllables or more, +\(n_{wchar>=6}\) = Nwmin6char = the number of words with 6 characters or more, and +\(n_{wsy=1}\) = Nwsy1 = the number of one-syllable words.
"nWS.2"
:Neue Wiener Sachtextformeln 2 (Bamberger and +Vanecek 1984). $$20.07 \times \frac{n_{wsy>=3}}{n_{w}} + 0.1682 \times ASL + + 13.73 \times \frac{n_{wchar>=6}}{n_{w}} - 2.779$$
+where \(n_{wsy>=3}\) = Nwmin3sy = the number of words with 3 syllables or more, and +\(n_{wchar>=6}\) = Nwmin6char = the number of words with 6 characters or more.
"nWS.3"
:Neue Wiener Sachtextformeln 3 (Bamberger and +Vanecek 1984). $$29.63 \times \frac{n_{wsy>=3}}{n_{w}} + 0.1905 \times + ASL - 1.1144$$
+where \(n_{wsy>=3}\) = Nwmin3sy = the number of words with 3 syllables or more.
"nWS.4"
:Neue Wiener Sachtextformeln 4 (Bamberger and +Vanecek 1984). $$27.44 \times \frac{n_{wsy>=3}}{n_{w}} + 0.2656 \times + ASL - 1.693$$
+where \(n_{wsy>=3}\) = Nwmin3sy = the number of words with 3 syllables or more.
"RIX"
:Anderson's (1983) Readability Index. $$ + \frac{n_{wsy>=7}}{n_{st}}$$
+where \(n_{wsy>=7}\) = Nwmin7sy = the number of words with 7-syllables or more.
"Scrabble"
:Scrabble Measure. $$Mean + Scrabble Letter Values of All Words$$. +Scrabble values are for English. There is no reference for this, as we +created it experimentally. It's not part of any accepted readability +index!
"SMOG"
:Simple Measure of Gobbledygook (SMOG) (McLaughlin 1969). $$1.043 \times \sqrt{n_{wsy>=3} \times \frac{30}{n_{st}}} + 3.1291$$
+where \(n_{wsy>=3}\) = Nwmin3sy = the number of words with 3 syllables or more. +This measure is regression equation D in McLaughlin's original paper.
"SMOG.C"
:SMOG (Regression Equation C) (McLaughlin 1969). $$0.9986 \times \sqrt{n_{wsy>=3} \times \frac{30}{n_{st}} + 5} + 2.8795$$
+where \(n_{wsy>=3}\) = Nwmin3sy = the number of words with 3 syllables or more. +This measure is regression equation C in McLaughlin's original paper.
"SMOG.simple"
:Simplified Version of McLaughlin's (1969) SMOG Measure. $$\sqrt{n_{wsy>=3} \times \frac{30}{n_{st}}} + 3$$
"SMOG.de"
:Adaptation of McLaughlin's (1969) SMOG Measure for German Texts. $$\sqrt{n_{wsy>=3} \times \frac{30}{n_{st}} - 2}$$
"Spache"
:Spache's (1952) Readability Measure. $$ 0.121 \times + ASL + 0.082 \times \frac{n_{wnotinspache}}{n_{w}} + + 0.659$$
+where \(n_{wnotinspache}\) = Nwnotinspache = number of unique words not in the Spache word list.
"Spache.old"
:Spache's (1952) Readability Measure (Old). $$0.141 + \times ASL + 0.086 \times \frac{n_{wnotinspache}}{n_{w}} + + 0.839$$
+where \(n_{wnotinspache}\) = Nwnotinspache = number of unique words not in the Spache word list.
"Strain"
:Strain Index (Solomon 2006). $$\frac{n_{sy}}{n_{st} / 3} / 10$$
+The scaling by 3 arises because the original Strain index is based on just the first 3 sentences.
"Traenkle.Bailer"
:Tränkle & Bailer's (1984) Readability Measure 1. $$224.6814 - (79.8304 \times AWL) - (12.24032 \times ASL) - (1.292857 \times 100 \times \frac{n_{prep}}{n_{w}})$$
+where \(n_{prep}\) = Nprep = the number of prepositions. The scaling by 100 arises because the original +Tränkle & Bailer index is based on just a sample of 100 words.
"Traenkle.Bailer2"
:Tränkle & Bailer's (1984) Readability Measure 2. $$234.1063 - (96.11069 \times AWL) - (2.05444 \times 100 \times \frac{n_{prep}}{n_{w}}) - (1.02805 \times 100 \times \frac{n_{conj}}{n_{w}})$$
+where \(n_{prep}\) = Nprep = the number of prepositions, and \(n_{conj}\) = Nconj = the number of conjunctions. The scaling by 100 arises because the original Tränkle & Bailer index is based on just a sample of 100 words.
"Wheeler.Smith"
:Wheeler & Smith's (1954) Readability Measure. $$ASL \times 10 \times \frac{n_{wsy>=2}}{n_{w}}$$
+where \(n_{wsy>=2}\) = Nwmin2sy = the number of words with 2 syllables or more.
"meanSentenceLength"
:Average Sentence Length (ASL). +$$\frac{n_{w}}{n_{st}}$$
"meanWordSyllables"
:Average Word Syllables (AWL). +$$\frac{n_{sy}}{n_{w}}$$
Anderson, J. (1983). Lix and rix: Variations on a little-known readability
+index. Journal of Reading, 26(6),
+490--496. https://www.jstor.org/stable/40031755
Bamberger, R. & Vanecek, E. (1984). Lesen-Verstehen-Lernen-Schreiben. +Wien: Jugend und Volk.
+Björnsson, C. H. (1968). Läsbarhet. Stockholm: Liber.
+Bormuth, J.R. (1969). Development of Readability Analysis.
+Bormuth, J.R. (1968). Cloze test readability: Criterion reference
+scores. Journal of educational
+measurement, 5(3), 189--196. https://www.jstor.org/stable/1433978
Caylor, J.S. (1973). Methodologies for Determining Reading Requirements of
+Military Occupational Specialities. https://eric.ed.gov/?id=ED074343
Caylor, J.S. & Sticht, T.G. (1973). Development of a Simple Readability
+Index for Job Reading Material
+https://archive.org/details/ERIC_ED076707
Coleman, E.B. (1971). Developing a technology of written instruction: Some +determiners of the complexity of prose. Verbal learning research and the +technology of written instruction, 155--204.
+Coleman, M. & Liau, T.L. (1975). A Computer Readability Formula Designed +for Machine Scoring. Journal of Applied Psychology, 60(2), 283. +doi:10.1037/h0076540
+Dale, E. and Chall, J.S. (1948). A Formula for Predicting Readability:
+Instructions. Educational Research
+Bulletin, 37-54. https://www.jstor.org/stable/1473169
Chall, J.S. and Dale, E. (1995). Readability Revisited: The New Dale-Chall +Readability Formula. Brookline Books.
+Dickes, P. & Steiwer, L. (1977). Ausarbeitung von Lesbarkeitsformeln für +die Deutsche Sprache. Zeitschrift für Entwicklungspsychologie und +Pädagogische Psychologie 9(1), 20--28.
+Danielson, W.A., & Bryan, S.D. (1963). Computer Automation of Two +Readability +Formulas. +Journalism Quarterly, 40(2), 201--206. doi:10.1177/107769906304000207
+DuBay, W.H. (2004). The Principles of Readability.
+Fang, I. E. (1966). The "Easy listening formula". Journal of Broadcasting +& Electronic Media, 11(1), 63--68. doi:10.1080/08838156609363529
+Farr, J. N., Jenkins, J.J., & Paterson, D.G. (1951). Simplification of +Flesch Reading Ease Formula. Journal of Applied Psychology, 35(5): 333. +doi:10.1037/h0057532
+Flesch, R. (1948). A New Readability Yardstick. Journal of Applied +Psychology, 32(3), 221. doi:10.1037/h0057532
+Fucks, W. (1955). Der Unterschied des Prosastils von Dichtern und anderen +Schriftstellern. Sprachforum, 1, 233-244.
+Gunning, R. (1952). The Technique of Clear Writing. New York: +McGraw-Hill.
+Klare, G.R. (1975). Assessing Readability. Reading Research Quarterly, +10(1), 62-102. doi:10.2307/747086
+Kincaid, J. P., Fishburne Jr, R.P., Rogers, R.L., & Chissom, B.S. (1975). +Derivation of New Readability Formulas (Automated Readability Index, FOG count and Flesch Reading Ease Formula) for Navy Enlisted Personnel.
+McLaughlin, G.H. (1969). SMOG Grading: A New Readability Formula. +Journal of Reading, 12(8), 639-646.
+Michalke, M. (2014). koRpus: An R Package for Text Analysis (Version 0.05-4). +Available from https://reaktanz.de/?c=hacking&s=koRpus.
+Powers, R.D., Sumner, W.A., and Kearl, B.E. (1958). A Recalculation of +Four Adult Readability Formulas. Journal of Educational Psychology, +49(2), 99. doi:10.1037/h0043254
+Senter, R. J., & Smith, E. A. (1967). Automated readability index. +Wright-Patterson Air Force Base. Report No. AMRL-TR-6620.
+*Solomon, N. W. (2006). Qualitative Analysis of Media Language. India.
+Spache, G. (1953). "A new readability formula for primary-grade reading
+materials." The Elementary School Journal, 53, 410--413.
+https://www.jstor.org/stable/998915
Tränkle, U. & Bailer, H. (1984). Kreuzvalidierung und Neuberechnung von +Lesbarkeitsformeln für die deutsche Sprache. Zeitschrift für +Entwicklungspsychologie und Pädagogische Psychologie, 16(3), 231--244.
+Wheeler, L.R. & Smith, E.H. (1954). A Practical Readability Formula for the
+Classroom Teacher in the Primary Grades. Elementary English, 31,
+397--399. https://www.jstor.org/stable/41384251
*Nimaldasan is the pen name of N. Watson Solomon, Assistant Professor of +Journalism, School of Media Studies, SRM University, India.
+txt <- c(doc1 = "Readability zero one. Ten, Eleven.",
+ doc2 = "The cat in a dilapidated tophat.")
+textstat_readability(txt, measure = "Flesch")
+#> document Flesch
+#> 1 doc1 1.2575
+#> 2 doc2 45.6450
+textstat_readability(txt, measure = c("FOG", "FOG.PSK", "FOG.NRI"))
+#> document FOG FOG.PSK FOG.NRI
+#> 1 doc1 17.000000 4.608659 -1.3875
+#> 2 doc2 9.066667 3.254382 -1.2600
+
+textstat_readability(quanteda::data_corpus_inaugural[48:58],
+ measure = c("Flesch.Kincaid", "Dale.Chall.old"))
+#> document Flesch.Kincaid Dale.Chall.old
+#> 1 1977-Carter 11.670742 8.218925
+#> 2 1981-Reagan 9.755604 7.580752
+#> 3 1985-Reagan 10.420294 7.430830
+#> 4 1989-Bush 7.147029 6.584037
+#> 5 1993-Clinton 10.381579 7.340028
+#> 6 1997-Clinton 9.828863 7.388557
+#> 7 2001-Bush 8.933091 7.216451
+#> 8 2005-Bush 11.041969 7.622865
+#> 9 2009-Obama 10.234345 7.456305
+#> 10 2013-Obama 11.734767 7.845061
+#> 11 2017-Trump 9.171244 6.777431
+
Users can subset output object of textstat_collocations
,
+textstat_keyness
or textstat_frequency
based on
+"glob"
, "regex"
or "fixed"
patterns using this method.
x: a textstat object
selection: whether to "keep" or "remove" the rows that match the pattern
valuetype: the type of pattern matching: "glob" for "glob"-style wildcard expressions; "regex" for regular expressions; or "fixed" for exact matching. See valuetype for details.
case_insensitive: logical; if TRUE, ignore case when matching a pattern or dictionary values
library("quanteda")
+
+period <- ifelse(docvars(data_corpus_inaugural, "Year") < 1945, "pre-war", "post-war")
+dfmat <- tokens(data_corpus_inaugural) %>%
+ dfm() %>%
+ dfm_group(groups = period)
+tstat <- textstat_keyness(dfmat)
+textstat_select(tstat, 'america*')
+#> feature chi2 p n_target n_reference
+#> 5 america 200.5543560 0.000000e+00 148 54
+#> 9 americans 163.1532091 0.000000e+00 76 7
+#> 17 america's 93.4124870 0.000000e+00 37 0
+#> 86 american 24.4057333 7.803611e-07 78 94
+#> 1127 americas 0.6901897 4.060998e-01 2 1
+#> 1698 american's 0.2300602 6.314792e-01 1 0
+#> 5393 americanism -0.3961920 5.290624e-01 0 1
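A further hedged sketch using regular-expression matching rather than the glob default (the pattern is an illustrative assumption):

+# keep only the rows whose feature ends in "s"
+textstat_select(tstat, "america.*s$", valuetype = "regex")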
+
+
These functions compute matrices of distances and similarities between
+documents or features from a dfm() and return a matrix of
+similarities or distances in a sparse format. These methods are fast
+and robust because they operate directly on the sparse dfm objects.
+The output can easily be coerced to an ordinary matrix, a data.frame of
+pairwise comparisons, or a dist object.
textstat_simil(
+ x,
+ y = NULL,
+ selection = NULL,
+ margin = c("documents", "features"),
+ method = c("correlation", "cosine", "jaccard", "ejaccard", "dice", "edice", "hamann",
+ "simple matching"),
+ min_simil = NULL,
+ ...
+)
+
+textstat_dist(
+ x,
+ y = NULL,
+ selection = NULL,
+ margin = c("documents", "features"),
+ method = c("euclidean", "manhattan", "maximum", "canberra", "minkowski"),
+ p = 2,
+ ...
+)
x, y: dfm objects; y is an optional target matrix matching x in the margin on which the similarity or distance will be computed
selection: (deprecated - use y instead)
margin: identifies the margin of the dfm on which similarity or difference will be computed: "documents" for documents or "features" for word/term features
method: character; the method identifying the similarity or distance measure to be used; see Details
min_simil: numeric; a threshold for the similarity values, below which similarity values will not be returned
...: unused
p: the power of the Minkowski distance
A sparse matrix from the Matrix package that will be symmetric
+unless y
is specified.
textstat_simil
options are: "correlation"
(default),
+"cosine"
, "jaccard"
, "ejaccard"
, "dice"
,
+"edice"
, "simple matching"
, and "hamann"
.
textstat_dist
options are: "euclidean"
(default),
+"manhattan"
, "maximum"
, "canberra"
,
+and "minkowski"
.
If you want to compute similarity on a "normalized" dfm object
+(controlling for variable document lengths, for methods such as correlation
+for which different document lengths matter), then wrap the input dfm in
+dfm_weight(x, "prop").
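A minimal sketch of that advice, assuming the inaugural corpus used throughout this page:

+library("quanteda")
+dfmat_norm <- tokens(data_corpus_inaugural) %>%
+    dfm() %>%
+    dfm_weight(scheme = "prop")
+# correlation on the length-normalized dfm
+textstat_simil(dfmat_norm, method = "correlation", margin = "documents")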
The output objects from textstat_simil()
and textstat_dist()
can be
+transformed easily into a list format using
+as.list()
, which returns a list for each unique
+element of the second of the pairs, a data.frame using
+as.data.frame()
, which returns pairwise
+scores, as.dist()
for a dist object,
+or as.matrix()
to convert it into an ordinary matrix.
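The p argument applies only to the Minkowski method; a hedged sketch on a toy dfm (the documents are illustrative assumptions):

+library("quanteda")
+dfmat_toy <- dfm(tokens(c(d1 = "a b c c", d2 = "a a b d")))
+# Minkowski distance with p = 3 (p = 2 is equivalent to Euclidean)
+textstat_dist(dfmat_toy, method = "minkowski", p = 3)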
# similarities for documents
+library("quanteda")
+dfmat <- corpus_subset(data_corpus_inaugural, Year > 2000) %>%
+ tokens(remove_punct = TRUE) %>%
+ tokens_remove(stopwords("english")) %>%
+ dfm()
+(tstat1 <- textstat_simil(dfmat, method = "cosine", margin = "documents"))
+#> textstat_simil object; method = "cosine"
+#> 2001-Bush 2005-Bush 2009-Obama 2013-Obama 2017-Trump 2021-Biden
+#> 2001-Bush 1.000 0.520 0.541 0.556 0.452 0.562
+#> 2005-Bush 0.520 1.000 0.458 0.516 0.435 0.480
+#> 2009-Obama 0.541 0.458 1.000 0.637 0.448 0.616
+#> 2013-Obama 0.556 0.516 0.637 1.000 0.455 0.606
+#> 2017-Trump 0.452 0.435 0.448 0.455 1.000 0.513
+#> 2021-Biden 0.562 0.480 0.616 0.606 0.513 1.000
+as.matrix(tstat1)
+#> 2001-Bush 2005-Bush 2009-Obama 2013-Obama 2017-Trump 2021-Biden
+#> 2001-Bush 1.0000000 0.5204355 0.5411649 0.5561972 0.4518935 0.5619136
+#> 2005-Bush 0.5204355 1.0000000 0.4575297 0.5163644 0.4349030 0.4797651
+#> 2009-Obama 0.5411649 0.4575297 1.0000000 0.6373318 0.4481950 0.6158540
+#> 2013-Obama 0.5561972 0.5163644 0.6373318 1.0000000 0.4546945 0.6061256
+#> 2017-Trump 0.4518935 0.4349030 0.4481950 0.4546945 1.0000000 0.5133378
+#> 2021-Biden 0.5619136 0.4797651 0.6158540 0.6061256 0.5133378 1.0000000
+as.list(tstat1)
+#> $`2001-Bush`
+#> 2021-Biden 2013-Obama 2009-Obama 2005-Bush 2017-Trump
+#> 0.5619136 0.5561972 0.5411649 0.5204355 0.4518935
+#>
+#> $`2005-Bush`
+#> 2001-Bush 2013-Obama 2021-Biden 2009-Obama 2017-Trump
+#> 0.5204355 0.5163644 0.4797651 0.4575297 0.4349030
+#>
+#> $`2009-Obama`
+#> 2013-Obama 2021-Biden 2001-Bush 2005-Bush 2017-Trump
+#> 0.6373318 0.6158540 0.5411649 0.4575297 0.4481950
+#>
+#> $`2013-Obama`
+#> 2009-Obama 2021-Biden 2001-Bush 2005-Bush 2017-Trump
+#> 0.6373318 0.6061256 0.5561972 0.5163644 0.4546945
+#>
+#> $`2017-Trump`
+#> 2021-Biden 2013-Obama 2001-Bush 2009-Obama 2005-Bush
+#> 0.5133378 0.4546945 0.4518935 0.4481950 0.4349030
+#>
+#> $`2021-Biden`
+#> 2009-Obama 2013-Obama 2001-Bush 2017-Trump 2005-Bush
+#> 0.6158540 0.6061256 0.5619136 0.5133378 0.4797651
+#>
+as.list(tstat1, diag = TRUE)
+#> $`2001-Bush`
+#> 2001-Bush 2021-Biden 2013-Obama 2009-Obama 2005-Bush 2017-Trump
+#> 1.0000000 0.5619136 0.5561972 0.5411649 0.5204355 0.4518935
+#>
+#> $`2005-Bush`
+#> 2005-Bush 2001-Bush 2013-Obama 2021-Biden 2009-Obama 2017-Trump
+#> 1.0000000 0.5204355 0.5163644 0.4797651 0.4575297 0.4349030
+#>
+#> $`2009-Obama`
+#> 2009-Obama 2013-Obama 2021-Biden 2001-Bush 2005-Bush 2017-Trump
+#> 1.0000000 0.6373318 0.6158540 0.5411649 0.4575297 0.4481950
+#>
+#> $`2013-Obama`
+#> 2013-Obama 2009-Obama 2021-Biden 2001-Bush 2005-Bush 2017-Trump
+#> 1.0000000 0.6373318 0.6061256 0.5561972 0.5163644 0.4546945
+#>
+#> $`2017-Trump`
+#> 2017-Trump 2021-Biden 2013-Obama 2001-Bush 2009-Obama 2005-Bush
+#> 1.0000000 0.5133378 0.4546945 0.4518935 0.4481950 0.4349030
+#>
+#> $`2021-Biden`
+#> 2021-Biden 2009-Obama 2013-Obama 2001-Bush 2017-Trump 2005-Bush
+#> 1.0000000 0.6158540 0.6061256 0.5619136 0.5133378 0.4797651
+#>
+
+# min_simil
+(tstat2 <- textstat_simil(dfmat, method = "cosine", margin = "documents", min_simil = 0.6))
+#> textstat_simil object; method = "cosine"
+#> 2001-Bush 2005-Bush 2009-Obama 2013-Obama 2017-Trump 2021-Biden
+#> 2001-Bush 1 . . . . .
+#> 2005-Bush . 1 . . . .
+#> 2009-Obama . . 1.000 0.637 . 0.616
+#> 2013-Obama . . 0.637 1.000 . 0.606
+#> 2017-Trump . . . . 1 .
+#> 2021-Biden . . 0.616 0.606 . 1.000
+as.matrix(tstat2)
+#> 2001-Bush 2005-Bush 2009-Obama 2013-Obama 2017-Trump 2021-Biden
+#> 2001-Bush 1 NA NA NA NA NA
+#> 2005-Bush NA 1 NA NA NA NA
+#> 2009-Obama NA NA 1.0000000 0.6373318 NA 0.6158540
+#> 2013-Obama NA NA 0.6373318 1.0000000 NA 0.6061256
+#> 2017-Trump NA NA NA NA 1 NA
+#> 2021-Biden NA NA 0.6158540 0.6061256 NA 1.0000000
+
+# similarities for specific documents
+textstat_simil(dfmat, dfmat["2017-Trump", ], margin = "documents")
+#> textstat_simil object; method = "correlation"
+#> 2017-Trump
+#> 2001-Bush 0.375
+#> 2005-Bush 0.355
+#> 2009-Obama 0.356
+#> 2013-Obama 0.373
+#> 2017-Trump 1.000
+#> 2021-Biden 0.449
+textstat_simil(dfmat, dfmat["2017-Trump", ], method = "cosine", margin = "documents")
+#> textstat_simil object; method = "cosine"
+#> 2017-Trump
+#> 2001-Bush 0.452
+#> 2005-Bush 0.435
+#> 2009-Obama 0.448
+#> 2013-Obama 0.455
+#> 2017-Trump 1.000
+#> 2021-Biden 0.513
+textstat_simil(dfmat, dfmat[c("2009-Obama", "2013-Obama"), ], margin = "documents")
+#> textstat_simil object; method = "correlation"
+#> 2009-Obama 2013-Obama
+#> 2001-Bush 0.452 0.479
+#> 2005-Bush 0.352 0.432
+#> 2009-Obama 1.000 0.561
+#> 2013-Obama 0.561 1.000
+#> 2017-Trump 0.356 0.373
+#> 2021-Biden 0.548 0.543
+
+# compute some term similarities
+tstat3 <- textstat_simil(dfmat, dfmat[, c("fair", "health", "terror")], method = "cosine",
+ margin = "features")
+head(as.matrix(tstat3), 10)
+#> fair health terror
+#> president 0.3396831 0.6240377 0.09805807
+#> clinton 0.4714045 0.4330127 0.00000000
+#> distinguished 0.5773503 0.7071068 0.00000000
+#> guests 0.5773503 0.7071068 0.00000000
+#> fellow 0.4256283 0.7298004 0.14744196
+#> citizens 0.7064173 0.6488857 0.07647191
+#> peaceful 0.5163978 0.6324555 0.00000000
+#> transfer 0.3333333 0.4082483 0.00000000
+#> authority 0.8164966 0.5000000 0.00000000
+#> rare 0.4082483 0.5000000 0.00000000
+as.list(tstat3, n = 6)
+#> $fair
+#> continue chance differences dangers choose charity
+#> 1 1 1 1 1 1
+#>
+#> $health
+#> generations without work common fathers nation
+#> 0.9733285 0.9594032 0.9527861 0.9486833 0.9486833 0.9282422
+#>
+#> $terror
+#> bestowed sacrifices ancestors generosity cooperation forty-four
+#> 1 1 1 1 1 1
+#>
+
+
+# distances for documents
+(tstat4 <- textstat_dist(dfmat, margin = "documents"))
+#> textstat_dist object; method = "euclidean"
+#> 2001-Bush 2005-Bush 2009-Obama 2013-Obama 2017-Trump 2021-Biden
+#> 2001-Bush 0 52.8 49.9 48.3 47.6 57.2
+#> 2005-Bush 52.8 0 60.8 56.9 57.4 66.0
+#> 2009-Obama 49.9 60.8 0 48.0 54.9 56.1
+#> 2013-Obama 48.3 56.9 48.0 0 53.7 56.5
+#> 2017-Trump 47.6 57.4 54.9 53.7 0 60.0
+#> 2021-Biden 57.2 66.0 56.1 56.5 60.0 0
+as.matrix(tstat4)
+#> 2001-Bush 2005-Bush 2009-Obama 2013-Obama 2017-Trump 2021-Biden
+#> 2001-Bush 0.00000 52.84884 49.94997 48.31149 47.61302 57.22762
+#> 2005-Bush 52.84884 0.00000 60.84406 56.85948 57.41080 66.00000
+#> 2009-Obama 49.94997 60.84406 0.00000 47.98958 54.91812 56.12486
+#> 2013-Obama 48.31149 56.85948 47.98958 0.00000 53.73081 56.45352
+#> 2017-Trump 47.61302 57.41080 54.91812 53.73081 0.00000 59.98333
+#> 2021-Biden 57.22762 66.00000 56.12486 56.45352 59.98333 0.00000
+as.list(tstat4)
+#> $`2001-Bush`
+#> 2021-Biden 2005-Bush 2009-Obama 2013-Obama 2017-Trump
+#> 57.22762 52.84884 49.94997 48.31149 47.61302
+#>
+#> $`2005-Bush`
+#> 2021-Biden 2009-Obama 2017-Trump 2013-Obama 2001-Bush
+#> 66.00000 60.84406 57.41080 56.85948 52.84884
+#>
+#> $`2009-Obama`
+#> 2005-Bush 2021-Biden 2017-Trump 2001-Bush 2013-Obama
+#> 60.84406 56.12486 54.91812 49.94997 47.98958
+#>
+#> $`2013-Obama`
+#> 2005-Bush 2021-Biden 2017-Trump 2001-Bush 2009-Obama
+#> 56.85948 56.45352 53.73081 48.31149 47.98958
+#>
+#> $`2017-Trump`
+#> 2021-Biden 2005-Bush 2009-Obama 2013-Obama 2001-Bush
+#> 59.98333 57.41080 54.91812 53.73081 47.61302
+#>
+#> $`2021-Biden`
+#> 2005-Bush 2017-Trump 2001-Bush 2013-Obama 2009-Obama
+#> 66.00000 59.98333 57.22762 56.45352 56.12486
+#>
+as.dist(tstat4)
+#> 2001-Bush 2005-Bush 2009-Obama 2013-Obama 2017-Trump
+#> 2005-Bush 52.84884
+#> 2009-Obama 49.94997 60.84406
+#> 2013-Obama 48.31149 56.85948 47.98958
+#> 2017-Trump 47.61302 57.41080 54.91812 53.73081
+#> 2021-Biden 57.22762 66.00000 56.12486 56.45352 59.98333
+
+# distances for specific documents
+textstat_dist(dfmat, dfmat["2017-Trump", ], margin = "documents")
+#> textstat_dist object; method = "euclidean"
+#> 2017-Trump
+#> 2001-Bush 47.6
+#> 2005-Bush 57.4
+#> 2009-Obama 54.9
+#> 2013-Obama 53.7
+#> 2017-Trump 0
+#> 2021-Biden 60.0
+(tstat5 <- textstat_dist(dfmat, dfmat[c("2009-Obama" , "2013-Obama"), ], margin = "documents"))
+#> textstat_dist object; method = "euclidean"
+#> 2009-Obama 2013-Obama
+#> 2001-Bush 49.9 48.3
+#> 2005-Bush 60.8 56.9
+#> 2009-Obama 0 48.0
+#> 2013-Obama 48.0 0
+#> 2017-Trump 54.9 53.7
+#> 2021-Biden 56.1 56.5
+as.matrix(tstat5)
+#> 2009-Obama 2013-Obama
+#> 2001-Bush 49.94997 48.31149
+#> 2005-Bush 60.84406 56.85948
+#> 2009-Obama 0.00000 47.98958
+#> 2013-Obama 47.98958 0.00000
+#> 2017-Trump 54.91812 53.73081
+#> 2021-Biden 56.12486 56.45352
+as.list(tstat5)
+#> $`2009-Obama`
+#> 2005-Bush 2021-Biden 2017-Trump 2001-Bush 2013-Obama
+#> 60.84406 56.12486 54.91812 49.94997 47.98958
+#>
+#> $`2013-Obama`
+#> 2005-Bush 2021-Biden 2017-Trump 2001-Bush 2009-Obama
+#> 56.85948 56.45352 53.73081 48.31149 47.98958
+#>
+
+if (FALSE) {
+# plot a dendrogram after converting the object into distances
+plot(hclust(as.dist(tstat4)))
+}
+
Count syntactic and lexical features of documents such as tokens, types, +sentences, and character categories.
+textstat_summary(x, ...)
x: corpus to be summarized
...: additional arguments passed through to dfm()
Count the total number of characters, tokens and sentences as well as special +tokens such as numbers, punctuation marks, symbols, tags and emojis.
chars = number of characters; equal to nchar()
sents = number of sentences; equal to ntoken(tokens(x), what = "sentence")
tokens = number of tokens; equal to ntoken()
types = number of unique tokens; equal to ntype()
puncts = number of punctuation marks (^\p{P}+$)
numbers = number of numeric tokens (^\p{Sc}{0,1}\p{N}+([.,]*\p{N})*\p{Sc}{0,1}$)
symbols = number of symbols (^\p{S}$)
tags = number of tags; sum of pattern_username and pattern_hashtag in quanteda::quanteda_options()
emojis = number of emojis (^\p{Emoji_Presentation}+$)
if (Sys.info()["sysname"] != "SunOS") {
+library("quanteda")
+corp <- data_corpus_inaugural[1:5]
+textstat_summary(corp)
+toks <- tokens(corp)
+textstat_summary(toks)
+dfmat <- dfm(toks)
+textstat_summary(dfmat)
+}
+#> document chars sents tokens types puncts numbers symbols urls tags
+#> 1 1789-Washington NA NA 1537 603 107 0 0 0 0
+#> 2 1793-Washington NA NA 147 95 12 0 0 0 0
+#> 3 1797-Adams NA NA 2577 801 259 0 0 0 0
+#> 4 1801-Jefferson NA NA 1923 687 197 0 0 0 0
+#> 5 1805-Jefferson NA NA 2380 781 214 0 0 0 0
+#> emojis
+#> 1 0
+#> 2 0
+#> 3 0
+#> 4 0
+#> 5 0
+