Skip to content

Commit e31c55d

Browse files
committed
update catalog-efile download page
Add additional information about the Schedule Table and prevalence of schedules filed in the 2021 tax year.
1 parent ee6516e commit e31c55d

File tree

2 files changed

+193
-81
lines changed

2 files changed

+193
-81
lines changed

catalogs/catalog-efile.html

Lines changed: 109 additions & 22 deletions
Original file line numberDiff line numberDiff line change
@@ -61,16 +61,18 @@ <h1 class="title">990 EFILER Data Catalog</h1>
6161

6262
</header>
6363
<nav id="TOC" role="doc-toc">
64+
<h2 id="toc-title">Efile Data Contents</h2>
6465

6566
<ul>
6667
<li><a href="#about-nccs-efile-data" id="toc-about-nccs-efile-data">About NCCS Efile Data</a>
6768
<ul>
69+
<li><a href="#data-sources-and-processing" id="toc-data-sources-and-processing">Data Sources and Processing</a></li>
6870
<li><a href="#data-dictionary" id="toc-data-dictionary">Data Dictionary</a></li>
6971
<li><a href="#coverage" id="toc-coverage">Coverage</a></li>
7072
<li><a href="#structure-of-table-and-file-names" id="toc-structure-of-table-and-file-names">Structure of Table and File Names</a></li>
71-
<li><a href="#schedule-table" id="toc-schedule-table">Schedule Table</a></li>
73+
<li><a href="#schedule-manifest" id="toc-schedule-manifest">Schedule Manifest</a></li>
7274
</ul></li>
73-
<li><a href="#downloads" id="toc-downloads">Downloads</a>
75+
<li><a href="#available-tables" id="toc-available-tables">Available Tables</a>
7476
<ul>
7577
<li><a href="#f9-p00-t00-header" id="toc-f9-p00-t00-header">F9-P00-T00-HEADER</a></li>
7678
<li><a href="#f9-p01-t00-summary" id="toc-f9-p01-t00-summary">F9-P01-T00-SUMMARY</a></li>
@@ -99,7 +101,7 @@ <h1 class="title">990 EFILER Data Catalog</h1>
99101
<li><a href="#sc-p01-t00-lobby" id="toc-sc-p01-t00-lobby">SC-P01-T00-LOBBY</a></li>
100102
<li><a href="#sc-p02-t00-lobby" id="toc-sc-p02-t00-lobby">SC-P02-T00-LOBBY</a></li>
101103
<li><a href="#sc-p03-t00-lobby" id="toc-sc-p03-t00-lobby">SC-P03-T00-LOBBY</a></li>
102-
<li><a href="#schedule-table-1" id="toc-schedule-table-1">SCHEDULE-TABLE</a></li>
104+
<li><a href="#schedule-table" id="toc-schedule-table">SCHEDULE-TABLE</a></li>
103105
<li><a href="#sd-p01-t00-orgs-donor-advised-funds-oth" id="toc-sd-p01-t00-orgs-donor-advised-funds-oth">SD-P01-T00-ORGS-DONOR-ADVISED-FUNDS-OTH</a></li>
104106
<li><a href="#sd-p02-t00-conserv-easements" id="toc-sd-p02-t00-conserv-easements">SD-P02-T00-CONSERV-EASEMENTS</a></li>
105107
<li><a href="#sd-p03-t00-orgs-collect-art-hist-treasure-oth" id="toc-sd-p03-t00-orgs-collect-art-hist-treasure-oth">SD-P03-T00-ORGS-COLLECT-ART-HIST-TREASURE-OTH</a></li>
@@ -140,26 +142,31 @@ <h1 class="title">990 EFILER Data Catalog</h1>
140142
</ul></li>
141143
</ul>
142144
</nav>
143-
<br>
145+
<p><br></p>
144146
<hr>
145147
<p><br></p>
146148
<section id="about-nccs-efile-data" class="level1">
147149
<h1>About NCCS Efile Data</h1>
148-
<p>IRS 990 Efile data is generated from XML files hosted on the 990 Data Commons:</p>
149-
<p>XML files are parsed using an <strong>XML-to-RDB Concordance</strong> file that has been produced by the Nonprofit Open Data Collective.</p>
150-
<p>XML documents are converted in the data tables described in data dictionary using the <strong>irs990efile</strong> R package:</p>
151-
<p>The NCCS downloads below include <strong>data from 990 and 990EZ returns and their associated schedules</strong>. The database does not include 990PF or 990T data.</p>
150+
<section id="data-sources-and-processing" class="level2">
151+
<h2 class="anchored" data-anchor-id="data-sources-and-processing">Data Sources and Processing</h2>
152+
<p>IRS 990 Efile data is generated from XML files hosted on the <strong>990 Data Commons</strong>.</p>
153+
<p>XML files are parsed using an <strong>XML-to-RDB Concordance</strong> file that has been produced by Jesse Lecy, Hyunrang Han, and the Nonprofit Open Data Collective.</p>
154+
<p>XML documents are converted in the data tables described in data dictionary using the <strong>irs990efile R Package</strong>.</p>
155+
<p>The NCCS downloads below include <strong>data from 990 and 990EZ returns and their associated schedules</strong>.</p>
152156
<hr>
153157
<p><a href="https://www.givingtuesday.org/data-commons/990-data-infrastructure/" class="buttonx" target="_blank"> 990 DATA COMMONS </a></p>
154-
<p><a href="https://github.com/Nonprofit-Open-Data-Collective/irs-efile-master-concordance-file" class="buttonx" target="_blank"> CONCORDANCE CROSSWALK </a></p>
155-
<p><a href="https://github.com/Nonprofit-Open-Data-Collective/irs990efile" class="buttonx" target="_blank"> R PACKAGE: irs990efile </a></p>
158+
<p><a href="https://github.com/Nonprofit-Open-Data-Collective/irs-efile-master-concordance-file" class="buttonx" target="_blank"> CONCORDANCE DETAILS </a></p>
159+
<p><a href="https://github.com/Nonprofit-Open-Data-Collective/irs990efile" class="buttonx" target="_blank"> R PACKAGE: IRS990EFILE </a></p>
160+
<p><a href="https://nonprofit-open-data-collective.github.io/tools/" class="buttonx" target="_blank"> NONPROFIT OPEN DATA COLLECTIVE </a></p>
156161
<hr>
162+
<p>The database does not currently include 990PF or 990T data.</p>
157163
<p><br> <br></p>
164+
</section>
158165
<section id="data-dictionary" class="level2">
159166
<h2 class="anchored" data-anchor-id="data-dictionary">Data Dictionary</h2>
160-
<p>Efile data has been parsed into 125 tables that correspond to sections (“Parts”) on 990 Forms and Schedules.</p>
167+
<p>Efile data has been parsed into 125 tables that correspond to sections (“Parts”) on 990 Forms and Schedules. All years use a consistent data dictionary.</p>
161168
<p><a href="https://nonprofit-open-data-collective.github.io/irs990efile/data-dictionary/data-dictionary.html" class="buttonx" target="_blank"> <b>DATA DICTIONARY</b> </a></p>
162-
<br>
169+
<p><br></p>
163170
<hr>
164171
<p><br></p>
165172
</section>
@@ -168,7 +175,7 @@ <h2 class="anchored" data-anchor-id="coverage">Coverage</h2>
168175
<p>Non-mandatory electronic filing for tax-exempt organizations started around 2010 (note that tax year lags the calendar year by 12-18 months). Utilization of electronic filing options (versus paper returns) grew steadily over the next decade.</p>
169176
<p>Starting in 2016 the IRS began releasing electronic returns as free open data to the general public in a raw XML format. <em>The schemas needed to make sense of the XML file structures, however, have not been made public.</em></p>
170177
<p>Beginning in the 2019/2020 tax year efiling became mandatory for most tax-exempt organizations.</p>
171-
<p><strong>990 Efile Returns by FormType and TaxYear</strong></p>
178+
<p><strong>990 Efile Returns by FormType and TaxYear (990 Data Commons)</strong></p>
172179
<table class="table">
173180
<thead>
174181
<tr class="header">
@@ -294,6 +301,7 @@ <h2 class="anchored" data-anchor-id="coverage">Coverage</h2>
294301
</tr>
295302
</tbody>
296303
</table>
304+
<p><em>Updated February 2024</em></p>
297305
<p>The NCCS downloads below include data from 990 and 990EZ returns and their associated schedules.</p>
298306
<p>The 990PF fields have been mapped within the Concordance File, so technically <a href="https://github.com/Nonprofit-Open-Data-Collective/990pf-dev">it is possible to build 990PF tables with the irs990efile package</a>. Due to the complexity of many 990PF tables (mandatory fields are often formatted as Schedule O amendments or file attachments that require additional parsing) we have not created public files for widespread distribution.</p>
299307
<p><strong>NOTE:</strong></p>
@@ -330,15 +338,91 @@ <h2 class="anchored" data-anchor-id="structure-of-table-and-file-names">Structur
330338
<p>All other EFILE tables have PZ form scope, meaning they include data from both full 990 and 990EZ filers.</p>
331339
<p>For the list of available TTT tables see the <a href="https://nonprofit-open-data-collective.github.io/irs990efile/data-dictionary/data-dictionary.html">DATA DICTIONARY</a>.</p>
332340
</section>
333-
<section id="schedule-table" class="level2">
334-
<h2 class="anchored" data-anchor-id="schedule-table">Schedule Table</h2>
335-
<p>The SCHEDULE-TABLE documents which 990 Schedules each organized has filed in a given tax year.</p>
336-
<p><a href="#schedule-table-1" class="button2" target="_blank"> SCHED TABLE </a></p>
341+
<section id="schedule-manifest" class="level2">
342+
<h2 class="anchored" data-anchor-id="schedule-manifest">Schedule Manifest</h2>
343+
<p><a href="#schedule-table" class="button2" target="_blank"> SCHED TABLE </a></p>
344+
<p>Since schedules are not required for most 990 filers the corresponding schedule data tables only contain organizations that have filed that specific schedule. <strong>SCHEDULE-TABLE.CSV</strong> documents which 990 schedules each organized has filed in a given tax year. It thus can serve as a sampling framework.</p>
345+
<p><strong>Schedule Prevalence in a Single Tax Year:</strong></p>
346+
<table class="table">
347+
<thead>
348+
<tr class="header">
349+
<th style="text-align: left;">990 Schedule</th>
350+
<th style="text-align: left;">Number of Filers</th>
351+
</tr>
352+
</thead>
353+
<tbody>
354+
<tr class="odd">
355+
<td style="text-align: left;">SCHED-A</td>
356+
<td style="text-align: left;">377,609</td>
357+
</tr>
358+
<tr class="even">
359+
<td style="text-align: left;">SCHED-B</td>
360+
<td style="text-align: left;">208,367</td>
361+
</tr>
362+
<tr class="odd">
363+
<td style="text-align: left;">SCHED-C</td>
364+
<td style="text-align: left;">34,831</td>
365+
</tr>
366+
<tr class="even">
367+
<td style="text-align: left;">SCHED-D</td>
368+
<td style="text-align: left;">246,227</td>
369+
</tr>
370+
<tr class="odd">
371+
<td style="text-align: left;">SCHED-E</td>
372+
<td style="text-align: left;">17,536</td>
373+
</tr>
374+
<tr class="even">
375+
<td style="text-align: left;">SCHED-F</td>
376+
<td style="text-align: left;">15,471</td>
377+
</tr>
378+
<tr class="odd">
379+
<td style="text-align: left;">SCHED-G</td>
380+
<td style="text-align: left;">96,717</td>
381+
</tr>
382+
<tr class="even">
383+
<td style="text-align: left;">SCHED-H</td>
384+
<td style="text-align: left;">1,850</td>
385+
</tr>
386+
<tr class="odd">
387+
<td style="text-align: left;">SCHED-I</td>
388+
<td style="text-align: left;">67,438</td>
389+
</tr>
390+
<tr class="even">
391+
<td style="text-align: left;">SCHED-J</td>
392+
<td style="text-align: left;">65,039</td>
393+
</tr>
394+
<tr class="odd">
395+
<td style="text-align: left;">SCHED-K</td>
396+
<td style="text-align: left;">5,387</td>
397+
</tr>
398+
<tr class="even">
399+
<td style="text-align: left;">SCHED-L</td>
400+
<td style="text-align: left;">26,539</td>
401+
</tr>
402+
<tr class="odd">
403+
<td style="text-align: left;">SCHED-M</td>
404+
<td style="text-align: left;">281,73</td>
405+
</tr>
406+
<tr class="even">
407+
<td style="text-align: left;">SCHED-N</td>
408+
<td style="text-align: left;">4,753</td>
409+
</tr>
410+
<tr class="odd">
411+
<td style="text-align: left;">SCHED-O</td>
412+
<td style="text-align: left;">480,069</td>
413+
</tr>
414+
<tr class="even">
415+
<td style="text-align: left;">SCHED-R</td>
416+
<td style="text-align: left;">70,464</td>
417+
</tr>
418+
</tbody>
419+
</table>
420+
<p><a href="https://github.com/Nonprofit-Open-Data-Collective/irs-efile-master-concordance-file/blob/master/README.md#form-990-and-schedule-sections">Schedule Details</a></p>
337421
<p><br> <br></p>
338422
</section>
339423
</section>
340-
<section id="downloads" class="level1">
341-
<h1>Downloads</h1>
424+
<section id="available-tables" class="level1">
425+
<h1>Available Tables</h1>
342426
<hr>
343427
<section id="f9-p00-t00-header" class="level3">
344428
<h3 class="anchored" data-anchor-id="f9-p00-t00-header">F9-P00-T00-HEADER</h3>
@@ -6361,8 +6445,8 @@ <h3 class="anchored" data-anchor-id="sc-p03-t00-lobby">SC-P03-T00-LOBBY</h3>
63616445
</table>
63626446
<p><a href="#top"> Back to Top </a></p>
63636447
</section>
6364-
<section id="schedule-table-1" class="level3">
6365-
<h3 class="anchored" data-anchor-id="schedule-table-1">SCHEDULE-TABLE</h3>
6448+
<section id="schedule-table" class="level3">
6449+
<h3 class="anchored" data-anchor-id="schedule-table">SCHEDULE-TABLE</h3>
63666450
<p><a href="https://nonprofit-open-data-collective.github.io/irs990efile/data-dictionary/data-dictionary.html" class="button2" target="_blank"> DATA DICTIONARY </a></p>
63676451

63686452
<table class="table" style="margin-left: auto; margin-right: auto;">
@@ -14813,7 +14897,6 @@ <h3 class="anchored" data-anchor-id="partviipartvii">partvii/PARTVII</h3>
1481314897
h1.title {
1481414898
font-size: 60px;
1481514899
color: #0a4c6a;
14816-
margin-bottom: 50px;
1481714900
}
1481814901

1481914902

@@ -14824,6 +14907,10 @@ <h3 class="anchored" data-anchor-id="partviipartvii">partvii/PARTVII</h3>
1482414907
margin-bottom: 30px;
1482514908
}
1482614909

14910+
nav#TOC {
14911+
margin-top: 100px;
14912+
}
14913+
1482714914
nav#TOC a {
1482814915
font-family: var(--bs-font-monospace);
1482914916
text-transform: uppercase;

0 commit comments

Comments
 (0)