Skip to content

Commit

Permalink
feat: add high level agenda subslides
Browse files Browse the repository at this point in the history
  • Loading branch information
lorae committed Jun 18, 2024
1 parent 884deff commit 20b49ae
Show file tree
Hide file tree
Showing 2 changed files with 43 additions and 23 deletions.
34 changes: 22 additions & 12 deletions advanced-web-scraping.slides.html
Original file line number Diff line number Diff line change
Expand Up @@ -7497,7 +7497,7 @@
</div>
<div class="jp-InputArea jp-Cell-inputArea"><div class="jp-InputPrompt jp-InputArea-prompt">
</div><div class="jp-RenderedHTMLCommon jp-RenderedMarkdown jp-MarkdownOutput" data-mime-type="text/markdown">
<h1 id="Advanced-Web-Scraping-with-Python">Advanced Web Scraping with Python<a class="anchor-link" href="#Advanced-Web-Scraping-with-Python"></a></h1><br/>
<h1 id="Web-Scraping-with-Python">Web Scraping with Python<a class="anchor-link" href="#Web-Scraping-with-Python"></a></h1><br/>
<h3 id="Lorae-Stojanovic">Lorae Stojanovic<a class="anchor-link" href="#Lorae-Stojanovic"></a></h3><p>June 20, 2024</p>
</div>
</div>
Expand Down Expand Up @@ -7542,19 +7542,29 @@ <h1 id="Agenda">Agenda<a class="anchor-link" href="#Agenda">¶</a></h1><ol>
</div>
<div class="jp-InputArea jp-Cell-inputArea"><div class="jp-InputPrompt jp-InputArea-prompt">
</div><div class="jp-RenderedHTMLCommon jp-RenderedMarkdown jp-MarkdownOutput" data-mime-type="text/markdown">
<p>Libraries we WILL use in this presentation:</p>
<p>This presentation is designed for an audience with a basic understanding of Python and some experience in web scraping.</p>
<p>However, most of the content is accessible even for novices to both topics.</p>
</div>
</div>
</div>
</div></section><section>
<div class="jp-Cell jp-MarkdownCell jp-Notebook-cell">
<div class="jp-Cell-inputWrapper" tabindex="0">
<div class="jp-Collapser jp-InputCollapser jp-Cell-inputCollapser">
</div>
<div class="jp-InputArea jp-Cell-inputArea"><div class="jp-InputPrompt jp-InputArea-prompt">
</div><div class="jp-RenderedHTMLCommon jp-RenderedMarkdown jp-MarkdownOutput" data-mime-type="text/markdown">
<p><strong>By the end of this presentation, you will:</strong></p>
<ul>
<li>requests</li>
<li>BeautifulSoup</li>
<li>JSON</li>
<li>XML</li>
<li>selenium</li>
<li>Understand how your browser interacts with the internet</li>
<li>Be able to explain how an HTML file is used to build a webpage</li>
<li>Be able to gather data from the internet using 3 methods:<ul>
<li>HTTP requests + HTML parsing</li>
<li>Selenium + HTML parsing</li>
<li>API requests</li>
</ul>
<p>These topics will NOT be covered in this presentation:</p>
<ul>
<li>pyppeteer, playwright (an even more advanced way to scrape using Chrome Developer Protocol) [FACT CHECK!!!!]</li>
<li>scrapy</li>
<li>MechanicalSoup</li>
</li>
<li>Understand the advantages and shortfalls of each method</li>
</ul>
</div>
</div>
Expand Down
32 changes: 21 additions & 11 deletions slides/advanced-web-scraping.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -55,7 +55,7 @@
}
},
"source": [
"# Advanced Web Scraping with Python\n",
"# Web Scraping with Python\n",
"\n",
"<br>\n",
"\n",
Expand Down Expand Up @@ -99,19 +99,29 @@
}
},
"source": [
"Libraries we WILL use in this presentation:\n",
"This presentation is designed for an audience with a basic understanding of Python and some experience in web scraping. \n",
"\n",
"- requests\n",
"- BeautifulSoup\n",
"- JSON\n",
"- XML\n",
"- selenium\n",
"However, most of the content is accessible even for novices to both topics."
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "subslide"
}
},
"source": [
"\n",
"These topics will NOT be covered in this presentation:\n",
"**By the end of this presentation, you will:**\n",
"\n",
"- pyppeteer, playwright (an even more advanced way to scrape using Chrome Developer Protocol) [FACT CHECK!!!!]\n",
"- scrapy\n",
"- MechanicalSoup"
"- Understand how your browser interacts with the internet\n",
"- Be able to explain how an HTML file is used to build a webpage\n",
"- Be able to gather data from the internet using 3 methods:\n",
" - HTTP requests + HTML parsing\n",
" - Selenium + HTML parsing\n",
" - API requests\n",
"- Understand the advantages and shortfalls of each method"
]
},
{
Expand Down

0 comments on commit 20b49ae

Please sign in to comment.