<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Deep Learning | Zijian Zhang</title><link>https://www.zzjharry.com/tag/deep-learning/</link><atom:link href="https://www.zzjharry.com/tag/deep-learning/index.xml" rel="self" type="application/rss+xml"/><description>Deep Learning</description><generator>Wowchemy (https://wowchemy.com)</generator><language>en-us</language><lastBuildDate>Sat, 18 Dec 2021 00:00:00 +0000</lastBuildDate><image><url>https://www.zzjharry.com/media/icon_hu8617b2913270b57d2d49c0a56c925c16_905369_512x512_fill_lanczos_center_3.png</url><title>Deep Learning</title><link>https://www.zzjharry.com/tag/deep-learning/</link></image><item><title>Clinical Notes De-identification System</title><link>https://www.zzjharry.com/project/datasifter/</link><pubDate>Sat, 18 Dec 2021 00:00:00 +0000</pubDate><guid>https://www.zzjharry.com/project/datasifter/</guid><description>&lt;p>Data Sifter - TODO&lt;/p></description></item><item><title>Image Captioning - Show, Tell, and Beyond</title><link>https://www.zzjharry.com/project/showtellbeyond/</link><pubDate>Sat, 18 Dec 2021 00:00:00 +0000</pubDate><guid>https://www.zzjharry.com/project/showtellbeyond/</guid><description>&lt;p>Show, Tell, and Beyond - TODO&lt;/p></description></item><item><title>Vertical Tourism Search Engine</title><link>https://www.zzjharry.com/project/where2travel/</link><pubDate>Sat, 18 Dec 2021 00:00:00 +0000</pubDate><guid>https://www.zzjharry.com/project/where2travel/</guid><description>&lt;p>Where2Travel - TODO&lt;/p></description></item><item><title>Extreme Weather Events Dashboard</title><link>https://www.zzjharry.com/project/extremeweathereventsdashboard/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://www.zzjharry.com/project/extremeweathereventsdashboard/</guid><description>&lt;p>During the summer of 2023, I worked as a data science intern at ClimateAi, a company dedicated to assisting businesses and organizations in formulating long-term climate resilience strategies through various AI-based services.&lt;/p>
&lt;p>
&lt;figure >
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="Project Overview" srcset="
/project/extremeweathereventsdashboard/ClimateAiInternProjectOverview_huc03b481c742eeb108cd284260239cbd6_550095_f1d96add90dec43e2126db1f9d67446c.png 400w,
/project/extremeweathereventsdashboard/ClimateAiInternProjectOverview_huc03b481c742eeb108cd284260239cbd6_550095_906cb9c0f7c93125eee5e5b7fa19248a.png 760w,
/project/extremeweathereventsdashboard/ClimateAiInternProjectOverview_huc03b481c742eeb108cd284260239cbd6_550095_1200x1200_fit_lanczos_3.png 1200w"
src="https://www.zzjharry.com/project/extremeweathereventsdashboard/ClimateAiInternProjectOverview_huc03b481c742eeb108cd284260239cbd6_550095_f1d96add90dec43e2126db1f9d67446c.png"
width="760"
height="391"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/p>
&lt;p>I built a comprehensive extreme weather events dashboard, which contains various key attributes (e.g., type of extreme weather events, location, time, crops impacted, quantitative impact) of heatwaves, droughts, hurricanes, and other kinds of extreme weather events happening worldwide.&lt;/p>
&lt;p>There are four main components in the end-to-end pipeline: Web Scraping ⇒ Article Filtering ⇒ Information Extraction ⇒ Creating the Dashboard.&lt;/p>
&lt;h2 id="stage-1---web-scraping">Stage 1 - Web Scraping&lt;/h2>
&lt;ul>
&lt;li>This stage is to scrape publicly available articles from the internet.&lt;/li>
&lt;li>Data Source: 80,000+ articles (e.g., United Nations).&lt;/li>
&lt;li>Tech Stack: BeautifulSoap and Selenium.&lt;/li>
&lt;/ul>
&lt;h2 id="stage-2---article-filtering">Stage 2 - Article Filtering&lt;/h2>
&lt;ul>
&lt;li>This stage is to filter out the irrelevant articles.&lt;/li>
&lt;li>Article Filtering Approaches
&lt;ul>
&lt;li>Approach 1 - Stemming and Keywords Matching&lt;/li>
&lt;li>Approach 2 - Directly Use Pretrained NLP Models&lt;/li>
&lt;li>Approach 3 - Sentence Embedding &amp;amp; Similarity [Preferred]
&lt;ul>
&lt;li>Accuracy - 95%&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;h2 id="stage-3---information-extraction">Stage 3 - Information Extraction&lt;/h2>
&lt;ul>
&lt;li>This stage leverages OpenAI LLMs API and prompt engineering to extract the key attributes from the structured textual data into relatively structured insights.&lt;/li>
&lt;li>OpenAI API - GPT-3.5-turbo&lt;/li>
&lt;/ul>
&lt;h2 id="stage-4---creating-the-dashboard">Stage 4 - Creating the Dashboard&lt;/h2>
&lt;ul>
&lt;li>This stage is the final stage for the creation of the final dashboard, including concatenating the smaller dashboards from different data sources, cleaning &amp;amp; structuring the dashboard, and adding new features for better visualization (e.g., filters based on specific values of an attribute).&lt;/li>
&lt;/ul></description></item></channel></rss>