5. - 6. Mai 2021

Graduate Center Masterclasses

WEB CRAWLING with Prof. Dr. Timm Teubner (TU Berlin)

Termin

5. - 6. Mai 2021

10:00-12:00

Ort

Online

Sprecher*innen

Prof. Dr. Timm Teubner

Zum Kalender hinzufügen

Organiser: Prof. Dr. Timm Teubner (TU Berlin)

In this beginner’s guide to web crawling, we will cover the basics of how to automatically extract information from static and dynamic websites. The course will include a fair share of “hands on” work, in which we will write and run code ourselves (Java). If you plan to code along (highly recommended), please have a Java IDE ready for the workshop (e.g. using Eclipse). Beyond the basic principles of how to access websites, we will learn how to navigate the retrieved HTML code (JSoup), how to deal with dynamic and interactive pages (Selenium), and consider some important legal aspects.

Tomaso Duso

Abteilungsleiter Abteilung Unternehmen und Märkte

+49 30 89789 - 520
tduso@diw.de

Abteilungen und SOEP

Forschungsgruppen

Prognose und Projekte

Aktuelles

Über uns

SOEP-Daten

Forschung

WEB CRAWLING with Prof. Dr. Timm Teubner (TU Berlin)

Termin

Ort

Sprecher*innen

Kontakt