Skip to yearly menu bar Skip to main content


Poster

WordScape: a Pipeline to extract multilingual, visually rich Documents with Layout Annotations from Web Crawl Data

Maurice Weber ⋅ Carlo Siebenschuh ⋅ Rory Butler ⋅ Anton Alexandrov ⋅ Valdemar Thanner ⋅ Georgios Tsolakis ⋅ Haris Jabbar ⋅ Ian Foster ⋅ Bo Li ⋅ Rick Stevens ⋅ Ce Zhang
2023 Poster
[ Paper [ Slides [ Poster [ OpenReview

Abstract

Video

Chat is not available.