Scholarly Document Processing @ EMNLP 2020
  • Home
  • Call for Papers
    • Call for Papers
    • Topics of Interest
    • Shared Tasks
    • Submission Information
    • Important Dates
    • Journal Extension
    • Keynote Speakers
    • Committees
  • Shared Tasks
    • Call for Participation
    • CL-SciSumm
    • CL-LaySumm
    • LongSumm
    • Registration
    • Important Dates
    • Organizers
  • ProgramNEW!
    • Keynotes
    • Program
    • Accepted Papers-Research Track
  • Committees
    • Organizing Committee
    • Steering Committee
    • Program Committee
    • Contact Us
  • Other Workshops
  • Venue

Call for Papers

You are invited to participate in the 1st Workshop on Scholarly Document Processing (SDP 2020) to be held in conjunction with the 2020 Conference in Empirical Methods in Natural Language Processing (EMNLP 2020) on November 19. The workshop will be held VIRTUALLY with EMNLP 2020.

Important Updates!

  • The workshop will be held virtually on November 19. Details about mode of participation will be released closer to the workshop.
  • The new submission deadline for research papers is August 15, 2020.
  • The new deadline for the shared tasks system runs is August 15, 2020.
  • All three shared tasks are still open for participant registration.
  • We are delighted to announce that we will have with us two eminent researchers as our keynote speakers: Kuansan Wang, Managing Director, MSR Outreach Academic Services and Steinn Sigurðsson, Scientific Director of arXiv, Professor in the Department of Astronomy & Astrophysics at The Pennsylvania State University

About the workshop

The workshop will consist of a research track and a shared task track.

The shared task track includes the 6th edition of the CL-SciSumm shared task and two new summarization tasks – CL-LaySumm and LongSumm – geared towards easier access to scientific methods and results.

SDP is led by the organizers of BIRNDL and the WOSP workshop series.

Details about mode of participation will be announced later on our Website and Twitter.

Navigation

  • Introduction
  • Topics of Interest
  • Shared Tasks
  • Submission Information
  • Important Dates
  • Journal Extension
  • Keynote Speakers
  • Committees

Introduction

In addition to the long-standing challenge faced by scholars of keeping up with the growing literature in their own and related fields, they must now compete with malign pseudo-science and disinformation in informing public policy and behavior. This has stimulated workshops and research focused on enhancing search, retrieval, summarization, and analysis of scholarly documents. However, the general research community on scholarly document processing remains fragmented, and efforts towards natural language understanding of scholarly text that is central to vastly improve all the said downstream applications are not widespread.

To address these gaps, we propose the first Workshop on Scholarly Document Processing. We seek to reach to the broader NLP and AI/ML community to pool the distributed efforts to improve scholarly document understanding and enable intelligent access to the published research. The goal of SDP is two-fold: to increase collaboration between communities interested in leveraging knowledge stored in scientific literature and data and to establish SDP as the single-focused primary venue for the field.

We seek to appeal to the mainstream NLP and ML community working on SDP tasks – which are NLP tasks – to publish at SDP as we seek to establish SDP as the integrated premier venue. We have established a steering committee to help us turn SDP into a conference in the forthcoming years.

Topics of interest

We invite submissions from all communities interested in natural language processing, information retrieval, and data mining problems in scientific documents; and in processing scientific documents for easier access to various audiences. The topics of interest include, but are not limited to:

  • Information extraction, text mining and parsing of scholarly literature
  • Reproducibility and peer review
  • Lay Summarization (i.e., summaries created for non-experts) of individual and collections of scholarly documents
  • Discourse modeling and argument mining
  • Summarization and question-answering for scholarly documents
  • Semantic and network-based indexing, search and navigation in structured text
  • Graph analysis/mining including citation and co-authorship networks
  • Analysing and mining of citation contexts for document understanding and retrieval
  • New scholarly language resources and evaluation
  • Connecting and interlinking publications, data, tweets, blogs or their parts
  • Disambiguation, metadata extraction, enrichment, and data quality assurance for scholarly documents
  • Bibliometrics, scientometrics, and altmetrics approaches and applications
  • Other aspects of scientific workflows including open access/science, and research assessment
  • Infrastructures for accessing scientific publications and/or research data
  • Results and research questions on the COVID-19 Open Research Dataset (CORD-19)

Shared Tasks

The shared task track includes the following tasks:

  1. CL-SciSumm 2020: The 6th Computational Linguistics Scientific Document Summarization Shared Task
  2. CL-LaySumm 2020: The 1st Computational Linguistics Lay Summary Challenge Shared Task
  3. LongSumm 2020: The 1st Shared Task on Generating Long Summaries for Scientific Documents

For detailed information about the tasks, data, evaluation, and organizers, please see the shared tasks page.

Submission Information

Authors are invited to submit short and full papers with unpublished, original work. Submissions will be subject to a double blind peer review process. Accepted papers will be presented by the authors at the workshop either as a talk or a poster. All accepted papers will be published in the workshop proceedings.

Submission Website: Submission is electronic, using the Softconf START conference management system.

The submissions should be in PDF format and anonymized for review. SDP follows anonymity guidelines set by EMNLP 2020. Please do not include links to systems, codes or datasets introduced in your submission that will compromise the anonymity constraints. However, you can use properly anonymized links for the same. You can also include textual supplementary material as appendix at the end of the paper. Additionally you can make a placeholder with a note in the submission with a statement: Code/Data will be released on acceptance. Then put the de-anonymized links in your camera-ready version on acceptance.

All submissions must be written in English and follow the EMNLP 2020 formatting requirements, which will be provided on the conference website soon.

Long paper submissions: up to 8 pages of content, plus unlimited references.

Short paper submissions: up to 4 pages of content, plus unlimited references.

Final versions of accepted papers will be allowed 1 additional page of content so that reviewer comments can be taken into account.

Shared Task registration: Participants of all shared tasks need to register here by April 30th, 2020 (remains open till evaluation window starts). For more information please see the shared tasks page.

Important Dates NEW!

For shared task dates please see the shared tasks page. All submission deadlines are 11:59 PM AoE (Anywhere on Earth) Time Zone (UTC-12).

Event Date
Submission Deadline Aug 15, 2020
Retraction of workshop papers accepted to EMNLP main conference Sep 15, 2020
Notification of acceptance Sep 29, 2020
Camera-ready submissions due Oct 10, 2020
Workshop Nov 19, 2020

Journal Extension

In the past, the accepted authors were invited to submit an extended version of their work to a special issue of a selected journal. The organizers are currently in the process of identifying appropriate journals to host a similar special issue this year. Relevant updates including topics and requirements for this special issue will be shared on the workshop website in due time.

Keynote Speakers

  • Kuansan Wang, Managing Director, Microsoft Research Outreach Academic Services
  • Prof. Steinn Sigurdsson, Scientific Director of arXiv, Professor in the Department of Astronomy & Astrophysics at The Pennsylvania State University

Committees

  • Organizing Committee
  • Steering Committee
  • Program Committee

With kind regards,

SDP 2020 organizing committee

Links

EMNLP 2020 formatting requirements

Important Dates NEW!

Submission deadline – Aug 15, 2020

Retraction of papers accepted to EMNLP main conf. – Sep 15, 2020

Notification of acceptance – Sep 29, 2020

Camera-ready – Oct 10, 2020

Workshop – Nov 19, 2020

Contact

Please contact sdproc@googlegroups.com with any questions regarding the workshop.



Contact: sdproc@googlegroups.com

Follow us: https://twitter.com/SDProc

© 2020 Oak Ridge National Laboratory

Back to top