Springer book: Citation-based Plagiarism Detection

Available in July 2014:

  • [PDF] B. Gipp, Citation-based Plagiarism Detection – Detecting Disguised and Cross-language Plagiarism using Citation Pattern Analysis, Springer Vieweg Research, 2014.
    [Bibtex]
    @BOOK{ThesisBelaGipp,
      title = {Citation-based Plagiarism Detection - Detecting Disguised and Cross-language
      Plagiarism using Citation Pattern Analysis},
      publisher = {Springer Vieweg Research},
      year = {2014},
      author = {Gipp, Bela},
      pages = {350},
      isbn = {978-3-658-06393-1},
      school = {Department of Computer Science, Otto-von-Guericke-University Magdeburg,
      Germany},
      url = {http://www.springer.com/978-3-658-06393-1}
    }

book cover

Publisher website                           Pre-print download

Paid Internships in Tokyo, Japan for MS and PhD Students

We are offering two internships related to various research and software development projects at the National Institute of Informatics Tokyo – Japan’s premiere research institute in computer science.

Key Facts:
• Location: Tokyo, Japan
• Duration: minimum 2 months, maximum flexible
• Salary: approx. 170,000 JPY per month (sufficient to cover living expenses in Tokyo)

Requirements:
• Interns must be currently enrolled master or doctoral students.
• Background in computer science or related disciplines
• Motivation, curiosity, ability to work methodologically and independent

Contact:
For more information on potential tasks or any other questions, please contact Bela Gipp (gipp@nii.ac.jp). Please include a short description of yourself, your background and the period in which you would like to do the internship.

Apply for a Paid Internship at SciPlore – Summer 2014

In cooperation with the German Academic Exchange Service (DAAD), we are offering a paid internship for a computer science Bachelor student, who is passionate about prototype design and development. Prerequisite is that you are a student studying at a German university. More details on prerequisites here, download the project description as PDF.

At SciPlore you will be working with an international team of researchers, affiliated with the University of California Berkeley and the University of Magdeburg in Germany. As an intern, you will have the chance to spend 6-12 weeks abroad at a research institute collaborating with the SciPlore team.

The SciPlore team develops novel approaches in citation and semantic text analysis for quantifying similarities between scientific articles. Similarity assessments are crucial to many Information Retrieval (IR) tasks, such as clustering of documents, recommending academic literature, or automatically detecting academic plagiarism.

Below is a screenshot of CitePlag – the first prototype of a citation-based plagiarism detection system, which we developed. To see for yourself the document similarity visualization of CitePlag, use the system on a real plagiarism case that translated from Chinese, or on a plagiarized medical publication that was retracted with the help of CitePlag.

Guttenberg_report

Plagiarism Case by K.-T. zu Guttenberg

 Screenshot showing document visualization in CitePlag – click on the image to open the interactive web-based prototype

This internship with SciPlore is made possible by the German Academic Exchange Service (DAAD). To apply, simply create a profile with the DAAD on their RISE-Weltweit 2014 website and search for SciPlore’s project page in their database, which describes all the details of the internship project, including your tasks and the skills required.

Application deadline is January 12, 2014.

Compensation: The DAAD will provide you with a monthly stipend and a fixed payment to cover your travel expenses!

Berkeley – 1,000 EUR per month + 1,250 EUR onetime payment for travel expenses

If you have further questions please contact Bela Gipp at gipp@sciplore.org.

 

New release of SciPlore Mindmapping / Docear!

Version 1.0 is out now!

We’ve added several new features and made many improvements over the past few months.

See for yourself:

New user manual

To the download

For those of you still wondering…what is Docear?

Docear is a unique solution to literature management that seamlessly integrates mind map creation and a reference manager to easily organize the notes you made inside PDF documents and the annotations you made while reading. Docear’s approach helps you with all three steps of the reading – creative thinking – and writing process:

  • organizing your thoughts and annotations (mind map)
  • creating new documents (exporting outlines from mind map)
  • and discovering new academic literature (recommender system)

Docear does this in a single-section user-interface. This distinguishes the software from the interfaces of competitors, including Zotero, JabRef, Mendeley, Endnote, etc. in that it allows a more comprehensive organization of your e-literature (PDFs) and all the notes, highlights, comments, bookmarks, etc. you created while reading! Our goal is that you to never get lost or forget an idea again.

Docear’s ‘literature suite concept’ helps you draft and write academic articles, papers, publications, or books and theses by providing you with all the annotations you previously created right there at your finger tips! Docear works with standard PDF annotations, so you can use your favorite PDF viewer.

The literature suite concept also includes a recommender system that allows you to discover new academic literature. Docear is free and open source and offers many additional features, which you can check out here: http://www.docear.org/software/details/

CitePlag got a makeover

CitePlag is the first prototype of a citation-based Plagiarism Detection (CbPD) System.

The prototype was just recently demonstrated at this year’s SIGIR conference.

So what’s novel about CitePlag?

In contrast to existing text-based approaches to plagiarism detection, CitePlag does not analyze literal text matches alone to determine document suspiciousness – but rather, CitePlag makes use of the unique citation placement in the full-text of documents to determine similarity and detect potential plagiarism.

In examining citation placement, position, and order, CitePlag forms a text-independent / and even language-barrier transcending “fingerprint” of the semantic content of documents, which can then be used to detect potential unoriginality and plagiarism.

Chines_English_CitePlag

CitePlag has come a long way from it’s humble beginnings in 2010, when we first proposed a citation-based approach to detect semantic similarity between documents for use in plagiarism detection. A year later, we developed the algorithms, and today we have a working prototype available for public use!

CitePlag now received a new homepage featuring improved functionality.

You can:

  1. upload your own files  (PDF/ text documents)
  2. examine the most plagiarism findings and example of retracted plagiarism cases
  3. compare any two publications from the Open Access subset of the PubMed’s database  (200,000+ medical publications)

Test the CitePlag prototype for yourself at its new home on the web: http://citeplag.org/

If you’re curious about the project, see our related publications for more details on CbPD, or request a copy of the doctoral thesis of Bela Gipp, which narrows in on all aspects of Citation-based Plagiarism Detection.

Docear goes CeBIT

From March 6th to 10th, 2012 we will be at the CeBIT in Hannover, Germany. CeBIT is the digital industry’s largest and most international trade show. This event is always worth a visit. We will be at Halle 9, Stand A 10 to present Docear. You are welcome to visit us and meet the Docear team (or at least part of the team).

First public version of Docear released (1.0 Beta 1)

Today, on February 15th, 2012 we released the first public version of Docear. It’s a Beta version and still has some bugs and missing features but overall it will give you a thorough impression of what we consider an academic literature suite should be.

The main idea behind Docear is that you annotate everything you consider important in a PDF. That means, you highlight text, write comments, or create bookmarks in the PDF. To create bookmarks and comments, you can use almost any PDF reader (highlighting text is a bit more complicated). These PDF annotations are then imported by Docear to a mind map. In this mind map you can organize all your annotations into categories, create further nodes and add more text. With the integrated reference manager, bibliographic data can be added to each of the PDF annotations (and all other nodes in the mind map). Subsequently, you can create a new mind map, drag e.g. a research paper, copy your annotations to the draft and if you need more information you just click on the PDF annotation and the PDF will open on the page the annotation was made. Watch this video to get a better idea of what Docear can do (watch it in full-screen mode). Read more First public version of Docear released (1.0 Beta 1)

Experimental Releases

We decided to make our internal test releases public. Every one or two weeks we will publish an experimental release of Docear in our forum. Those releases are not thoroughly tested but have the latest features. You can subscribe to the forum to be notified about new experimental releases (we won’t post them in the Blog or mention them in the newsletter). If you try an experimental release, your bug reports are highly welcome in our bug forum.

Docear Video Teaser Online

We created an introductory video for Docear. It aims at providing a brief overview of Docear’s capabilities and the basic workflow. Please let us know what you think could be improved. Personally, I believe it could be done much better in general but we don’t have the skills to do it ;-). If you have experience with video editing, please don’t hesitate to make your own video. If someone is interested I would also gladly share the raw material.

(Bachelor) Students: Do a (Paid) Internship at Docear in Magdeburg, Germany or Berkeley, USA

We are happy to announce that we can offer paid internships to German, British, and North American (USA & Canada) Bachelor students, in cooperation with the German Academic Exchange Service (DAAD). Some of the internships are in cooperation with our partner SciPlore or for our project Mr. DLib. However, all results will be used by Docear, too. These are the internships we are offering (all in the field of software development).

Docear Internships (preferably in Magdeburg, Germany)

PDF Header/Metadata Extraction

Creating a Recommender System for Docear

Joined Internships by SciPlore/Docear/Mr. DLib (preferably in Berkeley, US)

Deduplication of Records in Academic Literature Databases

Header Extraction from Scientific PDF Documents

Recommendation Engine for Academic Literature

And here is how it works:

If you are a Bachelor student from the US, Canada or the UK you can apply for a DAAD scholarship which will fully fund your internship (only “Docear internships”). To do so, visit the DAAD Rise website, register, search in the database for the Docear internships and apply. In case you application is successful, your internship will take place in Magdeburg, Germany in summer 2012. Deadline for applications is January 31, 2012.

If you are a German Bachelor student, you can apply for a DAAD scholarship too, but only for the “joined internships”. To do so, visit the DAAD Rise Worldwide website, register, search in the database for the joined internships, and apply. In case your application is successful, your internship will take place in fall 2012 in Berkeley (USA). Deadline for applications is January 15, 2012.

If you are neither a German nor US/UK/Canadian Bachelor student or if you don’t like the location or time the sponsored internships require, you can still apply for the internships with a date, location and duration of your choice. However,  we won’t be able to pay you. If you are still interested, send us an email with application letter and resume to team@sciplore.org.

It is also possible (and highly welcome) to combine the internship with writing a (Bachelor) thesis.