Citation Proximity Analysis: Recommendation and Clustering Algorithms for Academic Literature

Citation Proximity Analysis (CPA) [1, 2, 3] is a method for computing similarities between academic documents. It was developed to provide relevant literature recommendations and more precise clustering.

The approach extends co-citation analysis. In addition to counting co-citations, it considers how close citations are to each other within an article’s full text. The underlying idea is that the closer two citations are to each other, the more likely the cited documents are to be related.
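The idea can be illustrated with a minimal Python sketch. It is not the authors' implementation: the weight values, the (section, paragraph, sentence) position encoding, and the names `WEIGHTS`, `proximity_weight` and `cpa_scores` are assumptions made here for illustration. Each pair of co-cited documents receives a weight that grows as the two citations move closer together, and the weights are summed over all citing documents.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical proximity weights (assumed for illustration only):
# the closer two citations are, the larger the weight.
WEIGHTS = {"sentence": 1.0, "paragraph": 0.5, "section": 0.25, "document": 0.125}


def proximity_weight(pos_a, pos_b):
    """Weight for two citation positions given as (section, paragraph, sentence)."""
    if pos_a == pos_b:
        return WEIGHTS["sentence"]      # same sentence
    if pos_a[:2] == pos_b[:2]:
        return WEIGHTS["paragraph"]     # same paragraph, different sentence
    if pos_a[0] == pos_b[0]:
        return WEIGHTS["section"]       # same section, different paragraph
    return WEIGHTS["document"]          # same document only


def cpa_scores(citing_documents):
    """Sum, over all citing documents, the best proximity weight of each co-cited pair.

    citing_documents: list of documents; each document is a list of
    (cited_id, (section, paragraph, sentence)) tuples.
    """
    scores = defaultdict(float)
    for citations in citing_documents:
        best = {}  # closest occurrence of each pair within this document
        for (id_a, pos_a), (id_b, pos_b) in combinations(citations, 2):
            if id_a == id_b:
                continue
            pair = tuple(sorted((id_a, id_b)))
            weight = proximity_weight(pos_a, pos_b)
            best[pair] = max(best.get(pair, 0.0), weight)
        for pair, weight in best.items():
            scores[pair] += weight
    return dict(scores)


# Two citing documents that both cite A, B and C.
doc1 = [("A", (1, 1, 1)), ("B", (1, 1, 1)), ("C", (3, 2, 1))]
doc2 = [("A", (1, 1, 1)), ("B", (1, 1, 2)), ("C", (1, 2, 1))]
scores = cpa_scores([doc1, doc2])
for pair, score in sorted(scores.items()):
    print(pair, score)
```

Plain co-citation counting would give every pair in this toy input the same score of 2, because each pair is co-cited in both documents. With the assumed weights, the pair (A, B) scores higher than (A, C) and (B, C), because A and B are cited in the same sentence or paragraph.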

Compared with existing approaches such as bibliographic coupling, co-citation analysis or keyword-based similarity computations, CPA achieves higher precision and can pinpoint related sections within the text of academic documents. CPA also allows more precise automatic document classification.

Related publications

[1] Bela Gipp and Joeran Beel. Citation Proximity Analysis (CPA) – A new approach for identifying related work based on Co-Citation Analysis. In Birger Larsen and Jacqueline Leta, editors, Proceedings of the 12th International Conference on Scientometrics and Informetrics (ISSI’09), volume 2, pages 571-575, Rio de Janeiro (Brazil), July 2009. International Society for Scientometrics and Informetrics. ISSN 2175-1935. Available at http://gipp.com/pub
[Bibtex]
@INPROCEEDINGS{Gipp09a,
author = {Gipp, Bela and Beel, Joeran},
title = {{C}itation {P}roximity {A}nalysis ({CPA}) - {A} new approach for identifying related work based on {C}o-{C}itation {A}nalysis},
booktitle = {Proceedings of the 12th International Conference on Scientometrics and Informetrics (ISSI'09)},
year = {2009},
editor = {Birger Larsen and Jacqueline Leta},
volume = {2},
pages = {571--575},
address = {Rio de Janeiro (Brazil)},
month = jul,
publisher = {International Society for Scientometrics and Informetrics},
note = {ISSN 2175-1935. Available at http://gipp.com/pub}
}
[2] Bela Gipp and Joeran Beel. Identifying Related Documents For Research Paper Recommender By CPA And COA. In S. I. Ao, C. Douglas, W. S. Grundfest, and J. Burgstone, editors, Proceedings of The World Congress on Engineering and Computer Science 2009, volume 1 of Lecture Notes in Engineering and Computer Science, pages 636-639, Berkeley (USA), October 2009. International Association of Engineers (IAENG), Newswood Limited. Available at http://gipp.com/pub
[Bibtex]
@INPROCEEDINGS{Gipp09c,
author = {Bela Gipp and Joeran Beel},
title = {{I}dentifying {R}elated {D}ocuments {F}or {R}esearch {P}aper {R}ecommender {B}y {CPA} {A}nd {COA}},
booktitle = {Proceedings of The World Congress on Engineering and Computer Science 2009},
year = {2009},
editor = {S. I. Ao and C. Douglas and W. S. Grundfest and J. Burgstone},
volume = {1},
series = {Lecture Notes in Engineering and Computer Science},
pages = {636--639},
address = {Berkeley (USA)},
month = oct,
organization = {International Association of Engineers (IAENG)},
publisher = {Newswood Limited},
note = {Available at http://gipp.com/pub},
isbn = {978-988-17012-6-8}
}
[3] Bela Gipp. Measuring Document Relatedness by Citation Proximity Analysis and Citation Order Analysis. In M. Lalmas, J. Jose, A. Rauber, F. Sebastiani, and I. Frommholz, editors, Proceedings of the 14th European Conference on Digital Libraries (ECDL’10): Research and Advanced Technology for Digital Libraries, volume 6273 of Lecture Notes in Computer Science (LNCS). Springer, September 2010. Available at http://gipp.com/pub
[Bibtex]
@INPROCEEDINGS{Gipp10d,
author = {Bela Gipp},
title = {{M}easuring {D}ocument {R}elatedness by {C}itation {P}roximity {A}nalysis and {C}itation {O}rder {A}nalysis},
booktitle = {Proceedings of the 14th European Conference on Digital Libraries (ECDL'10): Research and Advanced Technology for Digital Libraries},
year = {2010},
editor = {M. Lalmas and J. Jose and A. Rauber and F. Sebastiani and I. Frommholz},
volume = {6273},
series = {Lecture Notes in Computer Science (LNCS)},
month = sep,
publisher = {Springer},
note = {Available at http://gipp.com/pub}
}