GSoC 2026 Introduction: PostgreSQL as Full GUI Data Backend (Issue #12708)
Hello , @koppor, @InAnYan and the JabRef community,
My name is Aritra Banerjee, and I am preparing my GSoC 2026 proposal for migrating JabRef’s main table data layer to PostgreSQL.
My background heavily involves bridging complex data streams into Java Spring Boot objects, and I have engineered low-level data structures in C from scratch. This gives me a strict appreciation for the memory allocation constraints currently bottlenecking JabRef’s in-memory model, and I understand exactly why loading entire libraries into RAM is no longer scalable.
I have reviewed the BibDatabaseContext and the current data architecture, and I am currently drafting my proposal around two core technical pillars:
-
Hybrid Database Schema: To handle the dynamic, user-defined nature of
BibTeXfields without breaking relational structure, I am designing a schema that utilizes standard columns for core metadata alongside PostgreSQL’sJSONBformat for robust, indexable custom field storage. -
Keyset Pagination (Cursor-based): To ensure the JavaFX UI remains perfectly smooth and memory-efficient, the Java DAO layer will utilize keyset pagination rather than
OFFSET/LIMIT, guaranteeing that fetching the 10,000th citation is as fast as the first. I plan to containerize the local PostgreSQL testing environment to ensure clean development.
I am finalizing the architectural diagrams and milestones in my proposal document now, and I will share the link in this thread very shortly for your feedback.
I look forward to contributing!
