Spaces:
Sleeping
Sleeping
metadata
title: PaperIndex
emoji: π
colorFrom: blue
colorTo: purple
sdk: docker
sdk_version: latest
app_file: app.py
pinned: false
Paper Index - AI Paper Evaluation System
A comprehensive system for evaluating AI research papers using advanced language models.
Features
- Daily Paper Crawling: Automatically fetches papers from Hugging Face daily
- AI Evaluation: Uses Claude Sonnet to evaluate papers across multiple dimensions
- Interactive Dashboard: Beautiful web interface for browsing and evaluating papers
- Database Storage: Persistent storage of papers and evaluations
- Smart Navigation: Intelligent date navigation with fallback mechanisms
Hugging Face Spaces Deployment
This application is configured for deployment on Hugging Face Spaces.
Configuration
- Port: 7860 (Hugging Face Spaces standard)
- Health Check:
/api/healthendpoint - Docker: Optimized Dockerfile for containerized deployment
Deployment Steps
- Fork/Clone this repository to your Hugging Face account
- Create a new Space on Hugging Face
- Select Docker as the SDK
- Set Environment Variables:
ANTHROPIC_API_KEY: Your Anthropic API key for Claude access
- Deploy: The Space will automatically build and deploy
Environment Variables
ANTHROPIC_API_KEY=your_api_key_here
PORT=7860 # Optional, defaults to 7860
Local Development
Prerequisites
- Python 3.9+
- Anthropic API key
Installation
Clone the repository:
git clone <repository-url> cd paperindexInstall dependencies:
pip install -r requirements.txtSet environment variables:
export ANTHROPIC_API_KEY=your_api_key_hereRun the application:
python app.pyAccess the application:
- Main interface: http://localhost:7860
- API documentation: http://localhost:7860/docs
API Endpoints
Core Endpoints
GET /api/daily- Get daily papers with smart navigationGET /api/paper/{paper_id}- Get paper detailsGET /api/eval/{paper_id}- Get paper evaluationGET /api/health- Health check endpoint
Evaluation Endpoints
POST /api/papers/evaluate/{arxiv_id}- Start paper evaluationGET /api/papers/evaluate/{arxiv_id}/status- Get evaluation status
Cache Management
GET /api/cache/status- Get cache statisticsPOST /api/cache/clear- Clear all cached dataPOST /api/cache/refresh/{date}- Refresh cache for specific date
Architecture
Frontend
- HTML/CSS/JavaScript: Modern, responsive interface
- Real-time Updates: Dynamic content loading
- Theme Support: Light/dark mode toggle
Backend
- FastAPI: High-performance web framework
- SQLite: Lightweight database for paper storage
- Async Processing: Background evaluation tasks
- Caching: Intelligent caching system for performance
AI Integration
- Claude Sonnet: Advanced paper evaluation
- Multi-dimensional Analysis: Comprehensive evaluation criteria
- Structured Output: JSON-based evaluation results
Database Schema
Papers Table
CREATE TABLE papers (
arxiv_id TEXT PRIMARY KEY,
title TEXT NOT NULL,
authors TEXT NOT NULL,
abstract TEXT,
categories TEXT,
published_date TEXT,
evaluation_content TEXT,
evaluation_score REAL,
overall_score REAL,
evaluation_tags TEXT,
evaluation_status TEXT DEFAULT 'not_started',
is_evaluated BOOLEAN DEFAULT FALSE,
evaluation_date TIMESTAMP,
created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);
Evaluation Dimensions
The system evaluates papers across 12 key dimensions:
- Task Formalization - Clarity of problem definition
- Data & Resource Availability - Access to required data
- Input-Output Complexity - Complexity of inputs/outputs
- Real-World Interaction - Practical applicability
- Existing AI Coverage - Current AI capabilities
- Automation Barriers - Technical challenges
- Human Originality - Creative contribution
- Safety & Ethics - Responsible AI considerations
- Societal/Economic Impact - Broader implications
- Technical Maturity Needed - Development requirements
- 3-Year Feasibility - Short-term potential
- Overall Automatability - Comprehensive assessment
Contributing
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests if applicable
- Submit a pull request
License
This project is licensed under the MIT License - see the LICENSE file for details.