Spaces:

Shreyas094
/

GPT-Researcher

Running

App Files Files Community

GPT-Researcher / CURSOR_RULES.md

Shreyas094

Upload 528 files

372531f verified about 1 month ago

preview code

raw

history blame contribute delete

6.43 kB

	> Note: This is a readable copy of the `.cursorrules` file maintained for legibility. The actual rules are implemented from the `.cursorrules` file in the root directory.

	# GPT-Researcher Cursor Rules

	## Project Overview
	This project, named GPT-Researcher, is an LLM-based autonomous agent that conducts local and web research on any topic and generates a comprehensive report with citations. It is built using Next.js and TypeScript, integrating various libraries for their strengths.

	Your primary goal is to help with:
	- Next.js app router patterns
	- TypeScript type safety
	- Tailwind CSS best practices
	- Code quality standards
	- Python/FastAPI backend optimizations

	## Key URLs
	- Project Home Page: https://gptr.dev/
	- GitHub Repository: https://github.com/assafelovic/gpt-researcher
	- Documentation: https://docs.gptr.dev/

	## Project Structure
	- Frontend user interface built with Next.js, TypeScript, and Tailwind CSS in `/frontend`
	- Static FastAPI version for lightweight deployments
	- Next.js version for production use with enhanced features

	- Multi-agent research system using LangChain and LangGraph in `/backend/multi_agents`
	- Browser, Editor, Researcher, Reviewer, Revisor, Writer, and Publisher agents
	- Task configuration and agent coordination

	- Document processing using Unstructured and PyMuPDF in `/backend/document_processing`
	- PDF, DOCX, and web content parsing
	- Text extraction and preprocessing

	- Report generation using LangChain and Jinja2 templates in `/backend/report_generation`
	- Template-based report structuring
	- Dynamic content formatting

	- Multiple output formats in `/backend/output_formats`
	- PDF via md2pdf
	- Markdown via mistune
	- DOCX via python-docx
	- Format conversion utilities
	- Export functionality

	- GPT Researcher core functionality in `/gpt_researcher`
	- Web scraping and content aggregation
	- Research planning and execution
	- Source validation and tracking
	- Query processing and response generation

	- Testing infrastructure in `/tests`
	- Unit tests for individual components
	- Integration tests for agent interactions
	- End-to-end research workflow tests
	- Mock data and fixtures for testing

	## Language Model Configuration
	- Default model: gpt-4-turbo
	- Alternative models: gpt-3.5-turbo, claude-3-opus
	- Temperature settings for different tasks
	- Context window management
	- Token limit handling
	- Cost optimization strategies

	## Error Handling
	- Research failure recovery
	- API rate limiting
	- Network timeout handling
	- Invalid input management
	- Source validation errors
	- Report generation failures

	## Performance
	- Parallel processing strategies
	- Caching mechanisms
	- Memory management
	- Response streaming
	- Resource allocation
	- Query optimization

	## Development Workflow
	- Branch naming conventions
	- Commit message format
	- PR review process
	- Testing requirements
	- Documentation updates
	- Version control guidelines

	## API Documentation
	- REST endpoints
	- WebSocket events
	- Request/Response formats
	- Authentication methods
	- Rate limits
	- Error codes

	## Monitoring
	- Performance metrics
	- Error tracking
	- Usage statistics
	- Cost monitoring
	- Research quality metrics
	- User feedback tracking

	## Frontend Components
	- Static FastAPI version for lightweight deployments
	- Next.js version for production use with enhanced features

	## Backend Components
	- Multi-agent system architecture
	- Document processing pipeline
	- Report generation system
	- Output format handlers

	## Core Research Components
	- Web scraping and aggregation
	- Research planning and execution
	- Source validation
	- Query processing

	## Testing
	- Unit tests
	- Integration tests
	- End-to-end tests
	- Performance testing

	## Rule Violation Monitoring
	- Alert developer when changes conflict with project structure
	- Warn about deviations from coding standards
	- Flag unauthorized framework or library additions
	- Monitor for security and performance anti-patterns
	- Track API usage patterns that may violate guidelines
	- Report TypeScript strict mode violations
	- Identify accessibility compliance issues

	## Development Guidelines
	- Use TypeScript with strict mode enabled
	- Follow ESLint and Prettier configurations
	- Ensure components are responsive and accessible
	- Use Tailwind CSS for styling, following the project's design system
	- Minimize AI-generated comments, prefer self-documenting code
	- Follow React best practices and hooks guidelines
	- Validate all user inputs and API responses
	- Use existing components as reference implementations

	## Important Scripts
	- `npm run dev`: Start development server
	- `npm run build`: Build for production
	- `npm run test`: Run test suite
	- `python -m pytest`: Run Python tests
	- `python -m uvicorn backend.server.server:app --host=0.0.0.0 --port=8000`: Start FastAPI server
	- `python -m uvicorn backend.server.server:app --reload`: Start FastAPI server with auto-reload for development
	- `python main.py`: Run the main application directly
	- `docker-compose up`: Start all services
	- `docker-compose run gpt-researcher-tests`: Run test suite in container

	## AI Integration Guidelines
	- Prioritize type safety in all AI interactions
	- Follow LangChain and LangGraph best practices
	- Implement proper error handling for AI responses
	- Maintain context window limits
	- Handle rate limiting and API quotas
	- Validate AI outputs before processing
	- Log AI interactions for debugging

	## Lexicon
	- GPT Researcher: Autonomous research agent system
	- Multi-Agent System: Coordinated AI agents for research tasks
	- Research Pipeline: End-to-end research workflow
	- Agent Roles: Browser, Editor, Researcher, Reviewer, Revisor, Writer, Publisher
	- Source Validation: Verification of research sources
	- Report Generation: Process of creating final research output

	## Additional Resources
	- [Next.js Documentation](https://nextjs.org/docs)
	- [TypeScript Handbook](https://www.typescriptlang.org/docs/)
	- [Tailwind CSS Documentation](https://tailwindcss.com/docs)
	- [LangChain Documentation](https://python.langchain.com/docs/)
	- [FastAPI Documentation](https://fastapi.tiangolo.com/)
	- [Project Documentation](https://docs.gptr.dev/)

	_Note: End all your comments with a :-) symbol._