# Project NETRA
An AI‑powered financial intelligence platform for detecting and investigating suspicious activity across accounts, persons, and companies.
Watch the demo on YouTube: https://youtu.be/r_G-eIlJKkU
Project NETRA provides a unified workflow for ingesting datasets (CSV/ZIP), calculating hybrid risk scores, inspecting networks, and generating AI‑assisted PDF reports. It ships with synthetic datasets and lets investigators upload data from the UI.
Highlights:
- Hybrid risk scoring (rules) with a CSV pipeline; AlertScores.csv is the canonical output.
- Upload your own CSVs or a ZIP of CSVs from Settings; schema validation and re‑analysis run automatically.
- Graph view uses Neo4j when available; otherwise, a small network is synthesized from CSVs.
- PDF report generation; the endpoint accepts either a person_id or a case_id.
- Optional AI summary via Google Gemini with a deterministic local fallback.
- Authentication is token‑based; mock tokens are supported for local/demo.
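As a quick illustration of the CSV-driven alert pipeline, the sketch below filters rows from an `AlertScores.csv`-style file against a threshold. The column names (`person_id`, `risk_score`) are assumptions for illustration, not the project's documented schema.

```python
import csv
import io

# Hypothetical sketch: filter alert rows by score. Column names are assumed.
def load_alerts(csv_text, threshold=10.0):
    """Return rows whose risk_score meets the alert threshold."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row for row in reader if float(row["risk_score"]) >= threshold]

sample = "person_id,risk_score\nP001,42.5\nP002,3.1\n"
alerts = load_alerts(sample, threshold=10.0)
# alerts == [{"person_id": "P001", "risk_score": "42.5"}]
```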
Architecture overview:

```mermaid
flowchart LR
    %% Client Layer
    subgraph Client
        U[User]
        FE[Frontend - React + Vite]
    end

    %% Backend API and Services
    subgraph Backend
        API[REST API]
        RS[risk_scoring.py]
        RG[report_generator.py]
        GA[graph_analysis.py]
        CM[case_manager.py]
        AS[ai_summarizer.py]
    end

    %% Data Sources
    subgraph Data["Data Sources"]
        CSV[CSV files<br/>backend/generated-data]
        NEO[Neo4j optional]
        FS[Firestore optional]
    end

    %% Client -> API
    U -->|actions| FE
    FE -->|Fetch JSON or PDF| API
    FE -->|Upload ZIP or CSV| API

    %% API -> Services
    API --> RS
    API --> RG
    API --> GA
    API --> CM
    API --> AS

    %% Services <-> Data
    RS -->|read CSVs| CSV
    RS -->|write AlertScores.csv| CSV
    GA --> NEO
    CM --> FS

    %% Responses
    RG -->|PDF| FE
    API -->|JSON| FE
```
Typical request flow:

```mermaid
sequenceDiagram
    participant User
    participant FE as Frontend (React)
    participant API as Flask API (/api)
    participant Risk as Risk Scoring
    participant Graph as Graph Analysis
    participant Report as Report Generator
    participant Store as Firestore (optional)
    participant Neo4j as Neo4j (optional)
    participant CSV as CSV Data

    User->>FE: Open app / Login
    FE->>API: GET /alerts (Bearer token)
    API->>Risk: Load scores
    Risk->>CSV: Read AlertScores.csv
    API-->>FE: Alerts JSON

    User->>FE: Upload dataset (CSV/ZIP)
    FE->>API: POST /datasets/upload
    API->>CSV: Replace files
    API->>Risk: Re-run analysis
    API-->>FE: Upload OK

    User->>FE: Create case
    FE->>API: POST /cases
    API->>Store: Create/Update case (if configured)
    API-->>FE: Case created (caseId)

    User->>FE: Investigate person/case
    FE->>API: GET /graph/:personId
    API->>Graph: Build graph
    Graph->>Neo4j: Query (if available)
    Graph->>CSV: Synthesize fallback
    API-->>FE: Graph JSON

    User->>FE: Generate report
    FE->>API: GET /report/:id
    API->>Report: Compile PDF
    Report->>Risk: Pull scores
    Report->>CSV: Fetch details
    API-->>FE: PDF (blob)
    FE-->>User: Render views / Download report
```
Frontend user flow:

```mermaid
flowchart TD
    Start([User opens app]) --> AuthCheck[Auth check via AuthProvider]
    AuthCheck -->|Authenticated| GoDashboard[Route to /dashboard]
    AuthCheck -->|No auth| GoLogin[Route to /login]

    subgraph Dashboard
        GoDashboard --> FetchAlerts[GET /alerts]
        FetchAlerts --> ShowAlerts[Render alerts & metrics]
        ShowAlerts --> ActionTriage[Open Triage]
        ShowAlerts --> ActionInvestigate[Open Investigation]
        ShowAlerts --> ActionReporting[Open Reporting]
    end

    subgraph Triage
        ActionTriage --> CreateCase[POST /cases]
        CreateCase --> CaseCreated[(caseId)]
        CaseCreated --> NavWorkspace[Go to /workspace/:caseId]
    end

    subgraph Investigation_Workspace
        ActionInvestigate --> LoadGraph[GET /graph/:personId]
        NavWorkspace --> LoadGraph
        LoadGraph --> ViewGraph[React Flow graph + details]
        ViewGraph --> UpdateNotes[PUT /cases/:id/notes]
        UpdateNotes --> NotesSaved[Notes saved - Firestore or local]
    end

    subgraph Reporting
        ActionReporting --> GetReport[GET /report/:id]
        GetReport --> PDF[PDF blob]
        PDF --> Download[Trigger download]
    end

    subgraph Settings_and_Datasets
        Settings[Open Settings] --> Upload[POST /datasets/upload - CSV or ZIP]
        Upload --> Reanalyze[Run analysis]
        Reanalyze --> AlertsUpdated[Updated AlertScores.csv]
        AlertsUpdated --> FetchAlerts
    end

    GoLogin --> LoginFlow[Login - Firebase or mock token]
    LoginFlow --> AuthCheck
```
Backend (Flask):
- Data loader with schema validation (CSVs under `backend/generated-data/`).
- Risk scoring (`services/risk_scoring.py`), AI summarizer, report generator, optional graph analyzer.
- Case management with Firebase Firestore if configured; otherwise, a local JSON fallback.
Frontend (React + Vite):
- Centralized API client with envelope unwrapping and a token provider.
- Pages: Dashboard, Triage, Investigation Workspace, Reporting, and Settings.
- Settings provides dataset uploads (CSV/ZIP) and a metadata view.
Data generation:
- `data-generation/generate_data.py` produces CSVs into `backend/generated-data/`.
- Optional Neo4j loading via scripts in `backend/data-generation/`.
Prerequisites: Python 3.10+, Node 18+.
Backend:
- `cd backend`
- `python -m venv venv` (Windows: `venv\Scripts\activate`, macOS/Linux: `source venv/bin/activate`)
- `pip install -r requirements.txt`
- Set environment variables (optional but recommended):
  - `GEMINI_API_KEY` for AI summaries (otherwise a rule‑based fallback is used)
  - `FRONTEND_URL` for CORS (e.g., http://localhost:5173)
  - `FIREBASE_CREDENTIALS` or `GOOGLE_APPLICATION_CREDENTIALS` if using Firestore
- Run: `python app.py` (serves at http://localhost:5001)
Frontend:
- `cd frontend`
- `npm install`
- Optional: set `VITE_API_URL` to your backend API base (e.g., http://localhost:5001/api). If unset, it auto‑detects localhost and uses `http://localhost:5001/api`.
- `npm run dev` (http://localhost:5173)
Authentication (local/mock):
- The backend accepts `Authorization: Bearer mock-jwt-token-12345`.
- The frontend includes a mock token provider in development to call protected APIs.
Upload via the UI: Settings → Data Management.
- ZIP upload: include any of these exact filenames (case‑insensitive):
  - Persons.csv, BankAccounts.csv, Transactions.csv, Companies.csv, Directorships.csv, Properties.csv, PoliceCases.csv
- Single CSV upload: choose which dataset it represents (dropdown in the UI).
- After upload, the server reloads all CSVs and re‑runs analysis to regenerate alerts.
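The case-insensitive filename check described above could look roughly like the sketch below. This is a guess at the server-side behavior, not the actual implementation; `recognized_datasets` is a hypothetical helper.

```python
import io
import zipfile

# Hypothetical sketch of the case-insensitive filename matching; the
# server's actual validation may differ.
EXPECTED = {
    "persons.csv", "bankaccounts.csv", "transactions.csv", "companies.csv",
    "directorships.csv", "properties.csv", "policecases.csv",
}

def recognized_datasets(zip_bytes):
    """Return the expected dataset filenames present in an uploaded ZIP."""
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as zf:
        # Compare only the basename, lowercased, so nested paths still match.
        names = {name.rsplit("/", 1)[-1].lower() for name in zf.namelist()}
    return sorted(EXPECTED & names)

# Build a tiny in-memory ZIP to exercise the check.
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as zf:
    zf.writestr("Persons.csv", "person_id,full_name\n")
    zf.writestr("Transactions.csv", "transaction_id\n")
found = recognized_datasets(buf.getvalue())  # ["persons.csv", "transactions.csv"]
```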
Schemas (minimal required columns):
- persons: person_id, full_name, dob, pan_number, address, monthly_salary_inr, tax_filing_status
- accounts: account_number, owner_id, bank_name, account_type, open_date, balance_inr
- transactions: transaction_id, from_account, to_account, amount_inr, timestamp, payment_mode, remarks
- companies: cin, company_name, registered_address, incorporation_date, company_status, paid_up_capital_inr
- directorships: directorship_id, person_id, cin, appointment_date
- properties: property_id, owner_person_id, property_address, purchase_date, purchase_value_inr
- cases: case_id, person_id, case_details, case_date, status
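A minimal sketch of header validation against the required columns listed above. Only two of the seven schemas are shown, and the project's real data loader (under `backend/utils/`) may validate differently.

```python
import csv
import io

# Required columns copied from the schema list above (two datasets shown).
REQUIRED = {
    "persons": ["person_id", "full_name", "dob", "pan_number", "address",
                "monthly_salary_inr", "tax_filing_status"],
    "transactions": ["transaction_id", "from_account", "to_account",
                     "amount_inr", "timestamp", "payment_mode", "remarks"],
}

def missing_columns(dataset, csv_text):
    """Return required columns absent from the CSV header row."""
    header = next(csv.reader(io.StringIO(csv_text)))
    return [col for col in REQUIRED[dataset] if col not in header]
```

For example, a `transactions` CSV whose header has only `transaction_id,from_account` would report the remaining five required columns as missing.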
Sample data that triggers alerts:
- See `samples/` at the repo root (ready‑named CSVs) or `frontend/public/samples/` for individual examples and a README.
Base path: `/api`.
- GET `/alerts` → list alerts (reads `AlertScores.csv`).
- GET `/persons?q=<query>` → search persons.
- GET `/investigate/<person_id>` → risk breakdown for a person.
- GET `/graph/<person_id>` → network; synthesizes from CSVs if Neo4j is empty or unavailable.
- GET `/report/<person_or_case_id>` → PDF report; accepts person_id or case_id.
- POST `/cases` → create a case; body must include `person_id` or embed it in `risk_profile.person_details`.
- GET `/cases` → list cases (Firestore or local fallback).
- POST `/datasets/upload` → upload CSV/ZIP; validates, reloads, and reruns analysis.
- GET `/datasets/metadata` → seed/snapshot/counts (if `metadata.json` is present; counts are derived regardless).
- POST `/run-analysis` (or `?sync=1`) and GET `/run-analysis/status` → control risk analysis.
- Settings: `/settings/profile`, `/settings/api-key`, `/settings/theme`, `/settings/regenerate-data`, `/settings/clear-cases`.
Auth:
- All protected routes use `Authorization: Bearer <token>`.
- Mock token accepted: `mock-jwt-token-12345`.
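To call a protected endpoint with the mock token, a request can be built as below. This assumes the quick-start backend is running at `http://localhost:5001`; `authed_get` is a hypothetical helper, not part of the project.

```python
import urllib.request

# Assumed defaults from the quick-start instructions.
API_BASE = "http://localhost:5001/api"
MOCK_TOKEN = "mock-jwt-token-12345"

def authed_get(path):
    """Build a GET request carrying the Authorization header."""
    return urllib.request.Request(
        f"{API_BASE}{path}",
        headers={"Authorization": f"Bearer {MOCK_TOKEN}"},
    )

req = authed_get("/alerts")
# urllib.request.urlopen(req) would return the alerts JSON once the API is up.
```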
- The Reporting page downloads a PDF via `/api/report/<id>`. You can pass a caseId or personId.
- The PDF score is harmonized with `AlertScores.csv` to match the dashboard.
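The dual-id behavior of `/report/<id>` can be sketched as follows. This is illustrative only; the actual resolution happens server-side, and the dicts here stand in for the CSV and case data.

```python
# Hypothetical resolver: accept either a person_id or a case_id.
def resolve_person_id(report_id, cases, persons):
    """Return a person_id given either a person_id or a case_id."""
    if report_id in persons:
        return report_id
    case = cases.get(report_id, {})
    person_id = case.get("person_id")
    if person_id in persons:
        return person_id
    raise KeyError(f"Person ID not found for {report_id!r}")

# Stand-in data shaped like the CSV/case records.
persons = {"P001": {"full_name": "Example Person"}}
cases = {"C100": {"person_id": "P001"}}
```

With this data, both `resolve_person_id("P001", cases, persons)` and `resolve_person_id("C100", cases, persons)` return `"P001"`.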
- `data-generation/generate_data.py` writes CSVs to `backend/generated-data/`.
- From the Settings page, you can trigger regeneration.
- If using Neo4j, see `backend/data-generation/load_to_neo4j.py` for loading.
```
project-NETRA/
├── backend/
│   ├── app.py              # Flask API (CORS, endpoints, uploads, reports)
│   ├── services/           # risk_scoring, report_generator, graph_analysis, case_manager, ai_summarizer
│   ├── utils/              # data_loader (schemas), auth (mock/real)
│   └── generated-data/     # CSVs + AlertScores.csv (+ metadata.json if present)
├── frontend/
│   ├── src/pages/          # Dashboard, Triage, Investigation, Reporting, Settings
│   ├── src/services/api.js # API base resolver + token provider + endpoints
│   └── public/samples/     # Downloadable sample CSVs
├── data-generation/        # generate_data.py, patterns.py (synthetic data)
├── samples/                # Ready‑named CSVs to ZIP & upload (alerts guaranteed)
└── README.md
```
Backend environment:
- `FRONTEND_URL` (for CORS), e.g., http://localhost:5173
- `GEMINI_API_KEY` (optional): for AI summaries
- `FIREBASE_CREDENTIALS` (JSON or base64 JSON) or `GOOGLE_APPLICATION_CREDENTIALS` (path) for Firestore
- `RISK_ALERT_THRESHOLD` (optional, default 10)
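Since `FIREBASE_CREDENTIALS` may hold either raw JSON or base64-encoded JSON, the backend presumably normalizes it along these lines. This is an assumption about the loading logic, not the project's actual code.

```python
import base64
import json

# Hypothetical sketch: accept a service-account payload supplied either as
# raw JSON or as base64-encoded JSON.
def load_credentials(value):
    """Parse FIREBASE_CREDENTIALS given as JSON or base64 JSON."""
    try:
        return json.loads(value)
    except json.JSONDecodeError:
        decoded = base64.b64decode(value).decode("utf-8")
        return json.loads(decoded)
```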
Frontend environment:
- `VITE_API_URL` (optional): override API base; otherwise auto‑detects localhost (`http://localhost:5001/api`) or uses same‑origin `/api` in production
- `VITE_USE_MOCK_AUTH` (optional): use mock auth in local development
- Graph view gracefully falls back to synthesized edges if Neo4j is not available.
- If reports fail with “Person ID not found,” ensure the case points to a person present in the current CSVs; the report endpoint also resolves the person from a caseId.
- Fork the repository.
- Create a feature branch (`git checkout -b feature/your-change`).
- Commit and push.
- Open a pull request.
| Anurag Waskle | Soham S. Malvankar | Harshit Kushwaha | Aryan Pandey | Deepti Singh |
|---|---|---|---|---|
MIT © Project NETRA contributors