FDA Orange Book - Official listing of approved drug products with therapeutic equivalence evaluations
~ (tilde) delimiterThe Orange Book data includes:
patent.txt - Patent information for approved drugsproducts.txt - Approved drug productsexclusivity.txt - Market exclusivity informationdiscontinued.txt - Discontinued drug products (optional)~ (tilde character)patent.txt ColumnsAppl_Type~Appl_No~Product_No~Patent_No~Patent_Expire_Date_Text~Drug_Substance_Flag~Drug_Product_Flag~Patent_Use_Code~Delist_Flag~Submission_Date
Key columns:
Patent_No: USPTO patent number (7-digit format, e.g., 7625884) ✅ Matches USPTO formatAppl_Type: Application type (N = NDA, A = ANDA)Appl_No: Application number (part of composite key)Product_No: Product number (part of composite key)Patent_Expire_Date_Text: Patent expiration dateDrug_Substance_Flag: Y = chemical compound patent, blank = noDrug_Product_Flag: Y = formulation patent, blank = noPatent_Use_Code: Use code (e.g., U-141, U-986)products.txt ColumnsIngredient~DF;Route~Trade_Name~Applicant~Strength~Appl_Type~Appl_No~Product_No~TE_Code~Approval_Date~RLD~RS~Type~Applicant_Full_Name
Key columns:
Ingredient: Active ingredient name (e.g., BUDESONIDE, MINOCYCLINE HYDROCHLORIDE) ⭐ This maps to genesTrade_Name: Brand nameAppl_Type: Application type (composite key)Appl_No: Application number (composite key)Product_No: Product number (composite key)Approval_Date: FDA approval dateApplicant_Full_Name: Company nameexclusivity.txt ColumnsAppl_Type~Appl_No~Product_No~Exclusivity_Code~Exclusivity_Date
Key columns:
Appl_Type: Application type (composite key)Appl_No: Application number (composite key)Product_No: Product number (composite key)Exclusivity_Code: Type of exclusivity (e.g., RTO, D-193, I-975)Exclusivity_Date: Exclusivity expiration dateAll three files link via: Appl_Type + Appl_No + Product_No
Products (ingredients, trade names, approval dates)
↓ [Appl_Type + Appl_No + Product_No]
Patents (patent numbers, expiration dates)
↓ [Appl_Type + Appl_No + Product_No]
Exclusivity (exclusivity periods)
patent.txt:products.txt:Appl_Type + Appl_No + Product_NoIngredient column of products.txtIngredient → GeneApproval_Date? (e.g., “Apr 12, 2023” or “2023-04-12”)Ready to run exploration:
patent.txtproducts.txtexclusivity.txtcd /home/user/cguth7.github.io/data/orange_book/
python explore_orange_book.py
~ delimiterOnce exploration is complete, we can:
Ingredient → Gene mapping strategy