Protein Info for mRNA_8313 in Rhodosporidium toruloides IFO0880

Name: 16681
Annotation: K11541 URA2 carbamoyl-phosphate synthase / aspartate carbamoyltransferase

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 200 400 600 800 1000 1200 1400 1600 1800 2000 2253 transmembrane" amino acids 2115 to 2135 (21 residues), see Phobius details amino acids 2234 to 2252 (19 residues), see Phobius details PF00988: CPSase_sm_chain" amino acids 2 to 143 (142 residues), 152.2 bits, see alignment 3.9e-48 TIGR01368: carbamoyl-phosphate synthase, small subunit" amino acids 2 to 391 (390 residues), 431.1 bits, see alignment E=4.7e-133 PF00117: GATase" amino acids 211 to 387 (177 residues), 151.2 bits, see alignment E=1.8e-47 TIGR01369: carbamoyl-phosphate synthase, large subunit" amino acids 422 to 1486 (1065 residues), 1477.7 bits, see alignment E=0 PF02786: CPSase_L_D2" amino acids 557 to 760 (204 residues), 281.6 bits, see alignment E=2.2e-87 amino acids 1095 to 1292 (198 residues), 96.3 bits, see alignment E=1.2e-30 PF02222: ATP-grasp" amino acids 567 to 710 (144 residues), 27.2 bits, see alignment 1.6e-09 amino acids 1102 to 1264 (163 residues), 32 bits, see alignment 5.6e-11 PF07478: Dala_Dala_lig_C" amino acids 582 to 728 (147 residues), 27.4 bits, see alignment 1.5e-09 amino acids 1115 to 1262 (148 residues), 36.6 bits, see alignment 2.2e-12 PF02787: CPSase_L_D3" amino acids 846 to 967 (122 residues), 135.3 bits, see alignment 8.7e-43 PF02142: MGS" amino acids 1374 to 1474 (101 residues), 69.5 bits, see alignment (E = 1.4e-22) PF02729: OTCace_N" amino acids 1950 to 2091 (142 residues), 151.9 bits, see alignment 8.3e-48 TIGR00670: aspartate carbamoyltransferase" amino acids 1950 to 2248 (299 residues), 337.4 bits, see alignment E=1e-104 PF00185: OTCace" amino acids 2098 to 2247 (150 residues), 98.3 bits, see alignment 2.9e-31

Best Hits

KEGG orthology group: K11541, carbamoyl-phosphate synthase / aspartate carbamoyltransferase [EC: 2.1.3.2 6.3.5.5] (inferred from 68% identity to afm:AFUA_6G11310)

Predicted SEED Role

"Carbamoyl-phosphate synthase large chain (EC 6.3.5.5) / Carbamoyl-phosphate synthase small chain (EC 6.3.5.5) / Aspartate carbamoyltransferase (EC 2.1.3.2) / Dihydroorotase (EC 3.5.2.3)" in subsystem De Novo Pyrimidine Synthesis or Macromolecular synthesis operon (EC 2.1.3.2, EC 3.5.2.3, EC 6.3.5.5)

MetaCyc Pathways

KEGG Metabolic Maps

Isozymes

Compare fitness of predicted isozymes for: 3.5.2.3, 6.3.5.5

Use Curated BLAST to search for 2.1.3.2 or 3.5.2.3 or 6.3.5.5

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Search structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (2253 amino acids)

>mRNA_8313 K11541 URA2 carbamoyl-phosphate synthase / aspartate carbamoyltransferase (Rhodosporidium toruloides IFO0880)
MLLELADGSAYEGYSFGADKSIAGECVFTTGMVGYPESLTDPSYSGQILVLTYPLIGSYG
VPKRPAKGTVNPASLPHEFESYKIHVAGLIVSNYSGAGDDFSNHSAFSELADWLKEEGIP
AIYGVDTRDLTKKLRERGSTIGRLSQRKKGVEVEEGAFNWREAYEQLEYHDPNADNLVAA
VSTPQPVLYPADSGVSPLKHTSGRNLRVICVDVGMKYNQIRCFTKRGVELKVVPYDYDYL
SPNEEPFDGLFVSNGPGDPATLKSTTAILAKAIEAAKVPIFGICLGHQLLARASGASTLK
LKYGNRGANIPCTSLVTGRCYITSQNHGYAVDTQTLQQGWKELFINANDSSNEGIYSTER
PYFSVQFHPESAPGPRDTEFLFDTFISAIQKSLEKQASGEKLTGPVVFPGGDAAENAAKT
PRVDVRKVLVLGSGGLSIGQAGEFDYSGSQAIKALKEEGIYTILINRELERVVDARPPAA
NIATIQTSKGLADKVYFLPVTADSVRKVIQHERPDGIYCTFGGQTALQVGIKLKDEFEGL
GVKVLGTPIETIITTEDRELFAQAMEQIGEKCAKSAAANSVQECIDAANDIGYPVIVRAA
FALGGLGSGFANNEKELVELCTKAFATSSQVLVERSMKGWKEIEYEVVRDCRDNCITVCN
MENFDPLGIHTGDSIVVAPSQTLTDSDYNMLRTTAVNVIRHLGVVGECNIQYALNPHSQE
YCIIEVNARLSRSSALASKATGYPLAYTAAKLGLNIPLNEIKNSVTKNTSACFEPSLDYL
VVKIPRWDLKKFTRVSSSLSSSMKSVGEVMAIGKTFEETIQKAIRAIDPSFAGFGKNAFP
MEEVDEELQNPTDRRLFAIANALHAGRSVEEVHKLTNIDRWFLHKLEHIVQVDKTVAQYT
ASQIPRGLLHCAKRTGFSDAQIAKALGSNELAVRRLRLDYGITPRVKQIDTVAAEFPAFT
NYLYMTYNGAAHDVNFGDKGIMVLGSGVYRIGSSVEFDWCAVRAIRTIRERGFKTVMVNY
NPETVSTDYDEADRLYFEVISLETILDIYEAETSSGVVVSMGGQTSNNIALPLHRQGVKI
LGTSPENIDMAENRYKFSRMLDKLGVDQPLWKELSTMEAAHEFCNKVGYPVLVRPSYVLS
GAAMNVVYSGDDLDAYLKQAAAVSREYPVVVTKYVEEAKEIEMDAVARDGKLVMHYVSEH
VENAGVHSGDATLILPPQDLDPETVRRIEEATRKIGAALNVSGPFNIQFLAKNNEIKVIE
CNVRAARSFPFVSKVTGIDAIELATDIMLGLPVTPYPELNLPKDYVGIKVPQFSFGRLAG
ADPILGVEMASTGEVACFGRDKYEAYLKALLSTGMRLPKQNILFSIGPYKEKIELLPSIQ
KLHSLGYNIFATVGTADFLQEHGVPVKYLEALDEEDDRKAQKSEYSLREHIANNKIDLYV
NLPSKNRYRRPANYMSKGYRSRRMAVDFAVPLVTNVKCAKLLIQAIARNPSLEISNVDYK
TSSRTVVLPGLVNTRCFVPGVAAPGSKDFAAVTKAALAGGFTTVQVLPVGDKANVVDSPS
LDVARDNAASAACDYSLAIAATADNASHLTGELIAGARTLFLPSQNLSASNDKVSTVAEH
FSAWPLNKPIVTDARTTDLASVLLLASLHGRSVHVTNVQTENDISLISLSKDKDLKVTCD
VSIYSLFLTREEFPGSTCLPTAKDQAALWDHLDTIDVFAVGALPFRLATELGKPYSPEAG
QEEAMQLLLTAVHDGRLSLDDIISRLSENPRAIFDLPPSPTETYVEVEVDRTVVVPRRSY
WSPLEGKKLAGAVHRVVLRGETLLLDGQFVARGAPAGRDLSALAGASKTERRQARFSVSA
TRPSLASIGMPTRENVTSPTMAPVSSQALATKSPKLASARSPRLEATESALQPSLMSLAT
DPVASSLALVSPMASLRKLLSAASRAYNRRHVLSVRSFNRDDLHTLFGVASEMRMLTERG
VPIDIMRGRVLSTMFYEPSTRTSASFEAAMARLGGSTVAVTPETSSVVKGETLADTIRTL
GCYADVIALRHPAAGSAQEAARYSPVPVINAGDGIGEHPTQAFLDIYTIREELGTVNGLT
ITMVGDLKNGRTVHSLVKLLSLYSVSLIFVAPPTLSMPESVKAEAEKAGIPVFETTSLAS
VIGRTDVLYVTRVQQERFASQAEYDAVKDSYIVNNEILSGAKEHCIVMHPLPRNAELDPS
VDLDARRAAFWRQVRFGLFVRMALLALVAAGSA