Protein Info for mRNA_8409 in Rhodosporidium toruloides IFO0880

Name: 16777
Annotation: K09667 OGT protein O-GlcNAc transferase

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 200 400 600 800 1000 1200 1400 1600 1800 2015 PF13374: TPR_10" amino acids 798 to 826 (29 residues), 20.4 bits, see alignment (E = 2.1e-07) PF13432: TPR_16" amino acids 1193 to 1239 (47 residues), 19.4 bits, see alignment (E = 7.5e-07) PF13181: TPR_8" amino acids 1209 to 1240 (32 residues), 19.2 bits, see alignment (E = 6e-07) amino acids 1244 to 1274 (31 residues), 17.6 bits, see alignment (E = 1.9e-06) PF13844: Glyco_transf_41" amino acids 1521 to 1652 (132 residues), 160.9 bits, see alignment 2.1e-50 amino acids 1752 to 1906 (155 residues), 170.6 bits, see alignment 2.5e-53

Best Hits

Predicted SEED Role

"TPR domain protein, putative component of TonB system" in subsystem Ton and Tol transport systems

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Search structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (2015 amino acids)

>mRNA_8409 K09667 OGT protein O-GlcNAc transferase (Rhodosporidium toruloides IFO0880)
MSGPPPPRPSGTPTGSTSLPSGTNAPPPSAVPSVQLPLPTSSDSGSVGSPAKKRKSSVSG
AMPILAPSEPVAREAGDRADAGEQEELSRRAPRTQANGATSSSSSTIPPAQLPISEPKNA
TLSTRLSPPPLAPLSLPTFPVLPPSIPSPSQTFQTPSQVSSILSLPPNLLLQLARTDSPL
EQQAIAGKKGVKGYLSSMAIPPPSPTTLVTTLVKEGGGGGLIGLPVVREASEREEGAAVP
ISSAGGLSARTSTPSPLPAYSLPPDLSLTSSSATTSSALVTTSPSSSSSSSAAGAGHPAI
PVLLGQIFAPPPLTPGVLAPIPGTSGIGEILRGQGHGHGPASVREFASVEDWSYRTAKDQ
VFFSLSSPSTSAAARPALTTSPSFTEREISSLASQIKDWALSEQEKLISHQVVESKSAAV
VLAAAHATTSAKEEQGEDLSAYAEVYKKQLDALASGYFAQLHRDALKRLTEEKVGSQSVS
RRGSETDQPVEAALPSTSSGHTDALISTVPVLSSSQAPTLTSLTSLTSSRSQAEAMHVRA
SAAAAVAAQVMAATSLARQAGEKLADLGVDPRKVVEEVEELVGTREGRRGSAGMEELGKI
LERAGSRRGSRQEEREEKGKGKEAVQPERRMSEAGKSDAMSIEEVGQQPEAASTSADAVA
STSAQVADARPASAADLASLPTAQRPSRTSPSSSRSASVAQAAPPAPVFVPPSQPSDLVA
LASPAKRDYLLAYAHQLYSTNPASGELLPLLHTLENVHPDHLPTLLLISCVYYTRGELES
SLYYNKRLLEHDPSYVEAMSNIGTTLRAMGKWNEAEAWWWKAIKLRPTYWDATENLLGVL
CNPTTAASPEAVAPAGVAPPPIEPRYQEALALCDYVESQIFAHAASLAHPALPPTAAYFD
PLPRLLRPRTLPSVIPQNHVHRLQNLLYAKGNLRLTTQDPALAQDEYEKAIEFALSLPEW
ARRLPGLHWPVEGCTTRDLVVAATVVGKILAAFAEAGSNPVSAQKVSQMAQQLGVADERG
SVPFERLFRTIKDGGDAYVQRLLAMGGGVLPTVLLEPQMLVQLPGMLFSEMRGTLPSMLD
PAIVSGEAHEARDPTRQQTVQSTNQTTSTMLLTLAKGLQDSLGPTSASRSTIGGIPASQS
LLLPLYYVALALYPSPSTCNNLGILLSTLNATTVVAGADPSKPPVVVTGQMLALRYYEAG
LKLDPKHPHLYTNYGSLLKDLGKLPEAVAMYKRAVEFNPNFDVALANLANAVKDTGQIQE
SIPYYRRAVELNPSFPEAICGLVNALGGVCDWQNRGGVDEEWLVDDKLQLYRVKKPTNGK
RIQEGYHGAIADLVRKQLHDGYSYGVGSLAACGTVQQWLSVISQALYGVEPAQAGDLMKP
WVARLTWLFGKYDRGASLVNEGGYLIRLVERLMRRIQRRWYLATFGPQVFANPAAPAPQP
IFPKTSDIAHYRRPMLPPSLPAIPVPTVLPFHCFTLPVTARETRLISHRTGLRISHATLN
QPWMPPIVYPPPRPPIDGKINVGYVSSDLGNHPLSHLMQSVFGLHDLSRFNVFVYATSPS
DESPYRQKIEAESQHFLDVSHLGTQQIVERIVHDQIHILINLSGYTKGARNEVFAARPSP
VQMSYMGFASTLSAGWCDYFIVDPIVCPPHLVSGNQWRYNNGYTLERPPNPQPLLRSTDL
EGDIDPECDSNRFVYTEKLIYLPHSYFVTDHKQAWREEETAAPVTTSLAPSEETTWALEE
AKRLQMRRHMFPNLRDDTVIFANWNQLYKIDPFIFRIWLEILKKHPNSVLWLLRFPAPGE
AHLKETALRWAGKEVADRVIFTDVANKNDHIHRGRIADLFLDTTECNAHTTAADILWSGT
PILTFPRHAHKMCSRVAASIAMATGFGPQMIVNNEHEYERRALELAAGLVYDTIPADPSK
PATTLEGREQRRSKGELAELRKKLFLTREQSPLFDTKRWVRNLEKGLVEAWTRWVQGSEF
EDAPEWLAGPGRASAAIFIPDDVDDTNLESRKPYF