Protein Info for mRNA_2533 in Rhodosporidium toruloides IFO0880

Name: 10901
Annotation: K10706 SETX, ALS4 senataxin

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

Sequence scale: amino acids 1 to 2046
transmembrane (Phobius): amino acids 449 to 474 (26 residues)
transmembrane (Phobius): amino acids 491 to 506 (16 residues)
PF12726: SEN1_N, amino acids 128 to 948 (821 residues), 535 bits, E=9.8e-164
PF13086: AAA_11, amino acids 1400 to 1687 (288 residues), 230.8 bits, E=7.9e-72
PF13245: AAA_19, amino acids 1413 to 1683 (271 residues), 30.1 bits, E=1.7e-10
PF13087: AAA_12, amino acids 1695 to 1891 (197 residues), 209.3 bits, E=1.4e-65
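
The residue counts in the feature listing above are consistent with 1-based, inclusive coordinates (end - start + 1). The following is a minimal Python sketch that rechecks those counts and reports which features overlap. The coordinates are copied from the listing; the tuple layout and the function names are illustrative assumptions, not a format or API used by the site.

```python
# Feature coordinates copied from the listing (1-based, inclusive).
# Layout is hypothetical: (label, start, end, residue count as listed).
FEATURES = [
    ("transmembrane", 449, 474, 26),
    ("transmembrane", 491, 506, 16),
    ("PF12726 SEN1_N", 128, 948, 821),
    ("PF13086 AAA_11", 1400, 1687, 288),
    ("PF13245 AAA_19", 1413, 1683, 271),
    ("PF13087 AAA_12", 1695, 1891, 197),
]
PROTEIN_LENGTH = 2046  # from the "Protein Sequence (2046 amino acids)" heading


def span_length(start, end):
    """Number of residues in an inclusive 1-based interval."""
    return end - start + 1


def overlaps(a, b):
    """True if two features share at least one residue."""
    return a[1] <= b[2] and b[1] <= a[2]


def overlapping_pairs(features):
    """Return (label, label) for every pair of features that overlap."""
    pairs = []
    for i, a in enumerate(features):
        for b in features[i + 1:]:
            if overlaps(a, b):
                pairs.append((a[0], b[0]))
    return pairs


if __name__ == "__main__":
    for label, start, end, listed in FEATURES:
        status = "ok" if span_length(start, end) == listed else "mismatch"
        print(f"{label}: {start}-{end}, {span_length(start, end)} residues ({status})")
    for first, second in overlapping_pairs(FEATURES):
        print(f"overlap: {first} / {second}")
```

With these coordinates, the overlap check flags the AAA_11 and AAA_19 hits, which cover nearly the same region, and both predicted transmembrane segments, which fall inside the SEN1_N hit.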

Best Hits

Predicted SEED Role

No annotation

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Search structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (2046 amino acids)

>mRNA_2533 K10706 SETX, ALS4 senataxin (Rhodosporidium toruloides IFO0880)
MPAQPAASTSRPPTALSSPTFTPPPRLVSNLAALRDPTRVSDDLWTETTEASVLWLLSLK
PPGAASTSPHQPSSKGKERARVDEDGVAVDEDLVHWFCGAKGAQECWEPAIFCIRLLGMK
RIGEVANWRDSFERLMDSCPACVRAYQTAKIDFRDNYLRHYKPDKSIQTFTNGVEAIEFA
SVMRSFAAAGFDTPPSNTPAGDSSWRPSRRARLNDVPEAAVENVLSNPRLFESDNVIEML
LSGLDSHDFRPLALPPTPSAGLLAFRLHIDHDLSALAETQIRRCTQQPDADAFRRQGLAA
VMDSHLGALASRDRGEVNVPAGIHLHYTDQRNDFMGGVATCLWSLSADAIKQVLVRQPTS
LAASLDVVHLVAAHLADRGDHLVSVVDCFTVLLERLGTDFWSVGDEKYEEVVLHAILDND
DFHEAFEEQAHQQAATSEAQTFDTSWLAWLPPFLASVAQSPALFTNALALITSTFLDRLQ
RPRFDPLVRTAALRLAVGILSDVFISNTAPTPSSATADIVAVAPRYPHAAAATKVLDLHA
TTIASFAFSASYATAEWASAVELARTFVGSVLKRDGKTIARAVYSLATFSQNHHERERRE
KKRLQALANGDAKAKKEAEPAPIPPRNVAFAKAMWDQAYASVREGDAAGIAVLVQGAAPT
AQFEKLTSRSWAIKDLVRPQMKAINDALATARDPTVGVIMQLADERVDVLLDFLGRPGVV
QAVIALLISPVEAVHNAVQGLVKQAFDVTTRRDVFRCLIGRWPEQSLRGLAHTLQLFQTS
SRLLPEACGLAKRLVRCFSDVLDVLCDTTDGLLRDPDFVRRGRDVKLQAKLLALWKLMAE
ALGLLFNRTPEWANYFENDEMTEWMRDAVLFGADMLEQIRVLETIIAGQALDRFASGGSA
VLDSPAQKSALAKESSTAEQMIGVLAEPLEESIAWLRLNDVDLLNQTFALVLKMVGRFTR
SRIPLRETTHAKLKRMADRPAAGADQRRSTILRESQLLEIREALEDNEDAVRRRKAGAPV
EVLDLSDDETPKPASRASSIASSSTGTLRVKGQVKTSLVPPQPQASKTGRTASSLASSRA
RPTPAIPARPRGVPWTTYSSKKAESESESSDDEDAVRGADGKKLTGLALLAKDQKPSIKK
VTTGQRKIQLIGSDGRPTASASSRTRTPIGIGRSKEDLQALRAARLRSAQDLSRLHRAVL
RWDPAATEDAPPNVDLLKRLPESFKTPKEYFAAFEPLLLTECWEQIRQAKLEALKEGDVI
HADIAGRQSVDDFVDVFCTIQHGQLRSGTYFGDTDLVWLRQGPRQIFAKIQAVSRKREHI
ELTLRCHLGKDVHNAGSGLTARTKWEMVKLVNLSTVHREYAALQALEFIDLCGDVLAPRP
PPSIKTDPRTIEKTMAAYKVNEPQAIAIHGALRTQGFSLIQGPPGTGKTSTIVGLVGAFV
DSRPRVAAPIDVGRPTDTSQIPPVAKVLLCAPSNAAVDEVAKRLKEGVRLMDGSLYVPKV
VRIGADSAIDIAVKDVFIDELVERATSGNKSTTGTNDAQSRMQNMRNEIDSLRSDRDAKK
VEMDSITNNEFRRGELNLELRKIKARIFELSQMLDTEKDKAQQSRRTMDAEQRKMRLKIL
SEADVICSTLSGAGHDYMSQLPFDFETVIIDEAAQSIELSSLIPLKYGCTRCILVGDPLQ
LPPTVISGVAARGGYDRSLFVRVMQRGPQAVHLLSIQYRMHPNISAFPSAAFYQSRLTDG
PEMDKKTLQPWHANALFPPYTFYHIEGQEMQGRHHSYTNPVEAATALAIYERLRRDYPSS
DFDYRIGVVTPYKGQVIELKRTFRQKYGEEIVTKIAFNTVDGFQGQEKDIIILSCVRGGS
ADKGVGFLADTRRMNVALTRARSSIWILGDSNKLRANQYWGQLVADAETRGLFRRADVQL
FRSSYSAPPVVRTALPSVPVSRSRAGAIEMSPAYSSGQNGLVAPTPPHVVNGTGSALKKR
LTDAMDVDSPAKRPKMEDGEGEEQKPRVQVPRPPVQRPQVPPQQQQPLRRKAPPSLFVPK
KRPPPK