H-InvDB_9.0 released on May 27, 2015.
H-Invitational ID:
HIT000046831
Accession number:
AK126958
Created date:
26-Mar-2013
Last modified:
27-May-2015
Definition:
Conserved hypothetical protein.
Select format
Flat file
XML file
Nucleotide sequence fasta
Protein sequence fasta
Transcript original information
Accession number
AK126958.1
CAGE tag ID
NA
EST ID
NA
Clone Number
BRAWH3013049
Experimental resources
NBRC; HGPD;
Sequence data provider
Project:FLJ; Provider:FLJ/HRI;
Annotation project
H-Invitational FLcDNA
Length of cDNA
3771[bp] (No. of exon:4)[A:1037 T:1188 G:747 C:799]
Division
HUM
Molecular type
mRNA
Library origin
Cell type
NA
Tissue type
brain
Developmental stage
NA
Sequence quality information
CDS feature
Complete CDS
Kozak sequence
NA
PolyA
NA
Vector/adapter sequence
NA
Frame shift
NA
Remaining intron
NA
Splice site acceptor (NAGNAG)
NA
Transcript quality feature
NMD predicted;
Notes
NA
ATTGTGAGAAGTTTTTATGCTGCTCTGGGTCATGGCTTGGAGACTGCCTC CTCAGAGCTTAAATGGGCAGATTTCATATTCTTCCCATCCTTGGGAATAC TTCATAAAACAGAATTACAAGAGTCCATGTTTTTTTCGGAGCACCGTGCA GTGGGAGAGCCTGTCTTGCCTTTTTTTTTTTTTTTTTTTTTTGCTGTCGT TTTATACAGGTATTTTTTTTTCTTTCTATTTCTTTTTTCTTTTACTTTTT ATTTCCTCTTTAAAGAAAATGATTGGCAGCACTCAACCTCAAGGAACTGA TCTATCAAAACCAAGCTGGGATAAGTATTTCTTTGAGAAATAATATATAT TTACCAACAGGCTCTATTCTGCCCCCTTGTTTCACAGCACCTTGAATATC ACTTCCTTTTCCTGCCCAAAGCAGGAGGTAACCCTGTATTAAAAGCATAG TAGGTGTGTGTGTGTACAATACACACATACAGCACACATGTCTCTATTTA GAGATTCCATGATATGTGTTCTATATACACTTTACAGTCCCTTTTCTTAA TGTCAAAAATATAATTTCCAGCGTCTAAAGAGTGTTTTCAAAGACTTTTG CCCTATTTTTTAAAATAGTGTCTAAATATGATTAAGTGTCTTCCCAGAGA AAAGTCAAAGAGGCTCCTAGTGTTAATTTCCATATTGCTTAAGACTTAAG CTTTTAATTTATTTTATCAAGGTTGAGGGAGTATAGTAATTGTTGGAACA ACGCCCTTCCAAAAGAAATCGCCTGCACTTGTTTTTAAGTTCAATTTGTT TTCTCACACAGATTTTTGATACACTCTTAAATAAACTAGAAATCACAATT ATTTTAATGTGGCAGAGTTGTAACCAGGAAATTGCATATATTTTCATAAA CTAGGCTGTTATAGATTTATTAATATTTATTAATAACATTATTTTAAGAA ATTTTTTTAACGTCTTTCAACCCTGAAAGGGCCTGCTTGAAACATACACG GCAGCAAAATCACTGGAGTCCAGGGTTTTTGTCACACACACACACAGCAC AGATTTTTTTGATTATGTAGATTTCTTTTCCTACTGCAGTTTCATATGCA AAGCATGCATCAGCCACATCCATGCTATCTCCTAAAGCAGTCATTCTGAA ACCCAATCTGGGGACAGTGCAGACTAAATAAAATTTCATTTTGATTTGGC CTCCCGCTGCGAATGGGACTGCTTTGTCTCAGCCGACAGAACCAGCAAGC CAGAAAGGGAATGAAGACATTTGGAGAAGAAGCGTTTACTGAAAGGAGAT TCCTGGGCTGGAGGAAAACCATGATCTACAGTTCATGTTGACAGATATTG TATTTTTGGTCCTAGATCTGACTTTTGAAATGACTTATCACAAATTCAAA TTTAGCAATCATGAGTGAAGTACCCACGATGTGCTAAGCATCTACCTCTG CAGATAGAAGAACACTTACTGTTCCTGAGCATGGGAGGGAATAGAAGTTC CTGGGTCCTGGCCTCACCTCATCAGGCACTGCCAAATTTGACACCTCCTT CCTGTTCACGGCAGTCCTTTGCGATAGGGTATTAGATGCCACCAAGTAGC TGCATAACTCACCCATTTGGTCAGAGCCTGGGTCTAAGCCAGGTCTGACA GACTCCAAAGCAAGCCCTGTTTACAGCATGACTGGGATCCAGTGGGTAAG ACCAACCTGTCAATGCCCAAGTGCAGATGCAATAATAGTGCTTCAACTGA TTGTTGCAAACCCCTCATGAGGGATGCCAAATCCCAGTGCAGATGGTAAC ACAGGCTATTTTTCACCTCCAACCTCTAATATCCTTTCTTTCTTTCTTTC CTTTTTTTTTTTTTTTTTTTGAGATAGAGTCTCCTCTGTCACCCAGGCTG GAGTGCAACAGCACGATCCCAGCTCACTGCAACCTCCACCTCCTGAGTTC 
AAGCGATTCTCTCGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACCC GCCATCATGTCCAGCTAATTTTTGTATATTTGTAGAGACGGGGTTTCACC ATGTTGGCCAGACTGGTCTTGAACTCCTGACCTCAGGTGATCTGCCCGCC TCAGCCTCCCAAAGTGCTAGGATTACAAGTGTGAGCCACCACGCCCGGCC CCAATATTCTTTCTTACTAGACACTGTGTAAGCTAATTATTCAGCAAGTA TTTTGCACTGAACTCTAAATAGGTAAGTATTTTGTGTCCATTTTACAGAC AAGAGAAATAAAATAGGTAAAGTATTTCCCCAAAGCTATGATAAATGAAT CCCATAGCTAGAAATGGCAGAAAACCCATGGAAACTTCTATGGGGAGTGG ATGTCTGCCTTACCTTGGCACAATGCTTGGTTCTCAGATATCATTTCAGT GACATTGCCAAATGGGAGAAACCATGTCTGGTACACGGGACTCAAGCTGC CTCAAAATATCCTGAATGCATTTTTTCAGGGGAGAAAATCCATTTGAATA AGAATAGCTAGTTATAGCTTATTATGTGCCAGACACTATTCAAAGTGTTT AATGTGTATTAACTTCTCAAAACCACTCTATAAGTTAGGTACTATGATTT CCATCTCCATTTTACAGACGAGGGAACTGAGCTGTCAAAACTAAGTGACT TGATTGAGGCCATGCAGCTGGCGAGGGGAGGACCAGGATCTGAACCCAAC TGGGTCTCCCAAGCCTGGGCCACTGAACCATTTCACCTCTTGGAAAAACT TAATGGTCTGAGTGGCCCACCCAGTCCATGGACAGCATGGAGTGGTATGG TGGAAGAGGAGGGAAGAAAACCTGTGTATTGTAAGTCATATTCTTCTATG GTGTGAAGGTTAACGTAAGTTTTAAGAAAGTTTTTTTTTTAATTTTATTA TATGATCTTGCTCTAAAGGGATGTAAATTCAGATGCTGAATTGCACACAT CCTTTCTCTTTCATATTTTCAATATTGAACGTAATTTCAATATTTAACAT AAAACAATGAGTACATTGTCTCCACCTCATTGTTTTTGGGTGGTGTTTGT GAGGAGCCTGCGTTCTTTGGAAGGAAGACCTTTCAGATGACCTGACCAGT CCTCCTCTTCAGAACCCGGATGCTGGCGAAGCAGTTTGAATTTTATGCTT GTGAAAGGGCTCCGTTGATGATTTTGATGTCTGCAGCTTTCCACCCGTAT GACCAGACACATTCTCACCAGCTCCATATATCAAGATAAGGAGGGGAAAC CTGGTAGCTTTTCCTTCTGTTAACTGCTGGCATCAGCTGAGTGATGCAGA CATTTCTCTATTAAGAACTGAGCTGAGACTGAAGCTTCATTTTGTATGAG ACTGTGCAGAGGTCGTCTAAAGTCTCTCCCAGGTGTGGTTATTAAGATCC TGGATTTGAAACTGTGACCTACTGGTTTGCCAGATGCCAAGAACAAATGC TCTGAAATTGATTTGCCAAAAGACATGATGAGCTGCTCTAACTTGCCTGG GAAGAGTGGAATATTTAACCTGTGGGTGAGACTCCCTCTTGCTACCTATC AGCTTTGCCACTGCTCTGATAGAGAAACATCTTGGGAGCAGAGTTGGTAA GAGTGAATCAGACGTATCTGCCGGAATAGGACTCGTGGCACCTGCTTGTT CGATCCCTCATTTCCACCCCCTCTATCCTTTGCCTATTGTCAGTCATTGT GGTTGGTCCATTCAGAAGAATCGTGAATATTCATAGCCACCCTAATTTAC CAATATATATTCAATAATGTC
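As a quick consistency check on the "Length of cDNA" field above, the four base counts should add up to the reported 3771 bp. The following Python sketch does that sum and also derives the GC fraction from the same counts. The function names are illustrative only and are not part of H-InvDB.

```python
# Base counts and length copied from the "Length of cDNA" field of this record.
BASE_COUNTS = {"A": 1037, "T": 1188, "G": 747, "C": 799}
REPORTED_LENGTH_BP = 3771


def total_length(counts):
    """Return the sequence length implied by per-base counts."""
    return sum(counts.values())


def gc_fraction(counts):
    """Return the fraction of G and C bases among all counted bases."""
    return (counts["G"] + counts["C"]) / total_length(counts)


length_matches = total_length(BASE_COUNTS) == REPORTED_LENGTH_BP
print("counts match reported length:", length_matches)
print("GC fraction:", round(gc_fraction(BASE_COUNTS), 3))
```

With these counts the sum agrees with the reported length, and the GC fraction comes out at roughly 0.41.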
Gene structure information
H-Inv cluster ID
HIX0002336
Genomic location
Chromosome
2
Location
2q12.1
Position
105049725- 105137290
Strand
+
Possible duplicated location(s)
NA
Gene structure
4 exon(s)
Database links
RefSeq
NA
Ensembl
NA
Entrez Gene
Entrez Gene ID:150568;
KEGG GENES
KEGG GENES(150568);
GeneCard
NA
*GeneCards is provided free to academic non-profit institutions.
Related H-InvDB links
H-DBAS; G-integra; cDNA-genome alignment;
Predicted CDS information
HIP ID
HIP000089044
Predicted CDS
2662..2856; 64[aa]; Orientation:+1;
Codon Adaptation Index (CAI)
0.775
MQLARGGPGSEPNWVSQAWATEPFHLLEKLNGLSGPPSPWTAWSGMVEEE GRKPVYCKSYSSMV*
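The predicted CDS range and the protein length can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates 2662..2856 are 1-based and inclusive, and that the range includes the stop codon (shown as `*` at the end of the protein sequence). It also assumes that "Orientation:+1" refers to reading frame 1 on the forward strand, which the record itself does not define. `cds_summary` is a hypothetical helper name.

```python
def cds_summary(start, end):
    """Summarise a CDS given 1-based, inclusive transcript coordinates."""
    length_nt = end - start + 1
    codons, remainder = divmod(length_nt, 3)
    return {
        "length_nt": length_nt,
        "codons": codons,
        "remainder": remainder,
        # Assumption: the last codon in the range is the stop codon.
        "residues": codons - 1,
        # Assumption: frame 1 is the frame that starts at transcript position 1.
        "frame": (start - 1) % 3 + 1,
    }


summary = cds_summary(2662, 2856)
print(summary)
```

Under these assumptions the range spans 195 nt, or 65 codons with no remainder, which matches the 64 residues plus stop codon listed above, and the start position falls in frame 1.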
Gene function information
H-Inv ID
HIT000046831
H-Inv cluster ID
HIX0002336
Accession number
AK126958.1
CAGE tag ID
NA
EST ID
NA
Transcript feature
Representative transcript;
Splicing isoform
Coding potential
Protein coding;
Definition
Conserved hypothetical protein.
Similarity category
Category: Conserved hypothetical protein(Category IV).
Conserved hypothetical protein.
Experimental evidence
NA
PubMed ID
NA
Gene family/group
H-Inv gene family/group ID
NA
Gene family/group name
NA
Evidence motif (InterPro) ID
NA
Gene symbol/name
HGNC symbol
NA
HGNC aliases
NA
HGNC name
NA
DDBJ
NA
UniProt
NA
EC number
NA
GGDB (GlycoGene Database)
Gene symbol
NA
Family
NA
Designation
NA
Expression
NA
KEGG metabolic pathway
NA
Protein-protein interaction (PPI)
H-Inv protein ID
HIP000089044
No. of interaction
NA
Interaction partner(s)
NA
BIND
NA
DIP
NA
MINT
NA
HPRD
NA
IntAct
NA
Database links
RefSeq
NA
Ensembl
NA
Entrez Gene
Entrez Gene ID:150568;
KEGG GENES
KEGG GENES(150568);
GeneCard
NA
*GeneCards is provided free to academic non-profit institutions.
etc
Human-Gene diversity Of Life-style related Diseases;
Curation status
Human curated
Notes
NA
Related H-InvDB links
Gene family; Similarity Search Tool; TACT;
fRNAdb (The functional RNA database) : overlapping fRNAdb entries with H-InvDB transcripts based on the location of the genome.
NA
Subcellular localization information
Last modified:27-May-2015
WoLF PSORT
extracellular; nuclear; cytosol;
Target P
Other
SOSUI
soluble protein
TMHMM
soluble protein
PTS1
Not targeted
Related H-InvDB links
LIFEdb;
Gene expression information
Last modified:27-May-2015
Tissue-specific expression
NA
Probe information
AceGene
NA
Affymetrix GeneChip
HG-Focus
NA
HG-U133
NA
HG-U133A
NA
HG-U133A_2
NA
HG-U133B
NA
HG-U133_Plus_2
NA
HG-U95
NA
HG-U95A
NA
HG-U95B
NA
HG-U95C
NA
HG-U95D
NA
HG-U95E
NA
HG-U95Av2
NA
HuEx-1_0
2497654; 2497659; 2497660; 2497661; 2497670; 2497671; 2497672; 2497673; 2568326; 2568352; 2568353; 2568356; 2568358; 3717314;
HuGeneFL
NA
Agilent
Human 1A Oligo Microarray:PGID215
NA
Whole Human Genome Oligo Microarray:PGID247
NA
Related H-InvDB links
H-ANGEL; DNAProbeLocator;
Polymorphism (SNP, indel), microsatellite (Short Tandem Repeat, STR) and repeat information
Single Nucleotide Polymorphism (SNP) and indel
Location | Variation | dbSNP ID | Strand | CDS/UTR | Translation
145 .. 145 | C/A | rs62149680 | + | 5'UTR
167 .. 167 | T/G | rs6750708 | + | 5'UTR
197 .. 197 | T/C | rs12162407 | + | 5'UTR
246 .. 246 | T/C | rs186030293 | + | 5'UTR
279 .. 279 | G/A | rs149690376 | + | 5'UTR
297 .. 297 | C/G | rs188882931 | + | 5'UTR
505 .. 505 | T/C | rs1519633 | + | 5'UTR
517 .. 517 | T/C | rs1519634 | + | 5'UTR
519 .. 519 | T/G | rs181974418 | + | 5'UTR
536 .. 536 | A/T | rs186358224 | + | 5'UTR
573 .. 573 | G/A | rs192145711 | + | 5'UTR
637 .. 637 | T/G | rs79047296 | + | 5'UTR
741 .. 741 | T/G | rs149449057 | + | 5'UTR
794 .. 794 | A/C | rs143039420 | + | 5'UTR
809 .. 809 | C/G | rs55864239 | + | 5'UTR
915 .. 915 | A/C | rs148208172 | + | 5'UTR
1045 .. 1045 | C/G | rs78607502 | + | 5'UTR
1163 .. 1163 | G/A | rs182606449 | + | 5'UTR
1306 .. 1306 | G/T | rs59518628 | + | 5'UTR
1411 .. 1411 | A/G | rs145710315 | + | 5'UTR
1471 .. 1471 | G/C | rs186068278 | + | 5'UTR
1481 .. 1481 | A/G | rs77291749 | + | 5'UTR
1488 .. 1488 | G/A | rs147239397 | + | 5'UTR
1559 .. 1559 | C/T | rs140679040 | + | 5'UTR
1602 .. 1602 | G/T | rs112474527 | + | 5'UTR
1696 .. 1696 | G/A | rs7583889 | + | 5'UTR
1863 .. 1863 | T/G | rs77102204 | + | 5'UTR
1954 .. 1954 | C/T | rs149717439 | + | 5'UTR
1956 .. 1956 | A/T | rs13402409 | + | 5'UTR
2097 .. 2097 | C/T | rs113625240 | + | 5'UTR
2098 .. 2098 | G/A | rs191128853 | + | 5'UTR
2107 .. 2107 | T/A | rs180747139 | + | 5'UTR
2137 .. 2137 | C/A | rs148897680 | + | 5'UTR
2142 .. 2142 | C/T | rs61213833 | + | 5'UTR
2143 ^ 2144 | -/G | rs34675891 | + | 5'UTR
2436 .. 2436 | C/T | rs145599498 | + | 5'UTR
2525 .. 2525 | T/G | rs186606380 | + | 5'UTR
2673 .. 2673 | G/A | rs190064711 | + | CDS | Synonymous[Ala4Ala]
2678 .. 2678 | G/A | rs138138986 | + | CDS | Nonsynonymous[Gly6Glu]
2755 .. 2755 | G/A | rs76581053 | + | CDS | Nonsynonymous[Gly32Ser]
2759 .. 2759 | T/A | rs3762500 | + | CDS | Nonsynonymous[Leu33Gln]
2762 .. 2762 | G/A | rs183191102 | + | CDS | Nonsynonymous[Ser34Asn]
2772 .. 2772 | C/T | rs187480049 | + | CDS | Synonymous[Pro37Pro]
2802 .. 2802 | G/A | rs114643228 | + | CDS | Synonymous[Val47Val]
2851 .. 2851 | G/A | rs74321923 | + | CDS | Nonsynonymous[Val64Met]
2890 .. 2890 | T/A | rs181022474 | + | 3'UTR
2922 .. 2922 | T/G | rs11885382 | + | 3'UTR
3006 .. 3006 | A/G | rs77396095 | + | 3'UTR
3062 .. 3062 | G/A | rs143234682 | + | 3'UTR
3075 .. 3075 | A/G | rs185051388 | + | 3'UTR
3089 .. 3089 | G/A | rs191550727 | + | 3'UTR
3097 .. 3097 | C/T | rs73946473 | + | 3'UTR
3118 .. 3118 | G/A | rs75991058 | + | 3'UTR
3289 .. 3289 | G/A | rs142916750 | + | 3'UTR
3419 .. 3419 | C/T | rs151101585 | + | 3'UTR
3487 .. 3487 | T/C | rs141085803 | + | 3'UTR
3613 .. 3613 | C/T | rs186374115 | + | 3'UTR
3619 .. 3619 | T/G | rs116087331 | + | 3'UTR
3622 .. 3622 | C/T | rs145180318 | + | 3'UTR
3623 .. 3623 | G/A | rs35295399 | + | 3'UTR
3636 .. 3636 | T/C | rs115505403 | + | 3'UTR
3666 .. 3666 | A/T | rs191237620 | + | 3'UTR
3675 .. 3675 | A/G | rs139024660 | + | 3'UTR
3682 .. 3682 | G/C | rs183196370 | + | 3'UTR
3731 .. 3731 | T/C | rs186053164 | + | 3'UTR
3755 .. 3755 | A/G | rs56243127 | + | 3'UTR
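The residue numbers in the Translation column (for example Ala4 or Val64) can be reproduced from the SNP location and the predicted CDS range 2662..2856. The following Python sketch assumes that SNP locations use the same 1-based transcript coordinates as the CDS range; the record does not state this explicitly, but the listed residue numbers are consistent with it. `residue_at` is a hypothetical helper, and the protein string is the predicted sequence from this record without the trailing `*`.

```python
# Predicted CDS range and protein sequence copied from this record.
CDS_START, CDS_END = 2662, 2856
PROTEIN = (
    "MQLARGGPGSEPNWVSQAWATEPFHLLEKLNGLSGPPSPWTAWSGMVEEE"
    "GRKPVYCKSYSSMV"
)


def residue_at(position):
    """Map a 1-based transcript position to (codon number, one-letter residue).

    Returns None for positions outside the CDS (the UTRs); positions inside
    the final stop codon are reported with "*" as the residue.
    """
    if not CDS_START <= position <= CDS_END:
        return None
    codon_number = (position - CDS_START) // 3 + 1
    if codon_number > len(PROTEIN):
        return codon_number, "*"
    return codon_number, PROTEIN[codon_number - 1]


# SNP locations from the CDS rows of the table, plus one 5'UTR location.
for pos in (145, 2673, 2678, 2755, 2759, 2762, 2772, 2802, 2851):
    print(pos, residue_at(pos))
```

For the eight CDS rows this yields codons 4, 6, 32, 33, 34, 37, 47 and 64 with residues A, G, G, L, S, P, V and V, in agreement with the Translation entries. The sketch does not predict the substituted amino acid, since that would need the codon sequence and a genetic code table.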
Microsatellite (Short Tandem Repeat, STR)
No data available
Microsatellite: Human-Gene diversity Of Life-style related Diseases (H-GOLD)
No data available
Repeat
Type | Start | End | Strand
MIRb | 1618 | 1682 | -
AluSx1 | 1843 | 2150 | -
MIRb | 2221 | 2321 | -
MIR | 2517 | 2732 | -
LFSINE_Vert | 3291 | 3426 | -
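To see which annotated repeats fall inside the predicted CDS (2662..2856), the repeat intervals can be intersected with the CDS interval. This is a minimal sketch that assumes the repeat Start and End values are 1-based, inclusive transcript coordinates on the same scale as the CDS range; the record does not say so explicitly. `overlap_length` is an illustrative helper name.

```python
# Repeat annotations (type, start, end) copied from the Repeat table.
REPEATS = [
    ("MIRb", 1618, 1682),
    ("AluSx1", 1843, 2150),
    ("MIRb", 2221, 2321),
    ("MIR", 2517, 2732),
    ("LFSINE_Vert", 3291, 3426),
]
CDS_START, CDS_END = 2662, 2856  # predicted CDS range from this record


def overlap_length(a_start, a_end, b_start, b_end):
    """Number of positions shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


cds_overlaps = [
    (name, overlap_length(start, end, CDS_START, CDS_END))
    for name, start, end in REPEATS
    if overlap_length(start, end, CDS_START, CDS_END) > 0
]
print(cds_overlaps)
```

Under that assumption only the MIR element at 2517..2732 reaches into the CDS, covering positions 2662..2732 (71 nt).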
Database links
Human-Gene diversity Of Life-style related Diseases (H-GOLD);
Related H-InvDB links
VaryGene; Repeat Mask Viewer;