H-InvDB_9.0 released on May 27, 2015.
H-Invitational ID:
HIT000328481
Accession number:
AB073894
Created date:
26-Mar-2013
Last modified:
27-May-2015
Definition:
Similar to Zinc finger protein ZFPM2; Friend of GATA protein 2; FOG-2; Friend of GATA 2; hFOG-2; Zinc finger protein 89B; Zinc finger protein multitype 2;
Transcript original information
Accession number
AB073894.1
CAGE tag ID
NA
EST ID
NA
Clone Number
Nbla03139
Experimental resources
NBRC; HGPD;
Sequence data provider
NA
Annotation project
NA
Length of cDNA
3351[bp] (No. of exon:1)[A:1046 T:923 G:645 C:737]
Division
HUM
Molecular type
mRNA
Library origin
Cell type
NA
Tissue type
NA
Developmental stage
NA
Sequence quality information
CDS feature
N-truncated
Kozak sequence
NA
PolyA
NA
Vector/adapter sequence
NA
Frame shift
NA
Remaining intron
NA
Splice site acceptor (NAGNAG)
NA
Transcript quality feature
NA
Notes
NA
TTCGGCTTCCAGACTCAGAGGGAGTTATTGCAGCACCAGGAGCTCCATGT CCCTAGCGGCAAACTTCCCAGAGAAAGTGACATGGAACACTCTCCAAGTG CAACTGAAGACAGCTTACAGCCAGCCACAGACTTATTGACCAGAAGCGAA CTTCCCCAGAGCCAAAAGGCCATGCAGACTAAAGATGCGAGCTCTGACAC AGAGCTGGACAAGTGTGAGAAAAAGACTCAGCTCTTTCTCACGAACCAGA GACCAGAGATACAGCCTACAACAAATAAACAAAGCTTTTCTTACACAAAA ATAAAGTCTGAGCCCTCTAGCCCAAGACTTGCCTCATCTCCAGTTCAGCC TAATATTGGGCCTTCTTTCCCTGTGGGCCCTTTCCTATCTCAGTTTTCTT TCCCCCAAGATATCACCATGGTCCCTCAAGCTTCAGAGATCTTAGCTAAG ATGTCTGAACTGGTGCATCGGCGACTGAGGCATGGCAGTAGTAGCTACCC TCCCGTCATTTACAGCCCTTTGATGCCCAAGGGGGCTACTTGTTTTGAGT GTAACATAACATTCAATAATTTGGATAATTATCTAGTGCACAAAAAGCAT TATTGCAGCAGCCGATGGCAGCAGATGGCTAAGTCCCCAGAGTTCCCTAG TGTGTCAGAAAAGATGCCTGAAGCTTTGAGTCCCAACACTGGCCAAACCT CCATAAACCTTCTCAACCCAGCTGCTCATTCTGCTGATCCTGAGAATCCA CTTCTTCAAACATCTTGCATCAATTCTTCCACTGTCTTAGATTTAATTGG GCCAAATGGGAAGGGCCATGACAAGGACTTTTCCACTCAAACTAAGAAGC TCTCCACCTCCAGTAACAATGATGACAAAATTAATGGAAAACCTGTTGAT GTGAAAAATCCCAGTGTCCCCTTAGTGGATGGGGAAAGTGACCCAAATAA GACTACCTGTGAAGCTTGCAACATTACCTTCAGCCGGCACGAAACATACA TGGTCCACAAACAGTATTACTGTGCTACACGCCACGACCCTCCACTGAAG AGGTCTGCTTCCAACAAAGTGCCTGCCATGCAGAGAACCATGCGCACACG CAAGCGCAGAAAGATGTATGAGATGTGCCTACCTGAGCAGGAACAAAGGC CTCCACTGGTTCAGCAGAGATTTCTTGACGTAGCCAACCTCAATAATCCT TGTACCTCCACTCAAGAACCCACAGAAGGGCTAGGAGAGTGCTACCACCC AAGATGTGATATCTTTCCAGGAATTGTCTCTAAACACTTGGAAACTTCTC TGACGATCAACAAGTGTGTTCCAGTTTCCAAATGTGATACTACTCATTCC AGTGTTTCCTGCCTAGAGATGGACGTGCCCATAGATCTCAGCAAAAAGTG TTTATCTCAGTCTGAGCGGACGACCACGTCTCCCAAAAGGCTGCTGGACT ATCACGAGTGCACTGTGTGCAAGATCAGTTTCAATAAGGTAGAAAACTAT CTGGCCCACAAGCAGAATTTCTGCCCGGTTACTGCACATCAGCGTAATGA CCTGGGTCAACTGGACGGCAAAGTGTTTCCGAATCCAGAAAGCGAACGAA ACAGCCCTGATGTCAGCTACGAAAGAAGCATAATAAAATGTGAGAAAAAT GGGAATTTGAAGCAGCCTTCCCCCAATGGAAACTTATTTTCATCCCACCT AGCAACCCTGCAAGGCTTGAAGGTCTTTAGTGAAGCTGCTCAGCTCATTG CTACAAAAGAAGAAAACAGACATTTGTTTCTTCCACAATGCCTTTACCCT GGAGCAATAAAGAAAGCAAAAGGAGCCGACCAGCTTTCTCCATATTATGG AATCAAGCCAAGTGATTATATTTCTGGTTCTCTTGTCATCCATAACACTG ACATCGAGCAAAGCAGAAATGCAGAAAATGAATCTCCTAAAGGCCAGGCT 
TCCTCAAATGGGTGTGCTGCGCTGAAGAAAGATTCTCTGCCATTGTTGCC CAAAAATCGAGGAATGGTAATAGTGAATGGTGGACTGAAACAAGATGAGA GACCTGCTGCCAACCCACAGCAAGAGAACATTTCCCAGAATCCTCAGCAC GAAGACGACCACAAATCTCCCTCGTGGATCTCTGAGAACCCATTAGCTGC CAATGAGAATGTCTCACCAGGAGTTCCCTCAGCAGAGGAACAGTTGTCTA GTATAGCAAAAGGTGTGAATGGTTCCAGCCAGGCTCCAACCAGTGGGAAA TATTGCCGGCTATGTGATATCCAGTTCAACAACCTTTCAAACTTTATAAC TCACAAGAAGTTTTATTGCTCATCACATGCAGCAGAACATGTCAAATGAA CTAACTAAACATCAGTCACCTTTGGTATCAGTGTTTAGTATGTTGTTCTA ACCAGTCCAGAAAAAAAAATAAGCTGTTTGAATTACATCTGGGCAATCAG GAGATAATTCATTATGGCTGAGTTGAAGACTTAAGGTGTAATTTCATTAC AGTCCATTAGTAAAGTGTATTATTGGTGCCATTTTCAAAAAAATTAATTT ATTTTACCAGCAGTATTCATAGCTGTGGTTATGTTATTTTTTATTTAAAA ACTTTATATTAAAGTCATTTGTAATGTTATTGTATAGTTATTGTGTAGCA CATATGGTTTGCACTGTATAGTAGCTTTTAAAGAAAATAGTCACAATACA GAAAAGCATTTTAGAAATAGCTTCAAAAGCACTTGTGTATCTTGATTTTT TCTTATATGCTGTTGCAGATATATGTATATGCTAAAATATAACTTGCAAA GATGTTCTAAATACACATGCTATAAGTTCGCCTTAAGATTTCAATTCTTG GATAATCAGGCTCTGTTTGCACTTTATATTTTAGCAGATACAGTCTCTTA GTCACTAGGCTTTGCATTTGTATGTAGCTGTATGTTTCCGTCCATTTTCT TAATCCTGAACCTGTATGTTAAATGAAGATGGCAATTTTTTTCTTGTATA GTACTTGTATTTTCTTTCGCTGATGCAGCTCTGTCTCAATTTTTAAACCT TTGCTGTTAAATGCAATACTTTATAAAGAATGAACAAAATTACTGGAAGC AGTATTGTAAGTAATGAGGTAGTATTAATCAGTTTTATCTTTTGAAAGGC ACAGTCTAAATCGAAACCCTAAACTCAATGCTGCAAGTATGAATTTAATT CATATATAAGATCTATTTAAATATAAGAGTAGCAATACTGCACCTGGTGA TCACAAAGATAATGTTCTACTTCTGATAGAAATAATTTCTCAACAAATGT TGTTACTATGCATGTATATGGATGGAATAAAATTCCAGATTGTTGGAAAA A
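The "Length of cDNA" value above packs the total length, the exon count, and the per-base counts into one string. As a minimal sketch of how that field could be checked for internal consistency, the hypothetical helper `parse_cdna_length` below (not part of H-InvDB, and assuming the field always follows the layout seen in this record) parses the value and confirms that the four base counts add up to the stated length.

```python
import re

# Hypothetical pattern for the "Length of cDNA" value as it appears in this
# record; other records may use a different layout.
CDNA_LENGTH_PATTERN = re.compile(
    r"(\d+)\[bp\] \(No\. of exon:(\d+)\)"
    r"\[A:(\d+) T:(\d+) G:(\d+) C:(\d+)\]"
)


def parse_cdna_length(field: str) -> dict:
    """Split the field into total length, exon count, and base counts."""
    match = CDNA_LENGTH_PATTERN.fullmatch(field.strip())
    if match is None:
        raise ValueError(f"unrecognized 'Length of cDNA' value: {field!r}")
    length, exons, a, t, g, c = (int(x) for x in match.groups())
    return {
        "length": length,
        "exons": exons,
        "counts": {"A": a, "T": t, "G": g, "C": c},
    }


record = parse_cdna_length("3351[bp] (No. of exon:1)[A:1046 T:923 G:645 C:737]")

# The per-base counts should account for every position in the cDNA.
total = sum(record["counts"].values())
assert total == record["length"]

# GC fraction derived from the same counts.
gc_fraction = (record["counts"]["G"] + record["counts"]["C"]) / record["length"]
print(total, round(gc_fraction, 3))
```

The same counts could also be recomputed from the nucleotide sequence itself after stripping the spaces that separate its 50-base groups.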
Gene structure information
H-Inv cluster ID
HIX0218090
Genomic location
Chromosome
8
Location
8q23.1
Position
106813418-106816767
Strand
+
Possible duplicated location(s)
NA
Gene structure
1 exon(s)
Database links
RefSeq
NA
Ensembl
NA
Entrez Gene
Entrez Gene ID:23414;
KEGG GENES
KEGG GENES(23414);
GeneCard
NA
*GeneCards is provided free to academic non-profit institutions.
Related H-InvDB links
H-DBAS; G-integra; cDNA-genome alignment;
Predicted CDS information
HIP ID
HIP000171884
Predicted CDS
1..2349; 782[aa]; Orientation:+1;
Codon Adaptation Index (CAI).
0.752
FGFQTQRELLQHQELHVPSGKLPRESDMEHSPSATEDSLQPATDLLTRSE LPQSQKAMQTKDASSDTELDKCEKKTQLFLTNQRPEIQPTTNKQSFSYTK IKSEPSSPRLASSPVQPNIGPSFPVGPFLSQFSFPQDITMVPQASEILAK MSELVHRRLRHGSSSYPPVIYSPLMPKGATCFECNITFNNLDNYLVHKKH YCSSRWQQMAKSPEFPSVSEKMPEALSPNTGQTSINLLNPAAHSADPENP LLQTSCINSSTVLDLIGPNGKGHDKDFSTQTKKLSTSSNNDDKINGKPVD VKNPSVPLVDGESDPNKTTCEACNITFSRHETYMVHKQYYCATRHDPPLK RSASNKVPAMQRTMRTRKRRKMYEMCLPEQEQRPPLVQQRFLDVANLNNP CTSTQEPTEGLGECYHPRCDIFPGIVSKHLETSLTINKCVPVSKCDTTHS SVSCLEMDVPIDLSKKCLSQSERTTTSPKRLLDYHECTVCKISFNKVENY LAHKQNFCPVTAHQRNDLGQLDGKVFPNPESERNSPDVSYERSIIKCEKN GNLKQPSPNGNLFSSHLATLQGLKVFSEAAQLIATKEENRHLFLPQCLYP GAIKKAKGADQLSPYYGIKPSDYISGSLVIHNTDIEQSRNAENESPKGQA SSNGCAALKKDSLPLLPKNRGMVIVNGGLKQDERPAANPQQENISQNPQH EDDHKSPSWISENPLAANENVSPGVPSAEEQLSSIAKGVNGSSQAPTSGK YCRLCDIQFNNLSNFITHKKFYCSSHAAEHVK*
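The predicted CDS range and the protein length above can be cross-checked with simple arithmetic. The sketch below assumes that the range `1..2349` includes the terminal stop codon, which the trailing `*` in the translation suggests; `parse_predicted_cds` is a hypothetical helper written for this record's layout, not an H-InvDB function.

```python
import re


def parse_predicted_cds(field: str) -> tuple:
    """Parse a value such as '1..2349; 782[aa]; Orientation:+1;'."""
    match = re.fullmatch(
        r"(\d+)\.\.(\d+); (\d+)\[aa\]; Orientation:([+-]\d+);", field.strip()
    )
    if match is None:
        raise ValueError(f"unrecognized 'Predicted CDS' value: {field!r}")
    start, end, n_aa = (int(match.group(i)) for i in (1, 2, 3))
    return start, end, n_aa, match.group(4)


start, end, n_aa, orientation = parse_predicted_cds(
    "1..2349; 782[aa]; Orientation:+1;"
)

# Coordinates are 1-based and inclusive, so the CDS spans end - start + 1 bases.
n_codons, remainder = divmod(end - start + 1, 3)

# A complete reading frame has no leftover bases, and (under the assumption
# above) one codon more than the number of amino acids: the stop codon.
assert remainder == 0
assert n_codons == n_aa + 1
```

Because the CDS feature is flagged as N-truncated, the first codon here is the start of the available sequence rather than an initiator methionine, which is consistent with the translation beginning with F rather than M.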
Motif information
a.a. length  InterPro   Name
22           IPR015880  Zinc finger, C2H2-like [Domain]
28           IPR015880  Zinc finger, C2H2-like [Domain]
21           IPR015880  Zinc finger, C2H2-like [Domain]
27           IPR015880  Zinc finger, C2H2-like [Domain]
Gene function information
H-Inv ID
HIT000328481
H-Inv cluster ID
HIX0218090
Accession number
AB073894.1
CAGE tag ID
NA
EST ID
NA
Transcript feature
NO;
Coding potential
Protein coding;
Definition
Similar to Zinc finger protein ZFPM2; Friend of GATA protein 2; FOG-2; Friend of GATA 2; hFOG-2; Zinc finger protein 89B; Zinc finger protein multitype 2;
Similarity category
Category: Similar to known protein (Category II).
Similar to known Homo sapiens (Human) protein (Q8WW38) [Identity/coverage = 99.872%/67.94%].
Experimental evidence
Protein evidence
PubMed ID
10438528; 14517948; 15489334; 16103912; 23226341;
Gene family/group
H-Inv gene family/group ID
NA
Gene family/group name
NA
Evidence motif (InterPro) ID
NA
Gene symbol/name
HGNC symbol
NA
HGNC aliases
NA
HGNC name
NA
DDBJ
NA
UniProt
ZFPM2
EC number
NA
GGDB (GlycoGene Database)
Gene symbol
NA
Family
NA
Designation
NA
Expression
NA
KEGG metabolic pathway
NA
Protein-protein interaction (PPI)
H-Inv protein ID
HIP000171884
No. of interaction
NA
Interaction partner(s)
NA
BIND
NA
DIP
NA
MINT
NA
HPRD
NA
IntAct
NA
Database links
RefSeq
NA
Ensembl
NA
Entrez Gene
Entrez Gene ID:23414;
KEGG GENES
KEGG GENES(23414);
GeneCard
NA
*GeneCards is provided free to academic non-profit institutions.
etc
Human-Gene diversity Of Life-style related Diseases;
Curation status
Auto-annotated
Notes
NA
Related H-InvDB links
Gene family; Similarity Search Tool; TACT;
fRNAdb (The functional RNA database) : overlapping fRNAdb entries with H-InvDB transcripts based on the location of the genome.
NA
Subcellular localization information
Last modified:27-May-2015
WoLF PSORT
nuclear; cytosol;
Target P
Other
SOSUI
soluble protein
TMHMM
soluble protein
PTS1
Not targeted
Related H-InvDB links
LIFEdb;
Gene expression information
Last modified:27-May-2015
Tissue-specific expression
NA
Probe information
AceGene
AGhsA200114;
Affymetrix GeneChip
HG-Focus
219778_at;
HG-U133
219778_at;
HG-U133A
219778_at;
HG-U133A_2
219778_at;
HG-U133B
NA
HG-U133_Plus_2
219778_at;
HG-U95
49982_at;
HG-U95A
NA
HG-U95B
49982_at;
HG-U95C
NA
HG-U95D
NA
HG-U95E
NA
HG-U95Av2
NA
HuEx-1_0
3110894; 3110895;
HuGeneFL
NA
Agilent
Human 1A Oligo Microarray:PGID215
A_23_P168909;
Whole Human Genome Oligo Microarray:PGID247
A_23_P168909;
Related H-InvDB links
H-ANGEL; DNAProbeLocator;
Polymorphism (SNP, indel), microsatellite (Short Tandem Repeat, STR) and repeat information
Single Nucleotide Polymorphism (SNP) and indel
Location       Variation  dbSNP ID     Strand  CDS/UTR  Translation
53 .. 53       C/G        rs181007123  +       CDS      Nonsynonymous[Pro18Arg]
76 .. 76       A/G        rs201091005  +       CDS      Nonsynonymous[Ser26Gly]
101 .. 101     C/G        rs11993776   +       CDS      Nonsynonymous[Ala34Gly]
104 .. 104     C/T        rs185077999  +       CDS      Nonsynonymous[Thr35Ile]
120 .. 120     G/T        rs201439692  +       CDS      Nonsynonymous[Gln40His]
163 .. 163     C/G        rs191011780  +       CDS      Nonsynonymous[Gln55Glu]
169 .. 169     G/A        rs35843564   +       CDS      Nonsynonymous[Ala57Thr]
211 .. 211     A/G        rs200311467  +       CDS      Nonsynonymous[Lys71Glu]
251 .. 251     G/T        rs181764739  +       CDS      Nonsynonymous[Arg84Ile]
255 .. 255     A/G        rs920628     +       CDS      Synonymous[Pro85Pro]
462 .. 462     G/C        rs920629     +       CDS      Synonymous[Leu154Leu]
471 .. 471     G/T        rs200643137  +       CDS      Synonymous[Arg157Arg]
525 .. 525     G/A        rs187043152  +       CDS      Nonsynonymous[Met175Ile]
600 .. 600     T/G        rs200445538  +       CDS      Nonsynonymous[His200Gln]
668 .. 668     C/G        rs201515430  +       CDS      Nonsynonymous[Pro223Arg]
669 .. 669     T/C        rs16873732   +       CDS      Synonymous[Pro223Pro]
764 .. 764     C/G        rs34248551   +       CDS      Nonsynonymous[Ser255Cys]
862 .. 862     A/G        rs28374544   +       CDS      Nonsynonymous[Ser288Gly]
990 .. 990     C/T        rs139368368  +       CDS      Synonymous[His330His]
1000 .. 1000   A/C        rs121908603  +       CDS      Nonsynonymous[Met334Leu]
1050 .. 1050   G/A        rs111634505  +       CDS      Synonymous[Lys350Lys]
1180 .. 1180   G/A        rs117908591  +       CDS      Nonsynonymous[Val394Ile]
1239 .. 1239   G/C        rs2920048    +       CDS      Nonsynonymous[Glu413Asp]
1278 .. 1278   C/G        rs35998713   +       CDS      Synonymous[Val426Val]
1323 .. 1323   A/T        rs200002039  +       CDS      Synonymous[Pro441Pro]
1336 .. 1336   G/A        rs201197925  +       CDS      Nonsynonymous[Asp446Asn]
1394 .. 1394   A/G        rs113289249  +       CDS      Nonsynonymous[Lys465Arg]
1420 .. 1420   A/G        rs121908604  +       CDS      Nonsynonymous[Thr474Ala]
1558 .. 1558   C/G        rs146423225  +       CDS      Nonsynonymous[Gln520Glu]
1580 .. 1580   C/T        rs200840311  +       CDS      Nonsynonymous[Pro527Leu]
1594 .. 1594   G/A        rs199580917  +       CDS      Nonsynonymous[Glu532Lys]
1621 .. 1621   G/A        rs201981625  +       CDS      Nonsynonymous[Glu541Lys]
1817 .. 1817   C/G        rs200308363  +       CDS      Nonsynonymous[Ala606Gly]
1828 .. 1828   G/A        rs201644250  +       CDS      Nonsynonymous[Asp610Asn]
1856 .. 1856   A/C        rs139881948  +       CDS      Nonsynonymous[Lys619Thr]
1862 .. 1862   G/A        rs201707218  +       CDS      Nonsynonymous[Ser621Asn]
1869 .. 1869   T/C        rs1442320    +       CDS      Synonymous[Tyr623Tyr]
1928 .. 1928   A/G        rs201558304  +       CDS      Nonsynonymous[Asn643Ser]
1930 .. 1930   G/C        rs200487685  +       CDS      Nonsynonymous[Glu644Gln]
1942 .. 1942   G/A        rs201106296  +       CDS      Nonsynonymous[Gly648Ser]
1971 .. 1971   G/A        rs200049316  +       CDS      Synonymous[Ala657Ala]
1979 .. 1979   A/T        rs201729935  +       CDS      Nonsynonymous[Lys660Ile]
2057 .. 2057   C/T        rs16873741   +       CDS      Nonsynonymous[Ala686Val]
2071 .. 2071   C/G        rs201190084  +       CDS      Nonsynonymous[Gln691Glu]
2083 ^ 2084    -/C        rs35944979   +       CDS
2100 .. 2100   C/T        rs11995760   +       CDS      Synonymous[His700His]
2161 .. 2161   G/A        rs191385674  +       CDS      Nonsynonymous[Val721Ile]
2180 .. 2180   C/T        rs183300898  +       CDS      Nonsynonymous[Ser727Leu]
2187 .. 2187   G/C        rs149688628  +       CDS      Nonsynonymous[Glu729Asp]
2191 .. 2191   C/G        rs199678497  +       CDS      Nonsynonymous[Gln731Glu]
2258 .. 2258   G/A        rs202084698  +       CDS      Nonsynonymous[Arg753Gln]
2262 .. 2262   A/G        rs16873744   +       CDS      Synonymous[Leu754Leu]
2403 .. 2403   C/G        rs16873745   +       3'UTR
2439 .. 2439   C/G        rs6991211    +       3'UTR
2546 .. 2546   A/T        rs188136328  +       3'UTR
2562 .. 2562   A/G        rs190229532  +       3'UTR
2583 .. 2583   G/T        rs78644456   +       3'UTR
2691 ^ 2692    -/CA       rs201063987  +       3'UTR
2692 ^ 2693    -/ACAA     rs113429187  +       3'UTR
2693 ^ 2694    -/ACAA     rs35611262   +       3'UTR
2696 ^ 2697    -/ACAA     rs10690000   +       3'UTR
2864 .. 2864   T/C        rs6469016    +       3'UTR
2872 ^ 2873    -/TT       rs199956937  +       3'UTR
2919 .. 2919   T/G        rs182396880  +       3'UTR
3069 .. 3069   C/T        rs1053039    +       3'UTR
3094 ^ 3095    -/G        rs35654205   +       3'UTR
3112 .. 3112   T/G        rs186732745  +       3'UTR
3242 .. 3242   A/C        rs1053040    +       3'UTR
3284 .. 3284   A/G        rs75915476   +       3'UTR
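The residue numbers in the Translation column follow from the transcript positions and the predicted CDS range (1..2349) given earlier in this record. As a minimal sketch, assuming 1-based transcript coordinates and a CDS on the + strand, the hypothetical function `codon_number` below maps a SNP position to the codon it falls in and returns `None` for positions outside the CDS, such as the 3'UTR entries.

```python
from typing import Optional

# CDS bounds taken from the "Predicted CDS" field of this record.
CDS_START = 1
CDS_END = 2349


def codon_number(position: int,
                 cds_start: int = CDS_START,
                 cds_end: int = CDS_END) -> Optional[int]:
    """Return the 1-based codon index for a 1-based transcript position.

    Positions outside the CDS (for example in the 3'UTR) return None.
    """
    if not cds_start <= position <= cds_end:
        return None
    return (position - cds_start) // 3 + 1


# A few positions from the table, paired with the residue number listed there.
listed = {53: 18, 76: 26, 1000: 334, 2262: 754}
for position, residue in listed.items():
    assert codon_number(position) == residue

# Position 2403 lies past the end of the CDS, matching its 3'UTR label.
assert codon_number(2403) is None
```

This only recovers the residue number; deciding whether a change is synonymous would additionally require translating the reference and variant codons.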
Microsatellite (Short Tandem Repeat, STR)
No data available
Microsatellite: Human-Gene diversity Of Life-style related Diseases (H-GOLD)
No data available
Repeat
No data available
Database links
Human-Gene diversity Of Life-style related Diseases (H-GOLD);
Related H-InvDB links
VaryGene; Repeat Mask Viewer;