Amino acid dipepetide frequency for Goose hemorrhagic polyomavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.591AlaAla: 15.591 ± 7.585
0.538AlaCys: 0.538 ± 0.375
1.075AlaAsp: 1.075 ± 0.749
8.065AlaGlu: 8.065 ± 2.093
2.151AlaPhe: 2.151 ± 0.793
5.376AlaGly: 5.376 ± 2.929
0.0AlaHis: 0.0 ± 0.0
5.376AlaIle: 5.376 ± 2.455
2.151AlaLys: 2.151 ± 0.764
5.914AlaLeu: 5.914 ± 2.118
1.613AlaMet: 1.613 ± 0.574
2.688AlaAsn: 2.688 ± 1.032
4.839AlaPro: 4.839 ± 2.811
3.226AlaGln: 3.226 ± 1.108
4.839AlaArg: 4.839 ± 1.54
9.14AlaSer: 9.14 ± 3.526
5.376AlaThr: 5.376 ± 1.076
4.839AlaVal: 4.839 ± 1.474
0.0AlaTrp: 0.0 ± 0.0
3.226AlaTyr: 3.226 ± 0.868
0.0AlaXaa: 0.0 ± 0.0
Cys
2.688CysAla: 2.688 ± 1.436
1.613CysCys: 1.613 ± 0.824
1.075CysAsp: 1.075 ± 0.669
0.538CysGlu: 0.538 ± 0.739
1.075CysPhe: 1.075 ± 0.749
0.538CysGly: 0.538 ± 0.535
0.538CysHis: 0.538 ± 0.535
0.0CysIle: 0.0 ± 0.0
2.151CysLys: 2.151 ± 0.764
1.613CysLeu: 1.613 ± 0.794
0.0CysMet: 0.0 ± 0.0
1.613CysAsn: 1.613 ± 1.423
1.075CysPro: 1.075 ± 0.556
1.075CysGln: 1.075 ± 0.749
0.538CysArg: 0.538 ± 0.375
1.075CysSer: 1.075 ± 0.674
1.613CysThr: 1.613 ± 1.338
1.613CysVal: 1.613 ± 1.024
0.538CysTrp: 0.538 ± 0.535
0.538CysTyr: 0.538 ± 0.535
0.0CysXaa: 0.0 ± 0.0
Asp
2.688AspAla: 2.688 ± 0.964
1.075AspCys: 1.075 ± 0.749
1.613AspAsp: 1.613 ± 1.338
4.301AspGlu: 4.301 ± 1.677
1.075AspPhe: 1.075 ± 0.749
3.226AspGly: 3.226 ± 1.552
0.538AspHis: 0.538 ± 0.375
4.839AspIle: 4.839 ± 1.432
2.151AspLys: 2.151 ± 1.349
3.763AspLeu: 3.763 ± 1.476
1.075AspMet: 1.075 ± 1.368
2.151AspAsn: 2.151 ± 1.111
3.763AspPro: 3.763 ± 1.419
1.075AspGln: 1.075 ± 0.749
2.688AspArg: 2.688 ± 0.592
5.376AspSer: 5.376 ± 1.748
0.538AspThr: 0.538 ± 0.715
2.688AspVal: 2.688 ± 0.964
1.075AspTrp: 1.075 ± 0.701
1.613AspTyr: 1.613 ± 0.634
0.0AspXaa: 0.0 ± 0.0
Glu
7.527GluAla: 7.527 ± 4.007
1.075GluCys: 1.075 ± 0.556
8.065GluAsp: 8.065 ± 2.498
6.989GluGlu: 6.989 ± 3.25
2.151GluPhe: 2.151 ± 0.947
6.452GluGly: 6.452 ± 1.934
0.0GluHis: 0.0 ± 0.0
4.301GluIle: 4.301 ± 1.911
3.763GluLys: 3.763 ± 1.125
6.452GluLeu: 6.452 ± 1.606
1.613GluMet: 1.613 ± 1.359
3.226GluAsn: 3.226 ± 1.156
2.151GluPro: 2.151 ± 1.029
3.763GluGln: 3.763 ± 0.881
3.763GluArg: 3.763 ± 0.92
2.151GluSer: 2.151 ± 1.088
4.839GluThr: 4.839 ± 2.265
4.301GluVal: 4.301 ± 1.098
2.151GluTrp: 2.151 ± 1.402
3.226GluTyr: 3.226 ± 1.289
0.0GluXaa: 0.0 ± 0.0
Phe
2.151PheAla: 2.151 ± 0.858
3.226PheCys: 3.226 ± 2.008
1.075PheAsp: 1.075 ± 0.669
2.151PheGlu: 2.151 ± 0.77
0.538PhePhe: 0.538 ± 0.739
1.613PheGly: 1.613 ± 1.024
1.613PheHis: 1.613 ± 0.794
0.0PheIle: 0.0 ± 0.0
0.538PheLys: 0.538 ± 0.375
2.151PheLeu: 2.151 ± 0.947
0.538PheMet: 0.538 ± 0.375
1.075PheAsn: 1.075 ± 0.749
1.613PhePro: 1.613 ± 0.634
1.613PheGln: 1.613 ± 0.634
3.226PheArg: 3.226 ± 1.248
2.688PheSer: 2.688 ± 1.024
2.688PheThr: 2.688 ± 0.964
2.151PheVal: 2.151 ± 0.526
1.075PheTrp: 1.075 ± 0.701
2.151PheTyr: 2.151 ± 0.526
0.0PheXaa: 0.0 ± 0.0
Gly
4.301GlyAla: 4.301 ± 0.71
1.613GlyCys: 1.613 ± 0.686
3.763GlyAsp: 3.763 ± 1.419
6.989GlyGlu: 6.989 ± 3.126
2.688GlyPhe: 2.688 ± 1.199
7.527GlyGly: 7.527 ± 0.973
0.538GlyHis: 0.538 ± 0.375
2.151GlyIle: 2.151 ± 0.631
3.763GlyLys: 3.763 ± 1.36
8.065GlyLeu: 8.065 ± 2.602
1.075GlyMet: 1.075 ± 0.525
1.075GlyAsn: 1.075 ± 0.701
9.14GlyPro: 9.14 ± 2.431
6.989GlyGln: 6.989 ± 1.784
2.688GlyArg: 2.688 ± 1.52
3.763GlySer: 3.763 ± 1.974
2.688GlyThr: 2.688 ± 1.103
5.376GlyVal: 5.376 ± 2.882
1.075GlyTrp: 1.075 ± 0.701
3.763GlyTyr: 3.763 ± 1.044
0.0GlyXaa: 0.0 ± 0.0
His
2.688HisAla: 2.688 ± 2.022
1.075HisCys: 1.075 ± 0.749
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.075HisPhe: 1.075 ± 0.556
1.075HisGly: 1.075 ± 0.525
2.151HisHis: 2.151 ± 1.494
0.0HisIle: 0.0 ± 0.0
0.538HisLys: 0.538 ± 0.375
1.075HisLeu: 1.075 ± 0.749
1.613HisMet: 1.613 ± 1.024
1.075HisAsn: 1.075 ± 0.749
1.075HisPro: 1.075 ± 0.669
0.538HisGln: 0.538 ± 0.375
1.613HisArg: 1.613 ± 0.634
1.613HisSer: 1.613 ± 0.696
1.613HisThr: 1.613 ± 0.783
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.538HisTyr: 0.538 ± 0.375
0.0HisXaa: 0.0 ± 0.0
Ile
4.301IleAla: 4.301 ± 1.472
1.075IleCys: 1.075 ± 0.803
1.613IleAsp: 1.613 ± 0.794
4.839IleGlu: 4.839 ± 1.39
0.538IlePhe: 0.538 ± 0.375
1.075IleGly: 1.075 ± 0.604
1.075IleHis: 1.075 ± 0.749
2.151IleIle: 2.151 ± 1.094
2.151IleLys: 2.151 ± 1.046
5.376IleLeu: 5.376 ± 2.221
0.538IleMet: 0.538 ± 0.352
3.226IleAsn: 3.226 ± 1.349
2.151IlePro: 2.151 ± 0.76
0.538IleGln: 0.538 ± 0.375
0.538IleArg: 0.538 ± 0.535
1.613IleSer: 1.613 ± 1.003
3.226IleThr: 3.226 ± 1.349
1.613IleVal: 1.613 ± 1.134
0.0IleTrp: 0.0 ± 0.0
0.538IleTyr: 0.538 ± 0.375
0.0IleXaa: 0.0 ± 0.0
Lys
5.376LysAla: 5.376 ± 0.855
0.538LysCys: 0.538 ± 0.375
2.151LysAsp: 2.151 ± 1.111
3.226LysGlu: 3.226 ± 1.044
0.0LysPhe: 0.0 ± 0.0
4.301LysGly: 4.301 ± 1.624
2.688LysHis: 2.688 ± 1.874
1.075LysIle: 1.075 ± 0.749
3.226LysLys: 3.226 ± 1.268
2.688LysLeu: 2.688 ± 1.42
1.613LysMet: 1.613 ± 0.72
1.075LysAsn: 1.075 ± 0.525
2.688LysPro: 2.688 ± 1.026
1.075LysGln: 1.075 ± 0.856
8.065LysArg: 8.065 ± 1.443
2.151LysSer: 2.151 ± 1.046
3.763LysThr: 3.763 ± 1.036
2.688LysVal: 2.688 ± 0.835
0.538LysTrp: 0.538 ± 0.375
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.839LeuAla: 4.839 ± 1.822
1.075LeuCys: 1.075 ± 0.556
3.763LeuAsp: 3.763 ± 1.846
6.989LeuGlu: 6.989 ± 1.766
6.989LeuPhe: 6.989 ± 1.766
4.839LeuGly: 4.839 ± 1.316
2.151LeuHis: 2.151 ± 0.793
6.989LeuIle: 6.989 ± 2.603
4.839LeuLys: 4.839 ± 1.432
9.677LeuLeu: 9.677 ± 2.816
4.301LeuMet: 4.301 ± 2.479
5.914LeuAsn: 5.914 ± 1.94
8.065LeuPro: 8.065 ± 1.923
3.763LeuGln: 3.763 ± 2.153
2.151LeuArg: 2.151 ± 0.764
5.914LeuSer: 5.914 ± 1.471
2.688LeuThr: 2.688 ± 0.983
2.151LeuVal: 2.151 ± 1.111
0.0LeuTrp: 0.0 ± 0.0
1.613LeuTyr: 1.613 ± 0.696
0.0LeuXaa: 0.0 ± 0.0
Met
1.613MetAla: 1.613 ± 0.696
0.538MetCys: 0.538 ± 0.535
1.075MetAsp: 1.075 ± 1.429
1.613MetGlu: 1.613 ± 0.794
2.151MetPhe: 2.151 ± 0.764
1.075MetGly: 1.075 ± 0.604
0.0MetHis: 0.0 ± 0.0
0.538MetIle: 0.538 ± 0.375
1.075MetLys: 1.075 ± 0.669
4.301MetLeu: 4.301 ± 0.869
0.538MetMet: 0.538 ± 0.375
1.075MetAsn: 1.075 ± 0.556
1.613MetPro: 1.613 ± 1.508
0.538MetGln: 0.538 ± 0.535
0.0MetArg: 0.0 ± 0.0
2.151MetSer: 2.151 ± 0.947
0.538MetThr: 0.538 ± 0.535
1.075MetVal: 1.075 ± 0.556
0.538MetTrp: 0.538 ± 0.535
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.613AsnAla: 1.613 ± 0.794
0.538AsnCys: 0.538 ± 0.535
0.538AsnAsp: 0.538 ± 0.375
2.151AsnGlu: 2.151 ± 1.029
0.538AsnPhe: 0.538 ± 0.375
4.301AsnGly: 4.301 ± 1.748
1.075AsnHis: 1.075 ± 0.525
1.075AsnIle: 1.075 ± 0.674
1.075AsnLys: 1.075 ± 0.749
3.763AsnLeu: 3.763 ± 1.82
0.538AsnMet: 0.538 ± 0.483
0.538AsnAsn: 0.538 ± 0.375
3.226AsnPro: 3.226 ± 1.32
2.688AsnGln: 2.688 ± 1.283
1.075AsnArg: 1.075 ± 0.749
2.151AsnSer: 2.151 ± 0.76
2.151AsnThr: 2.151 ± 1.111
1.613AsnVal: 1.613 ± 0.783
0.538AsnTrp: 0.538 ± 0.535
2.151AsnTyr: 2.151 ± 1.046
0.0AsnXaa: 0.0 ± 0.0
Pro
2.688ProAla: 2.688 ± 2.848
0.538ProCys: 0.538 ± 0.535
8.065ProAsp: 8.065 ± 1.548
6.989ProGlu: 6.989 ± 2.791
0.538ProPhe: 0.538 ± 0.535
7.527ProGly: 7.527 ± 1.942
3.763ProHis: 3.763 ± 1.434
0.538ProIle: 0.538 ± 0.535
4.301ProLys: 4.301 ± 1.636
5.914ProLeu: 5.914 ± 1.526
1.613ProMet: 1.613 ± 1.162
0.0ProAsn: 0.0 ± 0.0
8.065ProPro: 8.065 ± 2.148
1.613ProGln: 1.613 ± 0.634
5.914ProArg: 5.914 ± 1.92
4.301ProSer: 4.301 ± 2.336
1.075ProThr: 1.075 ± 0.701
6.989ProVal: 6.989 ± 2.006
2.688ProTrp: 2.688 ± 1.533
2.688ProTyr: 2.688 ± 0.633
0.0ProXaa: 0.0 ± 0.0
Gln
2.688GlnAla: 2.688 ± 1.493
0.538GlnCys: 0.538 ± 0.739
1.075GlnAsp: 1.075 ± 0.701
2.151GlnGlu: 2.151 ± 1.499
2.688GlnPhe: 2.688 ± 1.032
3.226GlnGly: 3.226 ± 0.825
1.613GlnHis: 1.613 ± 1.024
1.075GlnIle: 1.075 ± 0.749
3.226GlnLys: 3.226 ± 0.84
2.151GlnLeu: 2.151 ± 1.499
0.538GlnMet: 0.538 ± 0.375
1.075GlnAsn: 1.075 ± 0.556
5.914GlnPro: 5.914 ± 1.525
2.688GlnGln: 2.688 ± 1.29
2.151GlnArg: 2.151 ± 1.402
1.075GlnSer: 1.075 ± 0.749
3.226GlnThr: 3.226 ± 1.108
1.075GlnVal: 1.075 ± 1.069
0.538GlnTrp: 0.538 ± 0.375
1.613GlnTyr: 1.613 ± 0.696
0.0GlnXaa: 0.0 ± 0.0
Arg
8.602ArgAla: 8.602 ± 3.363
0.538ArgCys: 0.538 ± 0.375
2.688ArgAsp: 2.688 ± 1.404
4.301ArgGlu: 4.301 ± 2.202
2.151ArgPhe: 2.151 ± 1.077
6.989ArgGly: 6.989 ± 2.253
0.0ArgHis: 0.0 ± 0.0
1.613ArgIle: 1.613 ± 0.783
2.688ArgLys: 2.688 ± 0.835
4.839ArgLeu: 4.839 ± 1.075
1.075ArgMet: 1.075 ± 1.069
2.151ArgAsn: 2.151 ± 1.536
2.151ArgPro: 2.151 ± 1.024
1.613ArgGln: 1.613 ± 0.634
4.839ArgArg: 4.839 ± 1.853
0.0ArgSer: 0.0 ± 0.0
3.226ArgThr: 3.226 ± 1.396
3.763ArgVal: 3.763 ± 0.598
1.613ArgTrp: 1.613 ± 0.634
2.688ArgTyr: 2.688 ± 0.851
0.0ArgXaa: 0.0 ± 0.0
Ser
6.452SerAla: 6.452 ± 2.992
3.763SerCys: 3.763 ± 1.368
1.613SerAsp: 1.613 ± 1.338
4.301SerGlu: 4.301 ± 2.452
2.688SerPhe: 2.688 ± 1.42
5.376SerGly: 5.376 ± 1.895
0.0SerHis: 0.0 ± 0.0
3.226SerIle: 3.226 ± 1.067
1.075SerLys: 1.075 ± 0.525
6.989SerLeu: 6.989 ± 2.29
0.0SerMet: 0.0 ± 0.0
1.075SerAsn: 1.075 ± 0.556
6.452SerPro: 6.452 ± 3.906
4.839SerGln: 4.839 ± 1.45
3.763SerArg: 3.763 ± 1.359
4.839SerSer: 4.839 ± 2.844
6.989SerThr: 6.989 ± 2.557
2.151SerVal: 2.151 ± 1.029
0.0SerTrp: 0.0 ± 0.0
1.075SerTyr: 1.075 ± 0.669
0.0SerXaa: 0.0 ± 0.0
Thr
4.301ThrAla: 4.301 ± 1.558
0.538ThrCys: 0.538 ± 0.715
2.688ThrAsp: 2.688 ± 0.983
3.763ThrGlu: 3.763 ± 1.803
2.151ThrPhe: 2.151 ± 0.823
5.914ThrGly: 5.914 ± 0.904
0.538ThrHis: 0.538 ± 0.535
1.075ThrIle: 1.075 ± 0.604
2.151ThrLys: 2.151 ± 0.77
3.226ThrLeu: 3.226 ± 1.349
1.613ThrMet: 1.613 ± 0.783
1.613ThrAsn: 1.613 ± 0.72
2.688ThrPro: 2.688 ± 1.024
1.075ThrGln: 1.075 ± 0.674
2.688ThrArg: 2.688 ± 0.633
8.065ThrSer: 8.065 ± 1.344
5.376ThrThr: 5.376 ± 2.778
6.989ThrVal: 6.989 ± 1.56
0.0ThrTrp: 0.0 ± 0.0
1.075ThrTyr: 1.075 ± 1.069
0.0ThrXaa: 0.0 ± 0.0
Val
3.226ValAla: 3.226 ± 0.983
1.075ValCys: 1.075 ± 0.674
3.763ValAsp: 3.763 ± 0.687
4.301ValGlu: 4.301 ± 1.452
1.075ValPhe: 1.075 ± 0.749
4.301ValGly: 4.301 ± 2.426
1.075ValHis: 1.075 ± 0.556
0.538ValIle: 0.538 ± 0.53
3.763ValLys: 3.763 ± 1.846
5.376ValLeu: 5.376 ± 2.115
0.0ValMet: 0.0 ± 0.0
2.151ValAsn: 2.151 ± 0.77
6.452ValPro: 6.452 ± 1.716
0.538ValGln: 0.538 ± 0.535
2.688ValArg: 2.688 ± 1.559
5.914ValSer: 5.914 ± 1.223
3.763ValThr: 3.763 ± 1.178
3.226ValVal: 3.226 ± 0.825
0.0ValTrp: 0.0 ± 0.0
4.301ValTyr: 4.301 ± 1.778
0.0ValXaa: 0.0 ± 0.0
Trp
0.538TrpAla: 0.538 ± 0.375
0.538TrpCys: 0.538 ± 0.535
1.075TrpAsp: 1.075 ± 0.701
1.613TrpGlu: 1.613 ± 0.696
0.0TrpPhe: 0.0 ± 0.0
1.075TrpGly: 1.075 ± 0.701
0.0TrpHis: 0.0 ± 0.0
1.075TrpIle: 1.075 ± 0.701
2.151TrpLys: 2.151 ± 1.402
0.538TrpLeu: 0.538 ± 0.375
1.075TrpMet: 1.075 ± 0.701
0.538TrpAsn: 0.538 ± 0.739
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.075TrpArg: 1.075 ± 0.701
0.538TrpSer: 0.538 ± 0.535
1.075TrpThr: 1.075 ± 0.701
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.538TrpTyr: 0.538 ± 0.375
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.151TyrAla: 2.151 ± 1.046
0.0TyrCys: 0.0 ± 0.0
1.075TyrAsp: 1.075 ± 0.701
2.688TyrGlu: 2.688 ± 0.835
1.613TyrPhe: 1.613 ± 1.024
3.226TyrGly: 3.226 ± 1.108
0.0TyrHis: 0.0 ± 0.0
0.538TyrIle: 0.538 ± 0.375
2.151TyrLys: 2.151 ± 1.046
4.839TyrLeu: 4.839 ± 1.237
0.538TyrMet: 0.538 ± 0.375
0.0TyrAsn: 0.0 ± 0.0
2.688TyrPro: 2.688 ± 1.478
1.075TyrGln: 1.075 ± 0.701
3.226TyrArg: 3.226 ± 1.339
2.151TyrSer: 2.151 ± 1.536
1.075TyrThr: 1.075 ± 0.556
3.226TyrVal: 3.226 ± 1.268
1.075TyrTrp: 1.075 ± 0.701
1.613TyrTyr: 1.613 ± 0.982
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1861 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski