Amino acid dipepetide frequency for Shahe isopoda virus 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.535AlaAla: 4.535 ± 1.753
1.134AlaCys: 1.134 ± 0.485
1.701AlaAsp: 1.701 ± 0.995
3.968AlaGlu: 3.968 ± 1.428
5.102AlaPhe: 5.102 ± 2.476
8.503AlaGly: 8.503 ± 2.204
1.701AlaHis: 1.701 ± 0.995
2.834AlaIle: 2.834 ± 1.211
5.102AlaLys: 5.102 ± 0.366
6.236AlaLeu: 6.236 ± 1.806
1.701AlaMet: 1.701 ± 0.995
1.701AlaAsn: 1.701 ± 1.147
7.37AlaPro: 7.37 ± 1.698
2.834AlaGln: 2.834 ± 1.0
7.37AlaArg: 7.37 ± 3.1
8.503AlaSer: 8.503 ± 5.196
6.236AlaThr: 6.236 ± 1.508
3.401AlaVal: 3.401 ± 1.375
0.567AlaTrp: 0.567 ± 0.332
1.701AlaTyr: 1.701 ± 0.502
0.0AlaXaa: 0.0 ± 0.0
Cys
2.834CysAla: 2.834 ± 1.52
0.0CysCys: 0.0 ± 0.0
1.134CysAsp: 1.134 ± 0.581
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.134CysGly: 1.134 ± 0.485
0.0CysHis: 0.0 ± 0.0
0.567CysIle: 0.567 ± 0.332
0.0CysLys: 0.0 ± 0.0
2.268CysLeu: 2.268 ± 1.162
0.0CysMet: 0.0 ± 0.0
1.134CysAsn: 1.134 ± 0.664
0.0CysPro: 0.0 ± 0.0
1.701CysGln: 1.701 ± 0.565
2.268CysArg: 2.268 ± 1.053
1.701CysSer: 1.701 ± 0.721
0.0CysThr: 0.0 ± 0.0
1.701CysVal: 1.701 ± 0.995
0.0CysTrp: 0.0 ± 0.0
0.567CysTyr: 0.567 ± 0.332
0.0CysXaa: 0.0 ± 0.0
Asp
4.535AspAla: 4.535 ± 0.481
1.134AspCys: 1.134 ± 0.664
2.268AspAsp: 2.268 ± 0.789
2.268AspGlu: 2.268 ± 0.961
2.268AspPhe: 2.268 ± 0.954
3.968AspGly: 3.968 ± 0.8
0.0AspHis: 0.0 ± 0.0
1.701AspIle: 1.701 ± 0.502
1.701AspLys: 1.701 ± 0.995
3.401AspLeu: 3.401 ± 1.546
0.567AspMet: 0.567 ± 0.332
0.0AspAsn: 0.0 ± 0.0
4.535AspPro: 4.535 ± 1.704
0.567AspGln: 0.567 ± 0.61
2.268AspArg: 2.268 ± 0.961
1.134AspSer: 1.134 ± 0.664
2.268AspThr: 2.268 ± 0.875
1.134AspVal: 1.134 ± 0.485
0.567AspTrp: 0.567 ± 0.613
1.134AspTyr: 1.134 ± 1.22
0.0AspXaa: 0.0 ± 0.0
Glu
6.236GluAla: 6.236 ± 1.92
0.567GluCys: 0.567 ± 0.332
1.134GluAsp: 1.134 ± 1.028
3.968GluGlu: 3.968 ± 1.279
2.834GluPhe: 2.834 ± 1.071
6.803GluGly: 6.803 ± 2.343
1.701GluHis: 1.701 ± 0.995
1.134GluIle: 1.134 ± 0.664
3.401GluLys: 3.401 ± 1.744
6.236GluLeu: 6.236 ± 1.202
2.268GluMet: 2.268 ± 0.961
2.268GluAsn: 2.268 ± 1.758
1.701GluPro: 1.701 ± 2.188
2.834GluGln: 2.834 ± 1.425
3.401GluArg: 3.401 ± 1.546
3.968GluSer: 3.968 ± 2.766
2.268GluThr: 2.268 ± 0.789
2.834GluVal: 2.834 ± 0.789
1.701GluTrp: 1.701 ± 1.147
2.268GluTyr: 2.268 ± 0.434
0.0GluXaa: 0.0 ± 0.0
Phe
2.834PheAla: 2.834 ± 0.775
1.134PheCys: 1.134 ± 0.664
2.268PheAsp: 2.268 ± 0.961
1.134PheGlu: 1.134 ± 0.664
1.134PhePhe: 1.134 ± 0.664
6.236PheGly: 6.236 ± 2.34
0.567PheHis: 0.567 ± 0.332
0.567PheIle: 0.567 ± 0.61
0.0PheLys: 0.0 ± 0.0
3.968PheLeu: 3.968 ± 0.882
0.567PheMet: 0.567 ± 0.332
1.134PheAsn: 1.134 ± 1.22
1.701PhePro: 1.701 ± 1.201
0.0PheGln: 0.0 ± 0.0
1.134PheArg: 1.134 ± 1.22
1.134PheSer: 1.134 ± 0.664
1.134PheThr: 1.134 ± 0.581
3.401PheVal: 3.401 ± 0.904
1.134PheTrp: 1.134 ± 0.732
1.701PheTyr: 1.701 ± 0.502
0.0PheXaa: 0.0 ± 0.0
Gly
7.937GlyAla: 7.937 ± 0.567
2.268GlyCys: 2.268 ± 1.162
4.535GlyAsp: 4.535 ± 0.868
5.669GlyGlu: 5.669 ± 2.382
1.701GlyPhe: 1.701 ± 0.995
5.102GlyGly: 5.102 ± 1.883
2.268GlyHis: 2.268 ± 1.021
3.968GlyIle: 3.968 ± 2.697
5.669GlyLys: 5.669 ± 2.394
7.37GlyLeu: 7.37 ± 0.934
3.968GlyMet: 3.968 ± 1.191
3.968GlyAsn: 3.968 ± 0.998
6.803GlyPro: 6.803 ± 1.563
3.968GlyGln: 3.968 ± 0.8
9.07GlyArg: 9.07 ± 2.489
6.236GlySer: 6.236 ± 1.508
6.803GlyThr: 6.803 ± 1.967
3.401GlyVal: 3.401 ± 1.375
1.134GlyTrp: 1.134 ± 0.581
0.567GlyTyr: 0.567 ± 0.61
0.567GlyXaa: 0.567 ± 0.332
His
1.701HisAla: 1.701 ± 0.565
1.134HisCys: 1.134 ± 0.664
0.0HisAsp: 0.0 ± 0.0
1.134HisGlu: 1.134 ± 0.664
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.567HisHis: 0.567 ± 0.332
0.0HisIle: 0.0 ± 0.0
0.567HisLys: 0.567 ± 0.332
3.401HisLeu: 3.401 ± 1.355
0.567HisMet: 0.567 ± 0.332
1.134HisAsn: 1.134 ± 0.581
1.701HisPro: 1.701 ± 0.969
0.567HisGln: 0.567 ± 0.332
1.701HisArg: 1.701 ± 0.995
2.268HisSer: 2.268 ± 1.021
0.0HisThr: 0.0 ± 0.0
1.134HisVal: 1.134 ± 0.581
0.0HisTrp: 0.0 ± 0.0
0.567HisTyr: 0.567 ± 0.332
0.0HisXaa: 0.0 ± 0.0
Ile
3.968IleAla: 3.968 ± 1.48
0.0IleCys: 0.0 ± 0.0
1.701IleAsp: 1.701 ± 1.051
2.268IleGlu: 2.268 ± 1.021
0.567IlePhe: 0.567 ± 0.332
2.268IleGly: 2.268 ± 0.954
0.567IleHis: 0.567 ± 0.332
3.401IleIle: 3.401 ± 1.787
2.268IleLys: 2.268 ± 0.789
1.701IleLeu: 1.701 ± 0.565
0.567IleMet: 0.567 ± 0.61
0.567IleAsn: 0.567 ± 0.332
3.968IlePro: 3.968 ± 1.259
2.268IleGln: 2.268 ± 0.434
1.701IleArg: 1.701 ± 0.721
2.268IleSer: 2.268 ± 0.961
3.401IleThr: 3.401 ± 2.012
1.701IleVal: 1.701 ± 1.063
0.0IleTrp: 0.0 ± 0.0
1.134IleTyr: 1.134 ± 1.22
0.0IleXaa: 0.0 ± 0.0
Lys
4.535LysAla: 4.535 ± 0.481
1.701LysCys: 1.701 ± 0.565
1.134LysAsp: 1.134 ± 0.485
2.268LysGlu: 2.268 ± 1.01
1.134LysPhe: 1.134 ± 0.485
5.102LysGly: 5.102 ± 2.333
0.567LysHis: 0.567 ± 0.613
1.701LysIle: 1.701 ± 0.565
4.535LysLys: 4.535 ± 0.868
4.535LysLeu: 4.535 ± 1.836
1.701LysMet: 1.701 ± 0.969
1.701LysAsn: 1.701 ± 0.995
2.268LysPro: 2.268 ± 0.789
2.268LysGln: 2.268 ± 0.971
3.401LysArg: 3.401 ± 0.849
3.401LysSer: 3.401 ± 1.375
1.701LysThr: 1.701 ± 0.565
2.268LysVal: 2.268 ± 0.954
1.134LysTrp: 1.134 ± 0.581
1.134LysTyr: 1.134 ± 0.664
0.0LysXaa: 0.0 ± 0.0
Leu
5.102LeuAla: 5.102 ± 1.471
1.701LeuCys: 1.701 ± 1.403
1.701LeuAsp: 1.701 ± 0.502
7.937LeuGlu: 7.937 ± 3.144
2.834LeuPhe: 2.834 ± 0.899
11.338LeuGly: 11.338 ± 2.427
1.134LeuHis: 1.134 ± 0.664
2.834LeuIle: 2.834 ± 0.852
5.669LeuLys: 5.669 ± 1.888
10.771LeuLeu: 10.771 ± 5.259
2.268LeuMet: 2.268 ± 0.875
1.134LeuAsn: 1.134 ± 0.485
6.236LeuPro: 6.236 ± 2.43
5.102LeuGln: 5.102 ± 5.71
6.236LeuArg: 6.236 ± 2.595
6.803LeuSer: 6.803 ± 4.022
5.102LeuThr: 5.102 ± 2.316
2.834LeuVal: 2.834 ± 2.0
1.701LeuTrp: 1.701 ± 0.721
1.134LeuTyr: 1.134 ± 0.664
0.0LeuXaa: 0.0 ± 0.0
Met
1.134MetAla: 1.134 ± 0.664
0.0MetCys: 0.0 ± 0.0
0.567MetAsp: 0.567 ± 0.332
1.134MetGlu: 1.134 ± 1.028
1.701MetPhe: 1.701 ± 1.063
0.567MetGly: 0.567 ± 0.332
0.567MetHis: 0.567 ± 1.18
0.567MetIle: 0.567 ± 1.18
1.701MetLys: 1.701 ± 0.995
2.834MetLeu: 2.834 ± 1.169
2.268MetMet: 2.268 ± 1.507
1.134MetAsn: 1.134 ± 0.664
0.567MetPro: 0.567 ± 0.332
2.268MetGln: 2.268 ± 1.758
1.134MetArg: 1.134 ± 0.485
2.834MetSer: 2.834 ± 0.587
1.134MetThr: 1.134 ± 0.485
1.701MetVal: 1.701 ± 0.502
0.567MetTrp: 0.567 ± 0.332
0.567MetTyr: 0.567 ± 0.61
0.0MetXaa: 0.0 ± 0.0
Asn
1.701AsnAla: 1.701 ± 1.201
0.0AsnCys: 0.0 ± 0.0
0.0AsnAsp: 0.0 ± 0.0
2.268AsnGlu: 2.268 ± 0.961
1.701AsnPhe: 1.701 ± 0.502
2.268AsnGly: 2.268 ± 0.954
0.0AsnHis: 0.0 ± 0.0
1.701AsnIle: 1.701 ± 1.063
1.701AsnLys: 1.701 ± 0.565
1.701AsnLeu: 1.701 ± 0.969
0.567AsnMet: 0.567 ± 0.61
1.134AsnAsn: 1.134 ± 0.485
0.0AsnPro: 0.0 ± 0.0
0.567AsnGln: 0.567 ± 0.61
3.968AsnArg: 3.968 ± 0.8
3.401AsnSer: 3.401 ± 1.629
2.834AsnThr: 2.834 ± 0.587
1.701AsnVal: 1.701 ± 1.051
0.567AsnTrp: 0.567 ± 0.332
1.134AsnTyr: 1.134 ± 0.732
0.0AsnXaa: 0.0 ± 0.0
Pro
5.102ProAla: 5.102 ± 0.366
1.134ProCys: 1.134 ± 0.664
4.535ProAsp: 4.535 ± 1.212
2.268ProGlu: 2.268 ± 0.961
3.401ProPhe: 3.401 ± 1.456
7.937ProGly: 7.937 ± 2.383
1.134ProHis: 1.134 ± 1.028
2.834ProIle: 2.834 ± 1.157
2.834ProLys: 2.834 ± 1.157
5.102ProLeu: 5.102 ± 2.044
0.567ProMet: 0.567 ± 0.473
0.567ProAsn: 0.567 ± 0.61
4.535ProPro: 4.535 ± 1.212
3.401ProGln: 3.401 ± 1.443
3.401ProArg: 3.401 ± 0.849
3.968ProSer: 3.968 ± 0.998
5.669ProThr: 5.669 ± 2.549
5.669ProVal: 5.669 ± 0.507
1.134ProTrp: 1.134 ± 0.664
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.701GlnAla: 1.701 ± 0.721
0.567GlnCys: 0.567 ± 0.332
2.268GlnAsp: 2.268 ± 0.434
3.968GlnGlu: 3.968 ± 3.549
1.134GlnPhe: 1.134 ± 0.485
3.968GlnGly: 3.968 ± 2.202
2.268GlnHis: 2.268 ± 1.327
2.268GlnIle: 2.268 ± 0.434
1.134GlnLys: 1.134 ± 0.485
2.268GlnLeu: 2.268 ± 1.053
0.567GlnMet: 0.567 ± 1.18
1.134GlnAsn: 1.134 ± 1.22
3.401GlnPro: 3.401 ± 0.904
4.535GlnGln: 4.535 ± 1.335
2.268GlnArg: 2.268 ± 1.053
2.834GlnSer: 2.834 ± 1.0
2.268GlnThr: 2.268 ± 2.193
2.268GlnVal: 2.268 ± 0.434
0.0GlnTrp: 0.0 ± 0.0
2.268GlnTyr: 2.268 ± 0.954
0.0GlnXaa: 0.0 ± 0.0
Arg
8.503ArgAla: 8.503 ± 3.209
0.567ArgCys: 0.567 ± 0.332
3.968ArgAsp: 3.968 ± 1.022
6.236ArgGlu: 6.236 ± 2.986
1.701ArgPhe: 1.701 ± 1.838
7.37ArgGly: 7.37 ± 1.494
1.701ArgHis: 1.701 ± 0.721
2.834ArgIle: 2.834 ± 0.587
4.535ArgLys: 4.535 ± 0.995
5.102ArgLeu: 5.102 ± 2.907
1.134ArgMet: 1.134 ± 0.935
1.701ArgAsn: 1.701 ± 0.969
3.968ArgPro: 3.968 ± 1.466
3.401ArgGln: 3.401 ± 1.114
9.637ArgArg: 9.637 ± 1.023
6.803ArgSer: 6.803 ± 3.129
4.535ArgThr: 4.535 ± 2.01
4.535ArgVal: 4.535 ± 2.01
1.134ArgTrp: 1.134 ± 0.664
2.834ArgTyr: 2.834 ± 0.587
0.0ArgXaa: 0.0 ± 0.0
Ser
2.834SerAla: 2.834 ± 1.425
0.0SerCys: 0.0 ± 0.0
1.701SerAsp: 1.701 ± 0.969
0.567SerGlu: 0.567 ± 0.332
2.834SerPhe: 2.834 ± 0.775
8.503SerGly: 8.503 ± 2.579
1.134SerHis: 1.134 ± 0.485
3.401SerIle: 3.401 ± 1.114
1.701SerLys: 1.701 ± 0.565
9.637SerLeu: 9.637 ± 7.168
0.567SerMet: 0.567 ± 0.61
2.268SerAsn: 2.268 ± 1.649
5.102SerPro: 5.102 ± 1.942
2.268SerGln: 2.268 ± 2.279
11.338SerArg: 11.338 ± 3.687
9.07SerSer: 9.07 ± 2.264
6.803SerThr: 6.803 ± 1.344
6.236SerVal: 6.236 ± 2.787
1.701SerTrp: 1.701 ± 1.147
1.134SerTyr: 1.134 ± 0.485
0.0SerXaa: 0.0 ± 0.0
Thr
5.669ThrAla: 5.669 ± 1.174
0.567ThrCys: 0.567 ± 0.613
1.701ThrAsp: 1.701 ± 1.051
4.535ThrGlu: 4.535 ± 1.753
2.268ThrPhe: 2.268 ± 1.01
5.102ThrGly: 5.102 ± 1.185
0.567ThrHis: 0.567 ± 0.332
0.567ThrIle: 0.567 ± 0.332
1.701ThrLys: 1.701 ± 0.995
3.401ThrLeu: 3.401 ± 1.198
2.268ThrMet: 2.268 ± 1.24
3.401ThrAsn: 3.401 ± 1.629
6.236ThrPro: 6.236 ± 2.759
1.134ThrGln: 1.134 ± 1.028
6.236ThrArg: 6.236 ± 2.05
6.236ThrSer: 6.236 ± 2.583
6.236ThrThr: 6.236 ± 2.958
2.834ThrVal: 2.834 ± 0.775
1.134ThrTrp: 1.134 ± 0.732
1.701ThrTyr: 1.701 ± 0.565
0.0ThrXaa: 0.0 ± 0.0
Val
3.968ValAla: 3.968 ± 0.8
2.268ValCys: 2.268 ± 0.434
3.401ValAsp: 3.401 ± 1.375
3.968ValGlu: 3.968 ± 1.022
0.567ValPhe: 0.567 ± 0.613
3.968ValGly: 3.968 ± 0.8
0.567ValHis: 0.567 ± 0.332
2.268ValIle: 2.268 ± 1.053
1.134ValLys: 1.134 ± 0.664
6.236ValLeu: 6.236 ± 1.664
2.268ValMet: 2.268 ± 1.021
2.268ValAsn: 2.268 ± 0.971
4.535ValPro: 4.535 ± 1.699
1.701ValGln: 1.701 ± 0.565
2.834ValArg: 2.834 ± 1.032
3.401ValSer: 3.401 ± 0.974
3.401ValThr: 3.401 ± 1.325
5.669ValVal: 5.669 ± 1.651
1.134ValTrp: 1.134 ± 1.22
1.701ValTyr: 1.701 ± 0.721
0.0ValXaa: 0.0 ± 0.0
Trp
2.834TrpAla: 2.834 ± 0.899
0.0TrpCys: 0.0 ± 0.0
0.567TrpAsp: 0.567 ± 0.613
2.268TrpGlu: 2.268 ± 0.434
0.0TrpPhe: 0.0 ± 0.0
1.134TrpGly: 1.134 ± 0.664
0.567TrpHis: 0.567 ± 0.613
0.0TrpIle: 0.0 ± 0.0
1.701TrpLys: 1.701 ± 1.205
0.567TrpLeu: 0.567 ± 0.332
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.134TrpGln: 1.134 ± 1.225
1.701TrpArg: 1.701 ± 1.147
1.701TrpSer: 1.701 ± 0.565
1.134TrpThr: 1.134 ± 0.664
1.134TrpVal: 1.134 ± 0.581
0.0TrpTrp: 0.0 ± 0.0
0.567TrpTyr: 0.567 ± 0.61
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.535TyrAla: 4.535 ± 3.298
1.134TyrCys: 1.134 ± 0.485
0.567TyrAsp: 0.567 ± 0.61
1.701TyrGlu: 1.701 ± 0.995
0.0TyrPhe: 0.0 ± 0.0
1.701TyrGly: 1.701 ± 0.721
0.567TyrHis: 0.567 ± 0.613
1.134TyrIle: 1.134 ± 0.485
1.134TyrLys: 1.134 ± 0.664
2.834TyrLeu: 2.834 ± 1.243
0.0TyrMet: 0.0 ± 0.0
0.567TyrAsn: 0.567 ± 0.613
0.567TyrPro: 0.567 ± 0.61
0.567TyrGln: 0.567 ± 0.61
1.701TyrArg: 1.701 ± 1.051
1.134TyrSer: 1.134 ± 0.485
0.567TyrThr: 0.567 ± 0.61
1.701TyrVal: 1.701 ± 0.721
1.701TyrTrp: 1.701 ± 1.205
0.567TyrTyr: 0.567 ± 0.61
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.567XaaAsp: 0.567 ± 0.332
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1765 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski