Amino acid dipepetide frequency for Streptococcus phage Javan224

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.866AlaAla: 3.866 ± 1.27
0.42AlaCys: 0.42 ± 0.185
4.034AlaAsp: 4.034 ± 0.625
5.546AlaGlu: 5.546 ± 0.91
3.613AlaPhe: 3.613 ± 0.928
4.286AlaGly: 4.286 ± 0.804
0.672AlaHis: 0.672 ± 0.217
6.303AlaIle: 6.303 ± 1.146
6.639AlaLys: 6.639 ± 0.719
7.311AlaLeu: 7.311 ± 0.99
2.773AlaMet: 2.773 ± 0.64
4.118AlaAsn: 4.118 ± 0.718
2.269AlaPro: 2.269 ± 0.485
3.782AlaGln: 3.782 ± 0.726
2.689AlaArg: 2.689 ± 0.506
4.118AlaSer: 4.118 ± 1.325
4.622AlaThr: 4.622 ± 0.616
3.697AlaVal: 3.697 ± 0.841
0.84AlaTrp: 0.84 ± 0.287
2.773AlaTyr: 2.773 ± 0.625
0.0AlaXaa: 0.0 ± 0.0
Cys
0.084CysAla: 0.084 ± 0.078
0.168CysCys: 0.168 ± 0.132
0.42CysAsp: 0.42 ± 0.169
0.42CysGlu: 0.42 ± 0.213
0.084CysPhe: 0.084 ± 0.09
0.672CysGly: 0.672 ± 0.27
0.168CysHis: 0.168 ± 0.116
0.252CysIle: 0.252 ± 0.167
0.336CysLys: 0.336 ± 0.188
0.336CysLeu: 0.336 ± 0.166
0.084CysMet: 0.084 ± 0.083
0.252CysAsn: 0.252 ± 0.188
0.0CysPro: 0.0 ± 0.0
0.084CysGln: 0.084 ± 0.091
0.0CysArg: 0.0 ± 0.0
0.252CysSer: 0.252 ± 0.145
0.252CysThr: 0.252 ± 0.154
0.336CysVal: 0.336 ± 0.166
0.084CysTrp: 0.084 ± 0.09
0.42CysTyr: 0.42 ± 0.191
0.0CysXaa: 0.0 ± 0.0
Asp
4.37AspAla: 4.37 ± 0.561
0.336AspCys: 0.336 ± 0.187
5.294AspAsp: 5.294 ± 0.94
5.546AspGlu: 5.546 ± 1.158
2.773AspPhe: 2.773 ± 0.483
5.294AspGly: 5.294 ± 0.743
0.168AspHis: 0.168 ± 0.116
4.286AspIle: 4.286 ± 0.689
4.79AspLys: 4.79 ± 0.738
5.21AspLeu: 5.21 ± 0.71
1.765AspMet: 1.765 ± 0.427
5.21AspAsn: 5.21 ± 0.677
0.756AspPro: 0.756 ± 0.211
0.756AspGln: 0.756 ± 0.247
2.269AspArg: 2.269 ± 0.443
4.286AspSer: 4.286 ± 0.672
4.538AspThr: 4.538 ± 0.718
3.025AspVal: 3.025 ± 0.609
1.092AspTrp: 1.092 ± 0.32
3.613AspTyr: 3.613 ± 0.642
0.0AspXaa: 0.0 ± 0.0
Glu
3.782GluAla: 3.782 ± 0.73
0.42GluCys: 0.42 ± 0.244
3.866GluAsp: 3.866 ± 0.727
6.05GluGlu: 6.05 ± 1.078
3.193GluPhe: 3.193 ± 0.594
2.521GluGly: 2.521 ± 0.304
1.261GluHis: 1.261 ± 0.286
5.798GluIle: 5.798 ± 0.928
6.387GluLys: 6.387 ± 0.938
6.639GluLeu: 6.639 ± 1.138
2.017GluMet: 2.017 ± 0.369
3.782GluAsn: 3.782 ± 0.643
1.597GluPro: 1.597 ± 0.363
3.95GluGln: 3.95 ± 0.763
3.109GluArg: 3.109 ± 0.717
2.773GluSer: 2.773 ± 0.493
4.706GluThr: 4.706 ± 0.836
3.866GluVal: 3.866 ± 0.719
1.176GluTrp: 1.176 ± 0.368
3.697GluTyr: 3.697 ± 0.535
0.0GluXaa: 0.0 ± 0.0
Phe
2.689PheAla: 2.689 ± 0.571
0.084PheCys: 0.084 ± 0.086
3.613PheAsp: 3.613 ± 0.621
2.857PheGlu: 2.857 ± 0.538
1.261PhePhe: 1.261 ± 0.304
3.193PheGly: 3.193 ± 0.508
0.336PheHis: 0.336 ± 0.176
2.269PheIle: 2.269 ± 0.375
4.034PheLys: 4.034 ± 0.625
2.353PheLeu: 2.353 ± 0.428
1.008PheMet: 1.008 ± 0.268
2.941PheAsn: 2.941 ± 0.435
1.008PhePro: 1.008 ± 0.347
1.513PheGln: 1.513 ± 0.399
0.672PheArg: 0.672 ± 0.201
3.445PheSer: 3.445 ± 0.493
2.521PheThr: 2.521 ± 0.388
1.933PheVal: 1.933 ± 0.401
0.504PheTrp: 0.504 ± 0.21
1.261PheTyr: 1.261 ± 0.372
0.0PheXaa: 0.0 ± 0.0
Gly
4.538GlyAla: 4.538 ± 1.354
0.252GlyCys: 0.252 ± 0.139
3.529GlyAsp: 3.529 ± 0.719
3.025GlyGlu: 3.025 ± 0.385
2.437GlyPhe: 2.437 ± 0.415
3.866GlyGly: 3.866 ± 0.579
0.672GlyHis: 0.672 ± 0.197
4.286GlyIle: 4.286 ± 0.765
4.874GlyLys: 4.874 ± 0.54
6.471GlyLeu: 6.471 ± 0.849
1.765GlyMet: 1.765 ± 0.372
3.782GlyAsn: 3.782 ± 0.585
0.168GlyPro: 0.168 ± 0.144
1.849GlyGln: 1.849 ± 0.428
2.101GlyArg: 2.101 ± 0.375
4.286GlySer: 4.286 ± 0.642
5.21GlyThr: 5.21 ± 1.037
5.462GlyVal: 5.462 ± 1.136
0.42GlyTrp: 0.42 ± 0.188
2.605GlyTyr: 2.605 ± 0.524
0.0GlyXaa: 0.0 ± 0.0
His
0.588HisAla: 0.588 ± 0.184
0.168HisCys: 0.168 ± 0.097
0.672HisAsp: 0.672 ± 0.284
1.176HisGlu: 1.176 ± 0.349
0.588HisPhe: 0.588 ± 0.27
0.588HisGly: 0.588 ± 0.193
0.252HisHis: 0.252 ± 0.137
0.756HisIle: 0.756 ± 0.244
0.672HisLys: 0.672 ± 0.252
1.008HisLeu: 1.008 ± 0.3
0.084HisMet: 0.084 ± 0.078
0.42HisAsn: 0.42 ± 0.202
0.504HisPro: 0.504 ± 0.284
0.42HisGln: 0.42 ± 0.178
0.588HisArg: 0.588 ± 0.226
0.672HisSer: 0.672 ± 0.228
0.672HisThr: 0.672 ± 0.213
0.588HisVal: 0.588 ± 0.226
0.168HisTrp: 0.168 ± 0.1
0.756HisTyr: 0.756 ± 0.223
0.0HisXaa: 0.0 ± 0.0
Ile
5.546IleAla: 5.546 ± 0.764
0.168IleCys: 0.168 ± 0.121
5.126IleAsp: 5.126 ± 0.908
4.958IleGlu: 4.958 ± 0.883
2.605IlePhe: 2.605 ± 0.374
4.538IleGly: 4.538 ± 0.787
0.756IleHis: 0.756 ± 0.227
4.118IleIle: 4.118 ± 0.697
6.303IleLys: 6.303 ± 0.672
4.118IleLeu: 4.118 ± 0.605
1.513IleMet: 1.513 ± 0.374
4.874IleAsn: 4.874 ± 0.874
1.429IlePro: 1.429 ± 0.349
4.034IleGln: 4.034 ± 0.512
2.521IleArg: 2.521 ± 0.526
5.042IleSer: 5.042 ± 1.188
5.21IleThr: 5.21 ± 1.104
4.034IleVal: 4.034 ± 0.655
0.504IleTrp: 0.504 ± 0.197
2.605IleTyr: 2.605 ± 0.542
0.0IleXaa: 0.0 ± 0.0
Lys
6.975LysAla: 6.975 ± 0.757
0.672LysCys: 0.672 ± 0.274
4.286LysAsp: 4.286 ± 0.743
5.378LysGlu: 5.378 ± 1.109
2.353LysPhe: 2.353 ± 0.448
4.454LysGly: 4.454 ± 0.696
1.513LysHis: 1.513 ± 0.402
5.714LysIle: 5.714 ± 0.858
5.966LysLys: 5.966 ± 1.036
6.555LysLeu: 6.555 ± 0.979
2.773LysMet: 2.773 ± 0.604
5.714LysAsn: 5.714 ± 0.751
2.689LysPro: 2.689 ± 0.691
4.202LysGln: 4.202 ± 0.864
5.378LysArg: 5.378 ± 0.923
5.462LysSer: 5.462 ± 0.518
4.622LysThr: 4.622 ± 0.676
4.79LysVal: 4.79 ± 0.621
0.84LysTrp: 0.84 ± 0.28
2.353LysTyr: 2.353 ± 0.518
0.0LysXaa: 0.0 ± 0.0
Leu
6.303LeuAla: 6.303 ± 0.628
0.42LeuCys: 0.42 ± 0.18
6.891LeuAsp: 6.891 ± 1.055
6.555LeuGlu: 6.555 ± 1.043
2.773LeuPhe: 2.773 ± 0.5
4.874LeuGly: 4.874 ± 1.179
0.84LeuHis: 0.84 ± 0.274
5.21LeuIle: 5.21 ± 0.542
7.395LeuLys: 7.395 ± 0.906
3.95LeuLeu: 3.95 ± 0.546
1.345LeuMet: 1.345 ± 0.434
5.546LeuAsn: 5.546 ± 0.577
3.025LeuPro: 3.025 ± 0.474
3.782LeuGln: 3.782 ± 0.605
2.353LeuArg: 2.353 ± 0.58
6.303LeuSer: 6.303 ± 0.672
5.378LeuThr: 5.378 ± 0.667
3.782LeuVal: 3.782 ± 0.528
0.924LeuTrp: 0.924 ± 0.318
2.269LeuTyr: 2.269 ± 0.5
0.0LeuXaa: 0.0 ± 0.0
Met
1.513MetAla: 1.513 ± 0.353
0.0MetCys: 0.0 ± 0.0
1.176MetAsp: 1.176 ± 0.272
1.429MetGlu: 1.429 ± 0.391
0.588MetPhe: 0.588 ± 0.182
1.429MetGly: 1.429 ± 0.456
0.252MetHis: 0.252 ± 0.141
1.765MetIle: 1.765 ± 0.457
2.773MetLys: 2.773 ± 0.482
1.597MetLeu: 1.597 ± 0.38
1.008MetMet: 1.008 ± 0.291
0.924MetAsn: 0.924 ± 0.276
0.588MetPro: 0.588 ± 0.199
1.933MetGln: 1.933 ± 0.401
1.261MetArg: 1.261 ± 0.335
1.933MetSer: 1.933 ± 0.472
2.857MetThr: 2.857 ± 0.531
0.924MetVal: 0.924 ± 0.227
0.588MetTrp: 0.588 ± 0.179
0.924MetTyr: 0.924 ± 0.308
0.0MetXaa: 0.0 ± 0.0
Asn
5.294AsnAla: 5.294 ± 0.762
0.084AsnCys: 0.084 ± 0.09
3.445AsnAsp: 3.445 ± 0.578
3.95AsnGlu: 3.95 ± 0.602
2.017AsnPhe: 2.017 ± 0.337
4.538AsnGly: 4.538 ± 1.01
0.84AsnHis: 0.84 ± 0.323
2.521AsnIle: 2.521 ± 0.41
5.294AsnLys: 5.294 ± 0.869
5.63AsnLeu: 5.63 ± 0.67
1.092AsnMet: 1.092 ± 0.477
2.689AsnAsn: 2.689 ± 0.454
2.353AsnPro: 2.353 ± 0.514
2.185AsnGln: 2.185 ± 0.396
2.689AsnArg: 2.689 ± 0.512
3.613AsnSer: 3.613 ± 0.776
3.782AsnThr: 3.782 ± 0.612
5.21AsnVal: 5.21 ± 0.61
0.42AsnTrp: 0.42 ± 0.206
1.849AsnTyr: 1.849 ± 0.332
0.0AsnXaa: 0.0 ± 0.0
Pro
1.429ProAla: 1.429 ± 0.346
0.084ProCys: 0.084 ± 0.083
2.101ProAsp: 2.101 ± 0.435
2.185ProGlu: 2.185 ± 0.48
1.513ProPhe: 1.513 ± 0.278
1.008ProGly: 1.008 ± 0.346
0.252ProHis: 0.252 ± 0.137
1.933ProIle: 1.933 ± 0.47
2.605ProLys: 2.605 ± 0.579
2.269ProLeu: 2.269 ± 0.483
0.756ProMet: 0.756 ± 0.242
1.513ProAsn: 1.513 ± 0.429
0.588ProPro: 0.588 ± 0.204
1.092ProGln: 1.092 ± 0.397
0.504ProArg: 0.504 ± 0.242
1.597ProSer: 1.597 ± 0.356
1.092ProThr: 1.092 ± 0.312
2.185ProVal: 2.185 ± 0.415
0.084ProTrp: 0.084 ± 0.078
1.092ProTyr: 1.092 ± 0.341
0.0ProXaa: 0.0 ± 0.0
Gln
3.95GlnAla: 3.95 ± 0.662
0.252GlnCys: 0.252 ± 0.132
2.437GlnAsp: 2.437 ± 0.552
3.109GlnGlu: 3.109 ± 0.45
1.261GlnPhe: 1.261 ± 0.304
2.857GlnGly: 2.857 ± 0.632
0.42GlnHis: 0.42 ± 0.18
4.118GlnIle: 4.118 ± 0.876
3.529GlnLys: 3.529 ± 0.646
3.782GlnLeu: 3.782 ± 0.625
1.345GlnMet: 1.345 ± 0.411
2.437GlnAsn: 2.437 ± 0.743
0.672GlnPro: 0.672 ± 0.302
2.269GlnGln: 2.269 ± 0.489
1.849GlnArg: 1.849 ± 0.422
3.697GlnSer: 3.697 ± 0.483
2.689GlnThr: 2.689 ± 0.574
2.185GlnVal: 2.185 ± 0.708
0.252GlnTrp: 0.252 ± 0.127
1.933GlnTyr: 1.933 ± 0.449
0.0GlnXaa: 0.0 ± 0.0
Arg
3.361ArgAla: 3.361 ± 0.632
0.084ArgCys: 0.084 ± 0.082
1.849ArgAsp: 1.849 ± 0.383
2.353ArgGlu: 2.353 ± 0.45
1.597ArgPhe: 1.597 ± 0.43
1.513ArgGly: 1.513 ± 0.327
0.504ArgHis: 0.504 ± 0.235
2.437ArgIle: 2.437 ± 0.371
2.605ArgLys: 2.605 ± 0.555
3.613ArgLeu: 3.613 ± 0.564
1.261ArgMet: 1.261 ± 0.26
1.765ArgAsn: 1.765 ± 0.411
1.092ArgPro: 1.092 ± 0.283
1.849ArgGln: 1.849 ± 0.462
1.513ArgArg: 1.513 ± 0.422
1.765ArgSer: 1.765 ± 0.352
2.437ArgThr: 2.437 ± 0.413
3.277ArgVal: 3.277 ± 0.525
0.336ArgTrp: 0.336 ± 0.185
1.849ArgTyr: 1.849 ± 0.361
0.0ArgXaa: 0.0 ± 0.0
Ser
5.546SerAla: 5.546 ± 1.485
0.168SerCys: 0.168 ± 0.117
5.378SerAsp: 5.378 ± 0.683
4.874SerGlu: 4.874 ± 0.78
3.025SerPhe: 3.025 ± 0.691
4.958SerGly: 4.958 ± 0.88
0.588SerHis: 0.588 ± 0.197
4.706SerIle: 4.706 ± 0.693
4.286SerLys: 4.286 ± 0.654
5.042SerLeu: 5.042 ± 0.804
1.597SerMet: 1.597 ± 0.47
3.95SerAsn: 3.95 ± 0.655
1.765SerPro: 1.765 ± 0.406
2.269SerGln: 2.269 ± 0.508
2.101SerArg: 2.101 ± 0.332
4.118SerSer: 4.118 ± 1.06
3.445SerThr: 3.445 ± 0.72
4.454SerVal: 4.454 ± 0.642
0.756SerTrp: 0.756 ± 0.214
2.689SerTyr: 2.689 ± 0.477
0.0SerXaa: 0.0 ± 0.0
Thr
5.966ThrAla: 5.966 ± 1.182
0.168ThrCys: 0.168 ± 0.115
3.866ThrAsp: 3.866 ± 0.795
4.034ThrGlu: 4.034 ± 0.864
3.361ThrPhe: 3.361 ± 0.786
3.866ThrGly: 3.866 ± 0.589
0.252ThrHis: 0.252 ± 0.138
5.462ThrIle: 5.462 ± 0.647
5.126ThrLys: 5.126 ± 0.993
5.042ThrLeu: 5.042 ± 0.809
0.756ThrMet: 0.756 ± 0.251
3.277ThrAsn: 3.277 ± 0.677
2.689ThrPro: 2.689 ± 0.569
3.445ThrGln: 3.445 ± 0.593
1.765ThrArg: 1.765 ± 0.318
4.37ThrSer: 4.37 ± 0.67
4.286ThrThr: 4.286 ± 0.906
5.462ThrVal: 5.462 ± 0.877
0.672ThrTrp: 0.672 ± 0.286
3.193ThrTyr: 3.193 ± 0.662
0.0ThrXaa: 0.0 ± 0.0
Val
4.874ValAla: 4.874 ± 1.085
0.0ValCys: 0.0 ± 0.0
4.454ValAsp: 4.454 ± 0.531
3.445ValGlu: 3.445 ± 0.552
2.689ValPhe: 2.689 ± 0.436
4.118ValGly: 4.118 ± 1.18
0.588ValHis: 0.588 ± 0.246
3.445ValIle: 3.445 ± 0.582
5.462ValLys: 5.462 ± 0.545
4.79ValLeu: 4.79 ± 0.632
1.176ValMet: 1.176 ± 0.342
3.361ValAsn: 3.361 ± 0.658
1.345ValPro: 1.345 ± 0.351
3.025ValGln: 3.025 ± 0.498
1.765ValArg: 1.765 ± 0.307
4.958ValSer: 4.958 ± 0.678
4.874ValThr: 4.874 ± 0.69
4.538ValVal: 4.538 ± 0.603
0.504ValTrp: 0.504 ± 0.227
2.605ValTyr: 2.605 ± 0.617
0.0ValXaa: 0.0 ± 0.0
Trp
0.588TrpAla: 0.588 ± 0.206
0.168TrpCys: 0.168 ± 0.108
0.336TrpAsp: 0.336 ± 0.155
0.672TrpGlu: 0.672 ± 0.254
0.588TrpPhe: 0.588 ± 0.247
0.588TrpGly: 0.588 ± 0.219
0.252TrpHis: 0.252 ± 0.147
0.84TrpIle: 0.84 ± 0.26
0.84TrpLys: 0.84 ± 0.322
0.504TrpLeu: 0.504 ± 0.223
0.252TrpMet: 0.252 ± 0.153
0.84TrpAsn: 0.84 ± 0.316
0.0TrpPro: 0.0 ± 0.0
0.336TrpGln: 0.336 ± 0.201
0.84TrpArg: 0.84 ± 0.243
0.84TrpSer: 0.84 ± 0.2
0.756TrpThr: 0.756 ± 0.252
0.588TrpVal: 0.588 ± 0.242
0.084TrpTrp: 0.084 ± 0.088
0.672TrpTyr: 0.672 ± 0.254
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.109TyrAla: 3.109 ± 0.616
0.588TyrCys: 0.588 ± 0.233
2.605TyrAsp: 2.605 ± 0.669
3.109TyrGlu: 3.109 ± 0.617
1.176TyrPhe: 1.176 ± 0.361
2.437TyrGly: 2.437 ± 0.519
0.756TyrHis: 0.756 ± 0.268
3.782TyrIle: 3.782 ± 0.674
2.773TyrLys: 2.773 ± 0.517
3.697TyrLeu: 3.697 ± 0.749
1.008TyrMet: 1.008 ± 0.27
2.101TyrAsn: 2.101 ± 0.499
1.345TyrPro: 1.345 ± 0.384
2.269TyrGln: 2.269 ± 0.536
1.092TyrArg: 1.092 ± 0.399
2.185TyrSer: 2.185 ± 0.476
3.109TyrThr: 3.109 ± 0.699
1.681TyrVal: 1.681 ± 0.395
0.336TyrTrp: 0.336 ± 0.159
2.437TyrTyr: 2.437 ± 0.462
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (11901 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski