Amino acid dipeptide frequency for Streptococcus satellite phage Javan473

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and underrepresented ones are marked in blue in the table.
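The calculation described above can be sketched in Python as follows. This is only an illustration, not the code used to build the table: the function names are hypothetical, dipeptides are assumed to be counted as overlapping pairs within each protein (never across protein boundaries), and the over/under classification simply compares a value with the uniform expectation of 2.5‰.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes

# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille
UNIFORM_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_frequencies(proteins):
    """Per-mille frequency of each of the 400 standard dipeptides.

    Assumption: overlapping pairs are counted inside each protein only.
    Pairs containing non-standard letters still count towards the total.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AMINO_ACIDS, repeat=2)}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (hypothetical rule)."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"
```

With this rule, the AspIle value of 8.453‰ from the table would be labelled as overrepresented and the AlaHis value of 0.282‰ as underrepresented.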

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
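As a small worked example of this unit conversion, the sketch below turns a per-mille value into a fraction and into a percentage. The helper names are hypothetical and exist only for illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value_per_mille / 10.0
```

For instance, the GluLeu entry of 10.425‰ corresponds to a fraction of about 0.010425, i.e. roughly 1.04%, while the uniform expectation of 2.5‰ corresponds to 0.25%.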

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.845AlaAla: 0.845 ± 0.443
1.409AlaCys: 1.409 ± 0.696
2.254AlaAsp: 2.254 ± 0.826
4.227AlaGlu: 4.227 ± 0.958
4.227AlaPhe: 4.227 ± 0.95
2.254AlaGly: 2.254 ± 0.691
0.282AlaHis: 0.282 ± 0.232
6.199AlaIle: 6.199 ± 1.327
4.508AlaLys: 4.508 ± 1.132
5.072AlaLeu: 5.072 ± 1.176
1.691AlaMet: 1.691 ± 0.585
3.381AlaAsn: 3.381 ± 0.802
1.127AlaPro: 1.127 ± 0.56
2.254AlaGln: 2.254 ± 0.599
2.254AlaArg: 2.254 ± 0.727
4.508AlaSer: 4.508 ± 1.654
3.945AlaThr: 3.945 ± 0.697
3.381AlaVal: 3.381 ± 0.849
1.127AlaTrp: 1.127 ± 0.646
1.691AlaTyr: 1.691 ± 0.63
0.0AlaXaa: 0.0 ± 0.0
Cys
0.845CysAla: 0.845 ± 0.416
0.282CysCys: 0.282 ± 0.261
0.845CysAsp: 0.845 ± 0.421
0.282CysGlu: 0.282 ± 0.261
0.0CysPhe: 0.0 ± 0.0
0.564CysGly: 0.564 ± 0.378
0.282CysHis: 0.282 ± 0.323
0.0CysIle: 0.0 ± 0.0
0.845CysLys: 0.845 ± 0.371
1.127CysLeu: 1.127 ± 0.517
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.564CysPro: 0.564 ± 0.45
0.564CysGln: 0.564 ± 0.297
0.564CysArg: 0.564 ± 0.386
0.564CysSer: 0.564 ± 0.372
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.845CysTyr: 0.845 ± 0.406
0.0CysXaa: 0.0 ± 0.0
Asp
2.254AspAla: 2.254 ± 0.568
1.409AspCys: 1.409 ± 0.681
3.099AspAsp: 3.099 ± 0.8
3.945AspGlu: 3.945 ± 1.261
1.972AspPhe: 1.972 ± 0.526
2.254AspGly: 2.254 ± 0.671
1.127AspHis: 1.127 ± 0.714
8.453AspIle: 8.453 ± 1.01
5.635AspLys: 5.635 ± 1.336
6.199AspLeu: 6.199 ± 0.902
1.409AspMet: 1.409 ± 0.781
1.409AspAsn: 1.409 ± 0.584
0.564AspPro: 0.564 ± 0.366
1.691AspGln: 1.691 ± 0.623
1.972AspArg: 1.972 ± 0.644
3.099AspSer: 3.099 ± 1.098
4.227AspThr: 4.227 ± 1.178
0.845AspVal: 0.845 ± 0.421
0.564AspTrp: 0.564 ± 0.352
3.663AspTyr: 3.663 ± 1.044
0.0AspXaa: 0.0 ± 0.0
Glu
6.762GluAla: 6.762 ± 1.032
1.127GluCys: 1.127 ± 0.598
5.354GluAsp: 5.354 ± 1.536
7.044GluGlu: 7.044 ± 1.746
1.972GluPhe: 1.972 ± 1.0
2.818GluGly: 2.818 ± 0.648
1.409GluHis: 1.409 ± 0.679
5.917GluIle: 5.917 ± 1.14
5.917GluLys: 5.917 ± 0.642
10.425GluLeu: 10.425 ± 1.545
2.254GluMet: 2.254 ± 0.688
3.099GluAsn: 3.099 ± 1.169
1.127GluPro: 1.127 ± 0.56
4.79GluGln: 4.79 ± 1.689
3.663GluArg: 3.663 ± 1.133
1.691GluSer: 1.691 ± 0.67
3.663GluThr: 3.663 ± 0.902
2.818GluVal: 2.818 ± 0.86
1.409GluTrp: 1.409 ± 0.711
4.79GluTyr: 4.79 ± 1.176
0.0GluXaa: 0.0 ± 0.0
Phe
1.691PheAla: 1.691 ± 0.598
0.0PheCys: 0.0 ± 0.0
2.536PheAsp: 2.536 ± 0.669
3.099PheGlu: 3.099 ± 0.72
1.409PhePhe: 1.409 ± 0.553
1.409PheGly: 1.409 ± 0.526
1.691PheHis: 1.691 ± 0.505
1.972PheIle: 1.972 ± 0.539
4.79PheLys: 4.79 ± 0.879
3.381PheLeu: 3.381 ± 0.905
0.0PheMet: 0.0 ± 0.0
2.254PheAsn: 2.254 ± 0.668
1.127PhePro: 1.127 ± 0.548
1.409PheGln: 1.409 ± 0.595
1.691PheArg: 1.691 ± 0.596
1.972PheSer: 1.972 ± 0.581
3.381PheThr: 3.381 ± 0.685
1.972PheVal: 1.972 ± 0.573
0.282PheTrp: 0.282 ± 0.232
1.127PheTyr: 1.127 ± 0.529
0.0PheXaa: 0.0 ± 0.0
Gly
2.254GlyAla: 2.254 ± 1.088
0.0GlyCys: 0.0 ± 0.0
4.227GlyAsp: 4.227 ± 1.051
2.818GlyGlu: 2.818 ± 0.893
1.972GlyPhe: 1.972 ± 0.757
3.099GlyGly: 3.099 ± 1.071
0.845GlyHis: 0.845 ± 0.585
1.409GlyIle: 1.409 ± 0.747
3.945GlyLys: 3.945 ± 0.921
6.762GlyLeu: 6.762 ± 1.731
1.409GlyMet: 1.409 ± 0.421
3.099GlyAsn: 3.099 ± 0.985
0.282GlyPro: 0.282 ± 0.269
3.099GlyGln: 3.099 ± 1.261
2.536GlyArg: 2.536 ± 0.863
1.972GlySer: 1.972 ± 0.813
2.818GlyThr: 2.818 ± 0.684
3.945GlyVal: 3.945 ± 0.888
0.282GlyTrp: 0.282 ± 0.232
3.381GlyTyr: 3.381 ± 0.743
0.0GlyXaa: 0.0 ± 0.0
His
2.536HisAla: 2.536 ± 1.274
0.0HisCys: 0.0 ± 0.0
0.564HisAsp: 0.564 ± 0.365
0.282HisGlu: 0.282 ± 0.232
0.0HisPhe: 0.0 ± 0.0
0.845HisGly: 0.845 ± 0.398
0.564HisHis: 0.564 ± 0.331
1.409HisIle: 1.409 ± 0.536
1.972HisLys: 1.972 ± 0.791
2.536HisLeu: 2.536 ± 0.873
0.282HisMet: 0.282 ± 0.265
1.409HisAsn: 1.409 ± 0.746
0.564HisPro: 0.564 ± 0.366
0.845HisGln: 0.845 ± 0.436
0.564HisArg: 0.564 ± 0.406
0.282HisSer: 0.282 ± 0.316
1.127HisThr: 1.127 ± 0.453
0.845HisVal: 0.845 ± 0.42
0.564HisTrp: 0.564 ± 0.297
1.691HisTyr: 1.691 ± 0.652
0.0HisXaa: 0.0 ± 0.0
Ile
5.917IleAla: 5.917 ± 1.232
0.282IleCys: 0.282 ± 0.323
5.917IleAsp: 5.917 ± 1.176
5.635IleGlu: 5.635 ± 1.105
2.254IlePhe: 2.254 ± 0.755
2.254IleGly: 2.254 ± 0.593
1.127IleHis: 1.127 ± 0.832
4.508IleIle: 4.508 ± 1.124
9.58IleLys: 9.58 ± 1.393
3.663IleLeu: 3.663 ± 1.0
1.972IleMet: 1.972 ± 0.554
4.227IleAsn: 4.227 ± 0.819
2.536IlePro: 2.536 ± 0.814
2.818IleGln: 2.818 ± 0.818
3.663IleArg: 3.663 ± 0.995
4.227IleSer: 4.227 ± 1.127
4.79IleThr: 4.79 ± 1.218
1.691IleVal: 1.691 ± 0.657
0.0IleTrp: 0.0 ± 0.0
1.691IleTyr: 1.691 ± 0.728
0.0IleXaa: 0.0 ± 0.0
Lys
7.044LysAla: 7.044 ± 1.271
0.282LysCys: 0.282 ± 0.249
5.635LysAsp: 5.635 ± 1.269
9.862LysGlu: 9.862 ± 1.867
2.254LysPhe: 2.254 ± 0.652
5.354LysGly: 5.354 ± 1.414
2.818LysHis: 2.818 ± 0.69
5.072LysIle: 5.072 ± 1.083
8.171LysLys: 8.171 ± 1.817
6.199LysLeu: 6.199 ± 1.412
1.972LysMet: 1.972 ± 0.888
5.635LysAsn: 5.635 ± 1.359
5.917LysPro: 5.917 ± 1.127
4.227LysGln: 4.227 ± 1.192
4.79LysArg: 4.79 ± 0.962
4.79LysSer: 4.79 ± 1.218
6.481LysThr: 6.481 ± 1.446
5.354LysVal: 5.354 ± 1.077
0.282LysTrp: 0.282 ± 0.269
3.381LysTyr: 3.381 ± 1.093
0.0LysXaa: 0.0 ± 0.0
Leu
4.227LeuAla: 4.227 ± 1.039
0.282LeuCys: 0.282 ± 0.249
6.199LeuAsp: 6.199 ± 1.425
11.834LeuGlu: 11.834 ± 1.347
3.945LeuPhe: 3.945 ± 1.194
5.635LeuGly: 5.635 ± 1.322
1.972LeuHis: 1.972 ± 0.755
7.608LeuIle: 7.608 ± 1.417
10.425LeuLys: 10.425 ± 1.1
9.58LeuLeu: 9.58 ± 1.424
2.254LeuMet: 2.254 ± 0.877
5.354LeuAsn: 5.354 ± 1.574
4.227LeuPro: 4.227 ± 1.254
3.099LeuGln: 3.099 ± 0.639
3.099LeuArg: 3.099 ± 0.966
8.735LeuSer: 8.735 ± 2.032
4.508LeuThr: 4.508 ± 0.685
4.227LeuVal: 4.227 ± 1.116
0.282LeuTrp: 0.282 ± 0.272
5.072LeuTyr: 5.072 ± 1.184
0.0LeuXaa: 0.0 ± 0.0
Met
2.254MetAla: 2.254 ± 0.823
0.0MetCys: 0.0 ± 0.0
1.691MetAsp: 1.691 ± 0.532
1.127MetGlu: 1.127 ± 0.746
0.282MetPhe: 0.282 ± 0.287
0.282MetGly: 0.282 ± 0.253
0.0MetHis: 0.0 ± 0.0
1.409MetIle: 1.409 ± 0.626
2.818MetLys: 2.818 ± 0.75
2.536MetLeu: 2.536 ± 0.718
0.0MetMet: 0.0 ± 0.0
2.254MetAsn: 2.254 ± 0.629
0.282MetPro: 0.282 ± 0.272
0.282MetGln: 0.282 ± 0.314
1.691MetArg: 1.691 ± 0.51
1.409MetSer: 1.409 ± 0.659
3.381MetThr: 3.381 ± 0.984
0.564MetVal: 0.564 ± 0.384
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.663AsnAla: 3.663 ± 0.978
0.282AsnCys: 0.282 ± 0.232
1.409AsnAsp: 1.409 ± 0.506
3.099AsnGlu: 3.099 ± 0.92
0.845AsnPhe: 0.845 ± 0.494
5.072AsnGly: 5.072 ± 1.35
1.127AsnHis: 1.127 ± 0.515
2.536AsnIle: 2.536 ± 0.808
3.663AsnLys: 3.663 ± 0.904
5.635AsnLeu: 5.635 ± 1.224
1.127AsnMet: 1.127 ± 0.426
2.536AsnAsn: 2.536 ± 0.811
1.972AsnPro: 1.972 ± 0.669
2.818AsnGln: 2.818 ± 1.146
4.227AsnArg: 4.227 ± 1.048
2.536AsnSer: 2.536 ± 0.723
3.099AsnThr: 3.099 ± 0.786
2.254AsnVal: 2.254 ± 0.774
0.845AsnTrp: 0.845 ± 0.484
1.972AsnTyr: 1.972 ± 0.636
0.0AsnXaa: 0.0 ± 0.0
Pro
0.564ProAla: 0.564 ± 0.337
0.282ProCys: 0.282 ± 0.287
1.691ProAsp: 1.691 ± 0.629
3.945ProGlu: 3.945 ± 0.795
1.691ProPhe: 1.691 ± 0.702
0.564ProGly: 0.564 ± 0.467
0.0ProHis: 0.0 ± 0.0
1.409ProIle: 1.409 ± 0.598
4.508ProLys: 4.508 ± 0.982
3.099ProLeu: 3.099 ± 1.008
0.282ProMet: 0.282 ± 0.261
2.254ProAsn: 2.254 ± 0.661
0.564ProPro: 0.564 ± 0.373
1.127ProGln: 1.127 ± 0.634
1.972ProArg: 1.972 ± 0.596
1.127ProSer: 1.127 ± 0.478
3.099ProThr: 3.099 ± 0.581
1.691ProVal: 1.691 ± 0.694
0.0ProTrp: 0.0 ± 0.0
1.127ProTyr: 1.127 ± 0.572
0.0ProXaa: 0.0 ± 0.0
Gln
3.099GlnAla: 3.099 ± 0.667
0.282GlnCys: 0.282 ± 0.262
1.972GlnAsp: 1.972 ± 0.608
3.663GlnGlu: 3.663 ± 0.989
0.845GlnPhe: 0.845 ± 0.438
2.818GlnGly: 2.818 ± 0.914
0.845GlnHis: 0.845 ± 0.362
3.099GlnIle: 3.099 ± 0.72
3.945GlnLys: 3.945 ± 0.947
7.044GlnLeu: 7.044 ± 1.267
1.127GlnMet: 1.127 ± 0.771
1.409GlnAsn: 1.409 ± 0.73
1.691GlnPro: 1.691 ± 0.667
1.691GlnGln: 1.691 ± 0.531
3.381GlnArg: 3.381 ± 0.761
2.536GlnSer: 2.536 ± 0.751
1.409GlnThr: 1.409 ± 0.526
3.663GlnVal: 3.663 ± 0.835
0.564GlnTrp: 0.564 ± 0.417
0.845GlnTyr: 0.845 ± 0.392
0.0GlnXaa: 0.0 ± 0.0
Arg
2.254ArgAla: 2.254 ± 0.691
0.564ArgCys: 0.564 ± 0.365
2.254ArgAsp: 2.254 ± 0.776
3.381ArgGlu: 3.381 ± 0.82
2.254ArgPhe: 2.254 ± 0.728
2.818ArgGly: 2.818 ± 0.881
1.409ArgHis: 1.409 ± 0.688
1.972ArgIle: 1.972 ± 0.509
6.199ArgLys: 6.199 ± 1.239
6.481ArgLeu: 6.481 ± 1.295
0.845ArgMet: 0.845 ± 0.372
1.409ArgAsn: 1.409 ± 0.602
1.127ArgPro: 1.127 ± 0.434
3.945ArgGln: 3.945 ± 0.808
2.536ArgArg: 2.536 ± 0.966
2.818ArgSer: 2.818 ± 0.906
2.818ArgThr: 2.818 ± 0.729
3.663ArgVal: 3.663 ± 0.812
0.564ArgTrp: 0.564 ± 0.424
3.099ArgTyr: 3.099 ± 0.842
0.0ArgXaa: 0.0 ± 0.0
Ser
3.099SerAla: 3.099 ± 0.957
0.564SerCys: 0.564 ± 0.378
3.945SerAsp: 3.945 ± 0.949
3.099SerGlu: 3.099 ± 1.095
2.818SerPhe: 2.818 ± 1.271
2.254SerGly: 2.254 ± 0.841
0.564SerHis: 0.564 ± 0.365
4.79SerIle: 4.79 ± 0.861
6.199SerLys: 6.199 ± 1.237
4.79SerLeu: 4.79 ± 0.834
1.409SerMet: 1.409 ± 0.646
1.691SerAsn: 1.691 ± 0.666
0.845SerPro: 0.845 ± 0.391
2.818SerGln: 2.818 ± 0.869
3.099SerArg: 3.099 ± 0.926
1.691SerSer: 1.691 ± 1.027
3.099SerThr: 3.099 ± 0.926
4.79SerVal: 4.79 ± 1.253
0.845SerTrp: 0.845 ± 0.481
1.691SerTyr: 1.691 ± 0.78
0.0SerXaa: 0.0 ± 0.0
Thr
3.663ThrAla: 3.663 ± 0.901
0.282ThrCys: 0.282 ± 0.253
1.691ThrAsp: 1.691 ± 0.522
4.508ThrGlu: 4.508 ± 0.994
2.536ThrPhe: 2.536 ± 1.044
5.354ThrGly: 5.354 ± 1.125
0.845ThrHis: 0.845 ± 0.418
3.663ThrIle: 3.663 ± 0.947
4.227ThrLys: 4.227 ± 1.251
7.89ThrLeu: 7.89 ± 1.732
1.691ThrMet: 1.691 ± 0.57
1.127ThrAsn: 1.127 ± 0.525
3.945ThrPro: 3.945 ± 1.019
2.818ThrGln: 2.818 ± 0.872
3.945ThrArg: 3.945 ± 0.978
3.099ThrSer: 3.099 ± 0.744
3.381ThrThr: 3.381 ± 1.026
3.381ThrVal: 3.381 ± 0.949
1.127ThrTrp: 1.127 ± 0.617
3.945ThrTyr: 3.945 ± 1.241
0.0ThrXaa: 0.0 ± 0.0
Val
2.818ValAla: 2.818 ± 0.826
0.564ValCys: 0.564 ± 0.428
1.972ValAsp: 1.972 ± 0.876
1.972ValGlu: 1.972 ± 0.886
2.818ValPhe: 2.818 ± 0.941
2.536ValGly: 2.536 ± 0.671
0.282ValHis: 0.282 ± 0.253
5.635ValIle: 5.635 ± 1.254
4.227ValLys: 4.227 ± 0.808
5.635ValLeu: 5.635 ± 1.433
1.127ValMet: 1.127 ± 0.449
3.381ValAsn: 3.381 ± 0.833
0.845ValPro: 0.845 ± 0.452
1.972ValGln: 1.972 ± 0.652
1.691ValArg: 1.691 ± 0.666
3.663ValSer: 3.663 ± 1.005
3.945ValThr: 3.945 ± 1.185
3.381ValVal: 3.381 ± 0.898
0.564ValTrp: 0.564 ± 0.464
2.536ValTyr: 2.536 ± 0.733
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.845TrpAsp: 0.845 ± 0.458
0.845TrpGlu: 0.845 ± 0.472
0.0TrpPhe: 0.0 ± 0.0
0.282TrpGly: 0.282 ± 0.272
0.0TrpHis: 0.0 ± 0.0
0.282TrpIle: 0.282 ± 0.253
0.564TrpLys: 0.564 ± 0.464
1.972TrpLeu: 1.972 ± 0.663
0.0TrpMet: 0.0 ± 0.0
0.564TrpAsn: 0.564 ± 0.379
0.282TrpPro: 0.282 ± 0.232
0.564TrpGln: 0.564 ± 0.35
0.845TrpArg: 0.845 ± 0.425
0.564TrpSer: 0.564 ± 0.327
0.564TrpThr: 0.564 ± 0.376
1.127TrpVal: 1.127 ± 0.556
0.845TrpTrp: 0.845 ± 0.582
0.564TrpTyr: 0.564 ± 0.372
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.845TyrAla: 0.845 ± 0.509
0.282TyrCys: 0.282 ± 0.269
1.691TyrAsp: 1.691 ± 0.633
3.381TyrGlu: 3.381 ± 0.96
3.099TyrPhe: 3.099 ± 0.727
1.691TyrGly: 1.691 ± 0.624
1.691TyrHis: 1.691 ± 0.622
1.409TyrIle: 1.409 ± 0.557
3.381TyrLys: 3.381 ± 1.066
2.818TyrLeu: 2.818 ± 0.556
1.127TyrMet: 1.127 ± 0.635
4.227TyrAsn: 4.227 ± 1.085
1.409TyrPro: 1.409 ± 0.809
3.099TyrGln: 3.099 ± 0.865
4.227TyrArg: 4.227 ± 1.136
2.818TyrSer: 2.818 ± 0.795
3.381TyrThr: 3.381 ± 0.789
1.972TyrVal: 1.972 ± 0.703
0.564TyrTrp: 0.564 ± 0.521
3.381TyrTyr: 3.381 ± 0.851
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 22 proteins (3550 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
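Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported. This is a minimal sketch under stated assumptions, not the database's actual code: the function names are hypothetical, and treating the ± value as the standard deviation over replicates is an assumption.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide, counting overlapping pairs per protein."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the result depends on which proteins happen to be in the proteome.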

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski