Amino acid dipepetide frequency for Streptococcus satellite phage Javan21

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.848AlaAla: 0.848 ± 0.552
0.565AlaCys: 0.565 ± 0.431
3.673AlaAsp: 3.673 ± 0.702
4.804AlaGlu: 4.804 ± 1.298
2.826AlaPhe: 2.826 ± 1.026
1.695AlaGly: 1.695 ± 0.833
0.0AlaHis: 0.0 ± 0.0
6.782AlaIle: 6.782 ± 1.151
3.956AlaLys: 3.956 ± 0.992
5.086AlaLeu: 5.086 ± 1.017
1.413AlaMet: 1.413 ± 0.73
5.086AlaAsn: 5.086 ± 0.827
1.978AlaPro: 1.978 ± 0.817
3.391AlaGln: 3.391 ± 0.867
2.826AlaArg: 2.826 ± 0.891
3.673AlaSer: 3.673 ± 1.199
3.673AlaThr: 3.673 ± 0.978
3.673AlaVal: 3.673 ± 1.125
0.565AlaTrp: 0.565 ± 0.419
2.826AlaTyr: 2.826 ± 0.707
0.0AlaXaa: 0.0 ± 0.0
Cys
0.565CysAla: 0.565 ± 0.415
0.0CysCys: 0.0 ± 0.0
0.848CysAsp: 0.848 ± 0.485
0.283CysGlu: 0.283 ± 0.284
0.0CysPhe: 0.0 ± 0.0
1.413CysGly: 1.413 ± 0.789
0.565CysHis: 0.565 ± 0.418
0.565CysIle: 0.565 ± 0.4
0.0CysLys: 0.0 ± 0.0
1.413CysLeu: 1.413 ± 0.457
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.283CysPro: 0.283 ± 0.284
0.565CysGln: 0.565 ± 0.568
1.13CysArg: 1.13 ± 0.554
0.283CysSer: 0.283 ± 0.29
0.0CysThr: 0.0 ± 0.0
0.283CysVal: 0.283 ± 0.248
0.0CysTrp: 0.0 ± 0.0
0.283CysTyr: 0.283 ± 0.365
0.0CysXaa: 0.0 ± 0.0
Asp
0.848AspAla: 0.848 ± 0.481
1.695AspCys: 1.695 ± 0.836
3.391AspAsp: 3.391 ± 1.19
2.261AspGlu: 2.261 ± 0.872
1.695AspPhe: 1.695 ± 0.812
1.13AspGly: 1.13 ± 0.549
1.413AspHis: 1.413 ± 0.581
5.651AspIle: 5.651 ± 1.435
7.629AspLys: 7.629 ± 0.978
5.651AspLeu: 5.651 ± 1.03
1.978AspMet: 1.978 ± 0.974
3.956AspAsn: 3.956 ± 1.118
0.848AspPro: 0.848 ± 0.419
1.13AspGln: 1.13 ± 0.515
2.261AspArg: 2.261 ± 0.844
4.804AspSer: 4.804 ± 1.563
5.086AspThr: 5.086 ± 1.36
1.978AspVal: 1.978 ± 0.723
0.283AspTrp: 0.283 ± 0.282
5.934AspTyr: 5.934 ± 1.55
0.0AspXaa: 0.0 ± 0.0
Glu
4.521GluAla: 4.521 ± 1.135
1.13GluCys: 1.13 ± 0.758
4.238GluAsp: 4.238 ± 1.244
3.673GluGlu: 3.673 ± 1.001
3.391GluPhe: 3.391 ± 1.076
2.543GluGly: 2.543 ± 0.636
2.261GluHis: 2.261 ± 0.818
7.912GluIle: 7.912 ± 1.255
7.347GluLys: 7.347 ± 0.942
10.172GluLeu: 10.172 ± 1.98
1.413GluMet: 1.413 ± 0.876
2.543GluAsn: 2.543 ± 0.601
1.13GluPro: 1.13 ± 0.715
4.238GluGln: 4.238 ± 1.306
3.956GluArg: 3.956 ± 0.984
2.261GluSer: 2.261 ± 0.827
5.934GluThr: 5.934 ± 1.191
5.651GluVal: 5.651 ± 1.234
0.565GluTrp: 0.565 ± 0.436
2.543GluTyr: 2.543 ± 0.8
0.0GluXaa: 0.0 ± 0.0
Phe
1.695PheAla: 1.695 ± 0.632
0.0PheCys: 0.0 ± 0.0
2.826PheAsp: 2.826 ± 0.975
3.391PheGlu: 3.391 ± 0.802
1.413PhePhe: 1.413 ± 0.617
2.826PheGly: 2.826 ± 0.982
1.13PheHis: 1.13 ± 0.509
2.826PheIle: 2.826 ± 1.069
4.238PheLys: 4.238 ± 0.983
3.956PheLeu: 3.956 ± 0.99
0.283PheMet: 0.283 ± 0.339
5.086PheAsn: 5.086 ± 0.687
0.848PhePro: 0.848 ± 0.471
0.565PheGln: 0.565 ± 0.366
1.978PheArg: 1.978 ± 0.718
2.543PheSer: 2.543 ± 0.773
1.978PheThr: 1.978 ± 0.729
1.413PheVal: 1.413 ± 0.504
0.283PheTrp: 0.283 ± 0.292
1.13PheTyr: 1.13 ± 0.514
0.0PheXaa: 0.0 ± 0.0
Gly
1.695GlyAla: 1.695 ± 0.794
0.565GlyCys: 0.565 ± 0.315
2.543GlyAsp: 2.543 ± 0.742
4.521GlyGlu: 4.521 ± 1.294
2.826GlyPhe: 2.826 ± 0.753
3.673GlyGly: 3.673 ± 1.535
0.848GlyHis: 0.848 ± 0.463
3.673GlyIle: 3.673 ± 0.979
3.956GlyLys: 3.956 ± 1.16
5.651GlyLeu: 5.651 ± 1.421
0.848GlyMet: 0.848 ± 0.377
0.848GlyAsn: 0.848 ± 0.451
0.0GlyPro: 0.0 ± 0.0
1.413GlyGln: 1.413 ± 0.558
2.261GlyArg: 2.261 ± 0.649
1.695GlySer: 1.695 ± 0.805
3.391GlyThr: 3.391 ± 1.394
1.695GlyVal: 1.695 ± 0.662
0.565GlyTrp: 0.565 ± 0.495
3.673GlyTyr: 3.673 ± 1.413
0.0GlyXaa: 0.0 ± 0.0
His
1.978HisAla: 1.978 ± 0.764
0.283HisCys: 0.283 ± 0.248
0.283HisAsp: 0.283 ± 0.339
0.565HisGlu: 0.565 ± 0.38
0.848HisPhe: 0.848 ± 0.596
1.413HisGly: 1.413 ± 0.597
0.283HisHis: 0.283 ± 0.29
1.695HisIle: 1.695 ± 0.658
1.13HisLys: 1.13 ± 0.51
1.13HisLeu: 1.13 ± 0.458
0.0HisMet: 0.0 ± 0.0
0.848HisAsn: 0.848 ± 0.685
1.413HisPro: 1.413 ± 0.776
1.13HisGln: 1.13 ± 0.596
0.283HisArg: 0.283 ± 0.365
0.848HisSer: 0.848 ± 0.639
1.13HisThr: 1.13 ± 0.541
0.848HisVal: 0.848 ± 0.446
0.283HisTrp: 0.283 ± 0.284
3.108HisTyr: 3.108 ± 0.978
0.0HisXaa: 0.0 ± 0.0
Ile
5.086IleAla: 5.086 ± 1.29
0.565IleCys: 0.565 ± 0.418
5.086IleAsp: 5.086 ± 1.136
6.499IleGlu: 6.499 ± 1.474
2.543IlePhe: 2.543 ± 0.924
1.978IleGly: 1.978 ± 0.566
1.13IleHis: 1.13 ± 0.605
4.238IleIle: 4.238 ± 0.831
7.912IleLys: 7.912 ± 1.24
4.521IleLeu: 4.521 ± 1.134
1.13IleMet: 1.13 ± 0.525
2.826IleAsn: 2.826 ± 1.293
2.261IlePro: 2.261 ± 0.84
1.695IleGln: 1.695 ± 0.684
2.826IleArg: 2.826 ± 0.92
3.956IleSer: 3.956 ± 1.298
3.673IleThr: 3.673 ± 0.802
3.108IleVal: 3.108 ± 0.599
0.0IleTrp: 0.0 ± 0.0
4.521IleTyr: 4.521 ± 1.024
0.0IleXaa: 0.0 ± 0.0
Lys
8.76LysAla: 8.76 ± 1.216
0.283LysCys: 0.283 ± 0.339
4.521LysAsp: 4.521 ± 1.298
8.76LysGlu: 8.76 ± 1.21
2.543LysPhe: 2.543 ± 0.991
3.956LysGly: 3.956 ± 1.136
1.978LysHis: 1.978 ± 0.705
3.956LysIle: 3.956 ± 0.977
8.477LysLys: 8.477 ± 1.313
6.216LysLeu: 6.216 ± 1.382
2.261LysMet: 2.261 ± 1.018
4.521LysAsn: 4.521 ± 0.985
4.238LysPro: 4.238 ± 1.425
5.651LysGln: 5.651 ± 1.221
7.347LysArg: 7.347 ± 1.019
7.347LysSer: 7.347 ± 1.709
6.782LysThr: 6.782 ± 1.229
4.521LysVal: 4.521 ± 1.168
0.848LysTrp: 0.848 ± 0.363
3.391LysTyr: 3.391 ± 1.234
0.0LysXaa: 0.0 ± 0.0
Leu
7.347LeuAla: 7.347 ± 1.191
0.848LeuCys: 0.848 ± 0.479
7.064LeuAsp: 7.064 ± 1.253
8.76LeuGlu: 8.76 ± 2.386
3.108LeuPhe: 3.108 ± 0.844
5.086LeuGly: 5.086 ± 1.131
0.848LeuHis: 0.848 ± 0.407
6.782LeuIle: 6.782 ± 1.604
7.064LeuLys: 7.064 ± 1.339
7.064LeuLeu: 7.064 ± 1.293
1.978LeuMet: 1.978 ± 0.745
5.086LeuAsn: 5.086 ± 1.26
3.673LeuPro: 3.673 ± 1.208
2.826LeuGln: 2.826 ± 0.8
2.543LeuArg: 2.543 ± 0.617
5.369LeuSer: 5.369 ± 1.094
5.369LeuThr: 5.369 ± 0.826
3.108LeuVal: 3.108 ± 1.126
0.848LeuTrp: 0.848 ± 0.351
2.543LeuTyr: 2.543 ± 0.818
0.0LeuXaa: 0.0 ± 0.0
Met
1.695MetAla: 1.695 ± 0.818
0.0MetCys: 0.0 ± 0.0
1.413MetAsp: 1.413 ± 0.538
0.848MetGlu: 0.848 ± 0.483
0.283MetPhe: 0.283 ± 0.237
0.565MetGly: 0.565 ± 0.454
0.565MetHis: 0.565 ± 0.436
0.848MetIle: 0.848 ± 0.479
1.978MetLys: 1.978 ± 0.534
3.391MetLeu: 3.391 ± 1.281
0.283MetMet: 0.283 ± 0.307
2.261MetAsn: 2.261 ± 0.807
0.0MetPro: 0.0 ± 0.0
0.565MetGln: 0.565 ± 0.454
1.413MetArg: 1.413 ± 0.57
1.13MetSer: 1.13 ± 0.754
2.826MetThr: 2.826 ± 1.097
0.848MetVal: 0.848 ± 0.472
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.956AsnAla: 3.956 ± 0.909
0.283AsnCys: 0.283 ± 0.248
3.108AsnAsp: 3.108 ± 0.755
2.826AsnGlu: 2.826 ± 0.87
1.695AsnPhe: 1.695 ± 0.613
3.956AsnGly: 3.956 ± 1.182
1.695AsnHis: 1.695 ± 0.772
3.108AsnIle: 3.108 ± 0.819
5.369AsnLys: 5.369 ± 0.973
3.108AsnLeu: 3.108 ± 1.263
1.695AsnMet: 1.695 ± 1.051
2.261AsnAsn: 2.261 ± 0.683
1.978AsnPro: 1.978 ± 0.549
3.673AsnGln: 3.673 ± 1.306
2.261AsnArg: 2.261 ± 0.529
4.238AsnSer: 4.238 ± 0.969
3.108AsnThr: 3.108 ± 1.234
3.108AsnVal: 3.108 ± 1.078
0.565AsnTrp: 0.565 ± 0.33
1.978AsnTyr: 1.978 ± 0.906
0.0AsnXaa: 0.0 ± 0.0
Pro
1.413ProAla: 1.413 ± 0.728
0.283ProCys: 0.283 ± 0.29
0.848ProAsp: 0.848 ± 0.46
3.391ProGlu: 3.391 ± 1.215
1.413ProPhe: 1.413 ± 0.67
0.848ProGly: 0.848 ± 0.413
0.848ProHis: 0.848 ± 0.428
1.695ProIle: 1.695 ± 0.702
3.108ProLys: 3.108 ± 0.902
2.261ProLeu: 2.261 ± 0.577
1.695ProMet: 1.695 ± 0.643
2.826ProAsn: 2.826 ± 1.03
0.848ProPro: 0.848 ± 0.481
0.0ProGln: 0.0 ± 0.0
2.261ProArg: 2.261 ± 0.632
1.978ProSer: 1.978 ± 0.826
1.13ProThr: 1.13 ± 0.555
1.413ProVal: 1.413 ± 0.589
0.283ProTrp: 0.283 ± 0.248
1.695ProTyr: 1.695 ± 0.581
0.0ProXaa: 0.0 ± 0.0
Gln
3.956GlnAla: 3.956 ± 1.113
0.283GlnCys: 0.283 ± 0.345
2.261GlnAsp: 2.261 ± 0.9
5.369GlnGlu: 5.369 ± 1.234
0.848GlnPhe: 0.848 ± 0.579
2.826GlnGly: 2.826 ± 0.728
0.0GlnHis: 0.0 ± 0.0
1.695GlnIle: 1.695 ± 0.488
3.673GlnLys: 3.673 ± 1.094
5.369GlnLeu: 5.369 ± 0.738
0.283GlnMet: 0.283 ± 0.339
1.978GlnAsn: 1.978 ± 0.956
2.826GlnPro: 2.826 ± 1.18
1.978GlnGln: 1.978 ± 0.689
1.978GlnArg: 1.978 ± 0.656
2.826GlnSer: 2.826 ± 0.995
2.826GlnThr: 2.826 ± 0.766
2.826GlnVal: 2.826 ± 0.801
0.565GlnTrp: 0.565 ± 0.564
1.978GlnTyr: 1.978 ± 0.839
0.0GlnXaa: 0.0 ± 0.0
Arg
2.826ArgAla: 2.826 ± 1.201
0.565ArgCys: 0.565 ± 0.391
1.695ArgAsp: 1.695 ± 0.568
3.108ArgGlu: 3.108 ± 0.917
2.543ArgPhe: 2.543 ± 0.727
2.826ArgGly: 2.826 ± 0.784
0.565ArgHis: 0.565 ± 0.315
3.108ArgIle: 3.108 ± 0.787
6.216ArgLys: 6.216 ± 1.204
3.673ArgLeu: 3.673 ± 0.838
0.848ArgMet: 0.848 ± 0.46
3.391ArgAsn: 3.391 ± 1.06
1.13ArgPro: 1.13 ± 0.522
2.826ArgGln: 2.826 ± 0.912
2.543ArgArg: 2.543 ± 0.718
2.261ArgSer: 2.261 ± 0.915
2.826ArgThr: 2.826 ± 0.989
3.956ArgVal: 3.956 ± 0.845
0.848ArgTrp: 0.848 ± 0.565
2.261ArgTyr: 2.261 ± 0.651
0.0ArgXaa: 0.0 ± 0.0
Ser
3.391SerAla: 3.391 ± 1.186
0.283SerCys: 0.283 ± 0.339
5.934SerAsp: 5.934 ± 1.041
5.086SerGlu: 5.086 ± 1.244
3.391SerPhe: 3.391 ± 0.703
1.413SerGly: 1.413 ± 0.611
1.13SerHis: 1.13 ± 0.588
3.108SerIle: 3.108 ± 0.878
7.912SerLys: 7.912 ± 1.2
3.673SerLeu: 3.673 ± 1.072
0.848SerMet: 0.848 ± 0.456
1.695SerAsn: 1.695 ± 0.716
0.848SerPro: 0.848 ± 0.504
5.086SerGln: 5.086 ± 1.166
2.261SerArg: 2.261 ± 0.739
4.238SerSer: 4.238 ± 2.01
3.391SerThr: 3.391 ± 1.004
2.543SerVal: 2.543 ± 0.763
0.283SerTrp: 0.283 ± 0.365
5.086SerTyr: 5.086 ± 1.242
0.0SerXaa: 0.0 ± 0.0
Thr
4.238ThrAla: 4.238 ± 1.066
0.283ThrCys: 0.283 ± 0.284
4.238ThrAsp: 4.238 ± 1.096
3.673ThrGlu: 3.673 ± 0.899
2.826ThrPhe: 2.826 ± 1.48
4.521ThrGly: 4.521 ± 0.906
1.13ThrHis: 1.13 ± 0.684
3.108ThrIle: 3.108 ± 1.048
5.086ThrLys: 5.086 ± 1.276
5.086ThrLeu: 5.086 ± 1.391
1.13ThrMet: 1.13 ± 0.709
3.673ThrAsn: 3.673 ± 0.769
3.108ThrPro: 3.108 ± 0.936
4.804ThrGln: 4.804 ± 0.882
3.391ThrArg: 3.391 ± 0.788
4.521ThrSer: 4.521 ± 1.377
4.521ThrThr: 4.521 ± 1.826
3.956ThrVal: 3.956 ± 1.089
0.565ThrTrp: 0.565 ± 0.383
2.826ThrTyr: 2.826 ± 0.725
0.0ThrXaa: 0.0 ± 0.0
Val
2.826ValAla: 2.826 ± 0.992
0.283ValCys: 0.283 ± 0.256
1.978ValAsp: 1.978 ± 0.613
3.956ValGlu: 3.956 ± 0.965
3.391ValPhe: 3.391 ± 0.735
1.978ValGly: 1.978 ± 0.492
0.848ValHis: 0.848 ± 0.553
2.826ValIle: 2.826 ± 1.043
5.369ValLys: 5.369 ± 0.945
4.238ValLeu: 4.238 ± 1.145
1.13ValMet: 1.13 ± 0.523
1.695ValAsn: 1.695 ± 0.714
1.13ValPro: 1.13 ± 0.408
1.695ValGln: 1.695 ± 0.694
1.413ValArg: 1.413 ± 0.65
4.238ValSer: 4.238 ± 1.145
6.216ValThr: 6.216 ± 1.461
3.108ValVal: 3.108 ± 0.636
0.565ValTrp: 0.565 ± 0.495
1.13ValTyr: 1.13 ± 0.498
0.0ValXaa: 0.0 ± 0.0
Trp
0.283TrpAla: 0.283 ± 0.292
0.0TrpCys: 0.0 ± 0.0
0.848TrpAsp: 0.848 ± 0.504
0.848TrpGlu: 0.848 ± 0.39
0.283TrpPhe: 0.283 ± 0.248
0.283TrpGly: 0.283 ± 0.29
0.0TrpHis: 0.0 ± 0.0
0.283TrpIle: 0.283 ± 0.282
1.13TrpLys: 1.13 ± 0.435
1.13TrpLeu: 1.13 ± 0.548
0.0TrpMet: 0.0 ± 0.0
0.283TrpAsn: 0.283 ± 0.292
0.0TrpPro: 0.0 ± 0.0
0.565TrpGln: 0.565 ± 0.433
0.0TrpArg: 0.0 ± 0.0
1.13TrpSer: 1.13 ± 0.394
0.283TrpThr: 0.283 ± 0.248
0.848TrpVal: 0.848 ± 0.507
0.565TrpTrp: 0.565 ± 0.377
0.283TrpTyr: 0.283 ± 0.365
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.413TyrAla: 1.413 ± 0.628
0.283TyrCys: 0.283 ± 0.248
2.826TyrAsp: 2.826 ± 0.964
4.804TyrGlu: 4.804 ± 1.026
2.543TyrPhe: 2.543 ± 0.879
1.413TyrGly: 1.413 ± 0.636
2.261TyrHis: 2.261 ± 0.705
1.695TyrIle: 1.695 ± 0.795
4.804TyrLys: 4.804 ± 1.49
4.521TyrLeu: 4.521 ± 0.772
1.413TyrMet: 1.413 ± 0.715
3.391TyrAsn: 3.391 ± 0.769
1.695TyrPro: 1.695 ± 0.768
2.543TyrGln: 2.543 ± 0.826
5.086TyrArg: 5.086 ± 1.554
2.261TyrSer: 2.261 ± 0.478
2.543TyrThr: 2.543 ± 0.88
1.13TyrVal: 1.13 ± 0.525
0.565TyrTrp: 0.565 ± 0.568
3.673TyrTyr: 3.673 ± 1.287
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (3540 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski