Amino acid dipepetide frequency for Streptococcus satellite phage Javan449

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.794AlaAla: 0.794 ± 0.446
0.794AlaCys: 0.794 ± 0.375
2.912AlaAsp: 2.912 ± 0.855
3.176AlaGlu: 3.176 ± 0.863
3.441AlaPhe: 3.441 ± 0.674
2.912AlaGly: 2.912 ± 0.783
0.0AlaHis: 0.0 ± 0.0
5.823AlaIle: 5.823 ± 1.209
5.029AlaLys: 5.029 ± 1.228
4.235AlaLeu: 4.235 ± 1.094
1.323AlaMet: 1.323 ± 0.467
4.235AlaAsn: 4.235 ± 0.889
0.529AlaPro: 0.529 ± 0.309
2.382AlaGln: 2.382 ± 0.499
2.647AlaArg: 2.647 ± 0.615
4.235AlaSer: 4.235 ± 1.476
4.235AlaThr: 4.235 ± 0.603
4.235AlaVal: 4.235 ± 1.159
0.794AlaTrp: 0.794 ± 0.52
1.853AlaTyr: 1.853 ± 0.55
0.0AlaXaa: 0.0 ± 0.0
Cys
0.794CysAla: 0.794 ± 0.43
0.265CysCys: 0.265 ± 0.219
0.529CysAsp: 0.529 ± 0.325
0.265CysGlu: 0.265 ± 0.219
0.0CysPhe: 0.0 ± 0.0
0.529CysGly: 0.529 ± 0.362
0.265CysHis: 0.265 ± 0.243
0.0CysIle: 0.0 ± 0.0
0.529CysLys: 0.529 ± 0.33
1.588CysLeu: 1.588 ± 0.71
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.529CysPro: 0.529 ± 0.364
1.059CysGln: 1.059 ± 0.549
0.265CysArg: 0.265 ± 0.219
0.529CysSer: 0.529 ± 0.44
0.0CysThr: 0.0 ± 0.0
0.265CysVal: 0.265 ± 0.268
0.0CysTrp: 0.0 ± 0.0
0.529CysTyr: 0.529 ± 0.324
0.0CysXaa: 0.0 ± 0.0
Asp
3.176AspAla: 3.176 ± 0.704
1.059AspCys: 1.059 ± 0.61
2.647AspAsp: 2.647 ± 0.664
3.441AspGlu: 3.441 ± 1.209
3.176AspPhe: 3.176 ± 0.938
2.912AspGly: 2.912 ± 0.781
1.323AspHis: 1.323 ± 0.465
8.735AspIle: 8.735 ± 1.13
4.764AspLys: 4.764 ± 0.809
6.088AspLeu: 6.088 ± 0.61
1.059AspMet: 1.059 ± 0.581
1.588AspAsn: 1.588 ± 0.69
0.794AspPro: 0.794 ± 0.359
1.059AspGln: 1.059 ± 0.416
2.912AspArg: 2.912 ± 0.61
2.912AspSer: 2.912 ± 1.061
3.441AspThr: 3.441 ± 0.794
1.853AspVal: 1.853 ± 0.83
0.265AspTrp: 0.265 ± 0.26
4.764AspTyr: 4.764 ± 1.116
0.0AspXaa: 0.0 ± 0.0
Glu
5.294GluAla: 5.294 ± 1.232
1.059GluCys: 1.059 ± 0.556
4.5GluAsp: 4.5 ± 1.14
5.029GluGlu: 5.029 ± 1.142
1.853GluPhe: 1.853 ± 0.908
2.912GluGly: 2.912 ± 0.801
2.118GluHis: 2.118 ± 0.664
4.764GluIle: 4.764 ± 0.876
5.029GluLys: 5.029 ± 0.762
8.735GluLeu: 8.735 ± 1.076
2.382GluMet: 2.382 ± 0.617
2.647GluAsn: 2.647 ± 0.708
1.853GluPro: 1.853 ± 0.506
3.97GluGln: 3.97 ± 1.401
3.441GluArg: 3.441 ± 0.992
1.588GluSer: 1.588 ± 0.679
5.294GluThr: 5.294 ± 1.047
3.176GluVal: 3.176 ± 0.704
0.794GluTrp: 0.794 ± 0.376
3.706GluTyr: 3.706 ± 1.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.588PheAla: 1.588 ± 0.527
0.0PheCys: 0.0 ± 0.0
2.912PheAsp: 2.912 ± 0.633
2.647PheGlu: 2.647 ± 0.577
1.588PhePhe: 1.588 ± 0.573
2.118PheGly: 2.118 ± 0.629
1.853PheHis: 1.853 ± 0.561
2.647PheIle: 2.647 ± 0.695
3.441PheLys: 3.441 ± 0.892
3.706PheLeu: 3.706 ± 0.852
0.0PheMet: 0.0 ± 0.0
2.382PheAsn: 2.382 ± 0.807
0.794PhePro: 0.794 ± 0.351
0.529PheGln: 0.529 ± 0.348
2.647PheArg: 2.647 ± 0.645
3.176PheSer: 3.176 ± 0.634
2.382PheThr: 2.382 ± 0.734
2.382PheVal: 2.382 ± 0.569
0.265PheTrp: 0.265 ± 0.23
1.323PheTyr: 1.323 ± 0.546
0.0PheXaa: 0.0 ± 0.0
Gly
3.176GlyAla: 3.176 ± 1.242
0.265GlyCys: 0.265 ± 0.268
4.764GlyAsp: 4.764 ± 1.329
2.382GlyGlu: 2.382 ± 0.548
2.382GlyPhe: 2.382 ± 0.89
2.382GlyGly: 2.382 ± 0.719
1.323GlyHis: 1.323 ± 0.741
3.176GlyIle: 3.176 ± 0.98
4.764GlyLys: 4.764 ± 0.92
6.353GlyLeu: 6.353 ± 1.493
1.588GlyMet: 1.588 ± 0.603
2.912GlyAsn: 2.912 ± 0.717
0.265GlyPro: 0.265 ± 0.244
4.235GlyGln: 4.235 ± 1.219
2.118GlyArg: 2.118 ± 0.908
2.118GlySer: 2.118 ± 0.699
2.912GlyThr: 2.912 ± 0.571
3.441GlyVal: 3.441 ± 0.967
0.529GlyTrp: 0.529 ± 0.307
3.441GlyTyr: 3.441 ± 0.844
0.0GlyXaa: 0.0 ± 0.0
His
2.118HisAla: 2.118 ± 1.069
0.0HisCys: 0.0 ± 0.0
0.529HisAsp: 0.529 ± 0.404
0.794HisGlu: 0.794 ± 0.329
0.265HisPhe: 0.265 ± 0.251
1.588HisGly: 1.588 ± 0.596
0.794HisHis: 0.794 ± 0.489
1.323HisIle: 1.323 ± 0.46
1.853HisLys: 1.853 ± 0.659
2.647HisLeu: 2.647 ± 1.052
0.265HisMet: 0.265 ± 0.218
1.323HisAsn: 1.323 ± 0.641
0.794HisPro: 0.794 ± 0.357
1.323HisGln: 1.323 ± 0.667
0.529HisArg: 0.529 ± 0.378
0.529HisSer: 0.529 ± 0.511
1.059HisThr: 1.059 ± 0.392
0.529HisVal: 0.529 ± 0.348
1.059HisTrp: 1.059 ± 0.549
1.588HisTyr: 1.588 ± 0.612
0.0HisXaa: 0.0 ± 0.0
Ile
5.558IleAla: 5.558 ± 1.283
0.265IleCys: 0.265 ± 0.243
7.147IleAsp: 7.147 ± 1.406
5.029IleGlu: 5.029 ± 0.894
2.647IlePhe: 2.647 ± 0.685
3.441IleGly: 3.441 ± 0.964
0.794IleHis: 0.794 ± 0.486
3.97IleIle: 3.97 ± 1.14
8.735IleLys: 8.735 ± 1.782
4.764IleLeu: 4.764 ± 0.996
1.588IleMet: 1.588 ± 0.665
4.235IleAsn: 4.235 ± 1.058
2.118IlePro: 2.118 ± 0.82
1.853IleGln: 1.853 ± 0.707
3.97IleArg: 3.97 ± 1.067
5.294IleSer: 5.294 ± 1.694
5.029IleThr: 5.029 ± 1.073
2.647IleVal: 2.647 ± 0.956
0.265IleTrp: 0.265 ± 0.26
2.118IleTyr: 2.118 ± 0.768
0.0IleXaa: 0.0 ± 0.0
Lys
6.882LysAla: 6.882 ± 1.859
0.265LysCys: 0.265 ± 0.32
3.706LysAsp: 3.706 ± 0.997
10.323LysGlu: 10.323 ± 1.726
1.853LysPhe: 1.853 ± 0.56
4.764LysGly: 4.764 ± 1.174
2.118LysHis: 2.118 ± 0.606
3.706LysIle: 3.706 ± 1.07
5.823LysLys: 5.823 ± 1.466
6.617LysLeu: 6.617 ± 1.455
1.588LysMet: 1.588 ± 0.58
4.5LysAsn: 4.5 ± 0.712
5.294LysPro: 5.294 ± 1.134
3.706LysGln: 3.706 ± 0.874
4.5LysArg: 4.5 ± 0.813
5.029LysSer: 5.029 ± 1.118
5.558LysThr: 5.558 ± 1.143
6.088LysVal: 6.088 ± 1.082
1.059LysTrp: 1.059 ± 0.615
3.706LysTyr: 3.706 ± 0.828
0.0LysXaa: 0.0 ± 0.0
Leu
5.823LeuAla: 5.823 ± 1.266
0.794LeuCys: 0.794 ± 0.48
6.353LeuAsp: 6.353 ± 1.305
9.264LeuGlu: 9.264 ± 1.197
4.5LeuPhe: 4.5 ± 0.954
5.558LeuGly: 5.558 ± 1.192
1.323LeuHis: 1.323 ± 0.585
7.941LeuIle: 7.941 ± 1.317
7.941LeuLys: 7.941 ± 1.033
7.411LeuLeu: 7.411 ± 1.476
2.647LeuMet: 2.647 ± 0.873
5.294LeuAsn: 5.294 ± 1.156
4.5LeuPro: 4.5 ± 1.281
2.647LeuGln: 2.647 ± 0.478
2.382LeuArg: 2.382 ± 0.707
8.205LeuSer: 8.205 ± 2.019
5.294LeuThr: 5.294 ± 0.871
5.029LeuVal: 5.029 ± 1.073
0.794LeuTrp: 0.794 ± 0.378
5.558LeuTyr: 5.558 ± 0.908
0.0LeuXaa: 0.0 ± 0.0
Met
1.323MetAla: 1.323 ± 0.551
0.265MetCys: 0.265 ± 0.268
1.059MetAsp: 1.059 ± 0.435
0.794MetGlu: 0.794 ± 0.397
0.529MetPhe: 0.529 ± 0.309
0.529MetGly: 0.529 ± 0.361
0.265MetHis: 0.265 ± 0.268
1.323MetIle: 1.323 ± 0.455
2.912MetLys: 2.912 ± 0.771
2.382MetLeu: 2.382 ± 0.701
0.265MetMet: 0.265 ± 0.218
2.647MetAsn: 2.647 ± 0.642
0.265MetPro: 0.265 ± 0.239
0.794MetGln: 0.794 ± 0.51
1.323MetArg: 1.323 ± 0.488
1.059MetSer: 1.059 ± 0.538
3.706MetThr: 3.706 ± 0.897
0.794MetVal: 0.794 ± 0.332
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.235AsnAla: 4.235 ± 0.862
0.265AsnCys: 0.265 ± 0.268
2.382AsnAsp: 2.382 ± 0.96
3.706AsnGlu: 3.706 ± 0.916
0.794AsnPhe: 0.794 ± 0.469
4.235AsnGly: 4.235 ± 0.947
1.853AsnHis: 1.853 ± 0.498
2.118AsnIle: 2.118 ± 0.874
3.97AsnLys: 3.97 ± 0.975
5.823AsnLeu: 5.823 ± 0.804
1.323AsnMet: 1.323 ± 0.403
1.853AsnAsn: 1.853 ± 0.551
2.647AsnPro: 2.647 ± 0.862
3.176AsnGln: 3.176 ± 0.856
3.706AsnArg: 3.706 ± 0.793
2.647AsnSer: 2.647 ± 0.838
2.647AsnThr: 2.647 ± 0.791
2.912AsnVal: 2.912 ± 0.867
0.265AsnTrp: 0.265 ± 0.318
2.118AsnTyr: 2.118 ± 0.634
0.0AsnXaa: 0.0 ± 0.0
Pro
0.529ProAla: 0.529 ± 0.405
0.265ProCys: 0.265 ± 0.245
1.059ProAsp: 1.059 ± 0.469
3.706ProGlu: 3.706 ± 0.991
1.059ProPhe: 1.059 ± 0.524
0.794ProGly: 0.794 ± 0.393
0.0ProHis: 0.0 ± 0.0
1.588ProIle: 1.588 ± 0.732
4.764ProLys: 4.764 ± 1.006
3.176ProLeu: 3.176 ± 0.874
0.265ProMet: 0.265 ± 0.219
1.853ProAsn: 1.853 ± 0.694
0.529ProPro: 0.529 ± 0.359
1.323ProGln: 1.323 ± 0.6
2.118ProArg: 2.118 ± 0.766
1.588ProSer: 1.588 ± 0.486
2.647ProThr: 2.647 ± 0.657
2.118ProVal: 2.118 ± 0.661
0.0ProTrp: 0.0 ± 0.0
0.794ProTyr: 0.794 ± 0.507
0.0ProXaa: 0.0 ± 0.0
Gln
2.118GlnAla: 2.118 ± 0.625
0.0GlnCys: 0.0 ± 0.0
1.323GlnAsp: 1.323 ± 0.518
3.706GlnGlu: 3.706 ± 0.857
1.588GlnPhe: 1.588 ± 0.428
3.706GlnGly: 3.706 ± 1.054
1.059GlnHis: 1.059 ± 0.568
2.382GlnIle: 2.382 ± 0.543
5.029GlnLys: 5.029 ± 1.03
7.676GlnLeu: 7.676 ± 1.124
1.059GlnMet: 1.059 ± 0.711
2.118GlnAsn: 2.118 ± 0.746
2.118GlnPro: 2.118 ± 0.799
1.853GlnGln: 1.853 ± 0.481
2.647GlnArg: 2.647 ± 0.552
2.912GlnSer: 2.912 ± 0.77
1.853GlnThr: 1.853 ± 0.902
3.441GlnVal: 3.441 ± 0.913
0.0GlnTrp: 0.0 ± 0.0
0.794GlnTyr: 0.794 ± 0.388
0.0GlnXaa: 0.0 ± 0.0
Arg
1.853ArgAla: 1.853 ± 0.475
0.529ArgCys: 0.529 ± 0.277
2.647ArgAsp: 2.647 ± 0.801
2.118ArgGlu: 2.118 ± 0.695
1.853ArgPhe: 1.853 ± 0.693
3.441ArgGly: 3.441 ± 1.54
1.323ArgHis: 1.323 ± 0.593
1.853ArgIle: 1.853 ± 0.514
6.882ArgLys: 6.882 ± 1.24
3.97ArgLeu: 3.97 ± 0.769
0.794ArgMet: 0.794 ± 0.409
1.323ArgAsn: 1.323 ± 0.528
1.059ArgPro: 1.059 ± 0.572
3.97ArgGln: 3.97 ± 0.71
2.382ArgArg: 2.382 ± 0.722
2.647ArgSer: 2.647 ± 0.796
2.118ArgThr: 2.118 ± 0.59
3.706ArgVal: 3.706 ± 0.744
0.529ArgTrp: 0.529 ± 0.328
3.706ArgTyr: 3.706 ± 1.167
0.0ArgXaa: 0.0 ± 0.0
Ser
2.647SerAla: 2.647 ± 0.923
0.529SerCys: 0.529 ± 0.362
5.029SerAsp: 5.029 ± 1.125
2.912SerGlu: 2.912 ± 0.736
1.588SerPhe: 1.588 ± 0.471
2.647SerGly: 2.647 ± 0.884
0.794SerHis: 0.794 ± 0.399
5.294SerIle: 5.294 ± 0.729
4.5SerLys: 4.5 ± 1.036
6.882SerLeu: 6.882 ± 1.076
1.853SerMet: 1.853 ± 0.799
3.176SerAsn: 3.176 ± 0.757
0.794SerPro: 0.794 ± 0.416
3.441SerGln: 3.441 ± 0.757
1.853SerArg: 1.853 ± 0.643
2.647SerSer: 2.647 ± 1.033
3.97SerThr: 3.97 ± 1.013
4.235SerVal: 4.235 ± 1.148
0.794SerTrp: 0.794 ± 0.426
1.853SerTyr: 1.853 ± 0.77
0.0SerXaa: 0.0 ± 0.0
Thr
3.706ThrAla: 3.706 ± 0.875
0.0ThrCys: 0.0 ± 0.0
1.323ThrAsp: 1.323 ± 0.567
3.706ThrGlu: 3.706 ± 1.204
3.176ThrPhe: 3.176 ± 0.966
5.029ThrGly: 5.029 ± 1.18
0.529ThrHis: 0.529 ± 0.294
7.147ThrIle: 7.147 ± 1.697
2.382ThrLys: 2.382 ± 0.86
6.882ThrLeu: 6.882 ± 1.304
0.529ThrMet: 0.529 ± 0.307
2.118ThrAsn: 2.118 ± 1.201
2.647ThrPro: 2.647 ± 0.931
3.441ThrGln: 3.441 ± 0.879
3.441ThrArg: 3.441 ± 1.003
4.5ThrSer: 4.5 ± 0.986
4.235ThrThr: 4.235 ± 1.42
2.647ThrVal: 2.647 ± 0.641
0.794ThrTrp: 0.794 ± 0.397
5.029ThrTyr: 5.029 ± 1.193
0.0ThrXaa: 0.0 ± 0.0
Val
2.647ValAla: 2.647 ± 0.63
0.794ValCys: 0.794 ± 0.391
3.441ValAsp: 3.441 ± 0.881
3.176ValGlu: 3.176 ± 0.976
3.441ValPhe: 3.441 ± 0.778
1.588ValGly: 1.588 ± 0.523
1.323ValHis: 1.323 ± 0.888
5.558ValIle: 5.558 ± 1.212
3.97ValLys: 3.97 ± 0.711
5.558ValLeu: 5.558 ± 1.215
2.118ValMet: 2.118 ± 0.547
3.441ValAsn: 3.441 ± 0.807
1.059ValPro: 1.059 ± 0.544
3.176ValGln: 3.176 ± 0.968
2.118ValArg: 2.118 ± 0.769
3.97ValSer: 3.97 ± 1.106
3.441ValThr: 3.441 ± 1.002
2.382ValVal: 2.382 ± 0.697
0.794ValTrp: 0.794 ± 0.506
1.588ValTyr: 1.588 ± 0.426
0.0ValXaa: 0.0 ± 0.0
Trp
0.265TrpAla: 0.265 ± 0.236
0.0TrpCys: 0.0 ± 0.0
0.529TrpAsp: 0.529 ± 0.365
0.794TrpGlu: 0.794 ± 0.416
0.265TrpPhe: 0.265 ± 0.218
0.265TrpGly: 0.265 ± 0.239
0.265TrpHis: 0.265 ± 0.268
0.265TrpIle: 0.265 ± 0.28
1.323TrpLys: 1.323 ± 0.544
1.853TrpLeu: 1.853 ± 0.675
0.0TrpMet: 0.0 ± 0.0
0.265TrpAsn: 0.265 ± 0.236
0.265TrpPro: 0.265 ± 0.23
0.794TrpGln: 0.794 ± 0.435
0.265TrpArg: 0.265 ± 0.219
0.265TrpSer: 0.265 ± 0.244
0.529TrpThr: 0.529 ± 0.52
1.323TrpVal: 1.323 ± 0.554
0.265TrpTrp: 0.265 ± 0.236
0.265TrpTyr: 0.265 ± 0.242
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.059TyrAla: 1.059 ± 0.454
0.529TyrCys: 0.529 ± 0.348
3.176TyrAsp: 3.176 ± 0.72
3.176TyrGlu: 3.176 ± 0.829
2.382TyrPhe: 2.382 ± 0.879
3.176TyrGly: 3.176 ± 0.959
1.588TyrHis: 1.588 ± 0.449
1.853TyrIle: 1.853 ± 0.547
3.441TyrLys: 3.441 ± 0.98
2.912TyrLeu: 2.912 ± 0.434
1.323TyrMet: 1.323 ± 0.637
4.764TyrAsn: 4.764 ± 1.101
1.323TyrPro: 1.323 ± 0.728
2.647TyrGln: 2.647 ± 0.82
3.441TyrArg: 3.441 ± 1.19
1.853TyrSer: 1.853 ± 0.744
2.912TyrThr: 2.912 ± 0.656
2.382TyrVal: 2.382 ± 0.696
0.794TyrTrp: 0.794 ± 0.538
4.5TyrTyr: 4.5 ± 0.988
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 22 proteins (3779 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski