Amino acid dipepetide frequency for Streptococcus phage Javan51

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.61AlaAla: 5.61 ± 0.931
0.475AlaCys: 0.475 ± 0.183
4.945AlaAsp: 4.945 ± 0.669
3.994AlaGlu: 3.994 ± 0.744
2.663AlaPhe: 2.663 ± 0.373
5.04AlaGly: 5.04 ± 0.979
0.761AlaHis: 0.761 ± 0.257
5.04AlaIle: 5.04 ± 0.776
4.755AlaLys: 4.755 ± 0.632
5.706AlaLeu: 5.706 ± 1.06
2.282AlaMet: 2.282 ± 0.438
3.328AlaAsn: 3.328 ± 0.541
1.521AlaPro: 1.521 ± 0.346
3.233AlaGln: 3.233 ± 0.633
3.899AlaArg: 3.899 ± 0.611
4.945AlaSer: 4.945 ± 0.567
6.086AlaThr: 6.086 ± 0.802
4.66AlaVal: 4.66 ± 0.576
0.666AlaTrp: 0.666 ± 0.196
3.043AlaTyr: 3.043 ± 0.485
0.0AlaXaa: 0.0 ± 0.0
Cys
0.285CysAla: 0.285 ± 0.256
0.095CysCys: 0.095 ± 0.098
0.475CysAsp: 0.475 ± 0.201
0.571CysGlu: 0.571 ± 0.207
0.19CysPhe: 0.19 ± 0.136
0.475CysGly: 0.475 ± 0.203
0.095CysHis: 0.095 ± 0.107
0.285CysIle: 0.285 ± 0.14
0.19CysLys: 0.19 ± 0.134
1.046CysLeu: 1.046 ± 0.406
0.095CysMet: 0.095 ± 0.077
0.285CysAsn: 0.285 ± 0.177
0.38CysPro: 0.38 ± 0.228
0.475CysGln: 0.475 ± 0.196
0.475CysArg: 0.475 ± 0.177
0.38CysSer: 0.38 ± 0.2
0.19CysThr: 0.19 ± 0.117
0.38CysVal: 0.38 ± 0.154
0.0CysTrp: 0.0 ± 0.0
0.761CysTyr: 0.761 ± 0.286
0.0CysXaa: 0.0 ± 0.0
Asp
3.709AspAla: 3.709 ± 0.53
0.38AspCys: 0.38 ± 0.181
4.279AspAsp: 4.279 ± 0.905
4.945AspGlu: 4.945 ± 0.588
2.853AspPhe: 2.853 ± 0.475
4.184AspGly: 4.184 ± 0.679
1.046AspHis: 1.046 ± 0.38
3.709AspIle: 3.709 ± 0.589
4.945AspLys: 4.945 ± 0.584
7.037AspLeu: 7.037 ± 0.767
2.092AspMet: 2.092 ± 0.402
2.282AspAsn: 2.282 ± 0.597
2.092AspPro: 2.092 ± 0.647
2.187AspGln: 2.187 ± 0.503
2.282AspArg: 2.282 ± 0.561
2.948AspSer: 2.948 ± 0.394
2.948AspThr: 2.948 ± 0.541
4.184AspVal: 4.184 ± 0.611
0.666AspTrp: 0.666 ± 0.233
3.709AspTyr: 3.709 ± 0.782
0.0AspXaa: 0.0 ± 0.0
Glu
5.325GluAla: 5.325 ± 0.762
0.856GluCys: 0.856 ± 0.256
4.66GluAsp: 4.66 ± 0.832
6.181GluGlu: 6.181 ± 0.966
2.758GluPhe: 2.758 ± 0.543
5.04GluGly: 5.04 ± 0.611
1.331GluHis: 1.331 ± 0.26
2.948GluIle: 2.948 ± 0.486
5.42GluLys: 5.42 ± 0.705
7.893GluLeu: 7.893 ± 0.912
1.997GluMet: 1.997 ± 0.507
2.948GluAsn: 2.948 ± 0.397
1.902GluPro: 1.902 ± 0.503
3.423GluGln: 3.423 ± 0.57
2.663GluArg: 2.663 ± 0.471
3.423GluSer: 3.423 ± 0.679
4.564GluThr: 4.564 ± 0.603
4.85GluVal: 4.85 ± 0.702
0.571GluTrp: 0.571 ± 0.295
1.617GluTyr: 1.617 ± 0.332
0.0GluXaa: 0.0 ± 0.0
Phe
2.187PheAla: 2.187 ± 0.523
0.38PheCys: 0.38 ± 0.184
2.948PheAsp: 2.948 ± 0.502
2.568PheGlu: 2.568 ± 0.531
1.712PhePhe: 1.712 ± 0.461
2.948PheGly: 2.948 ± 0.396
0.761PheHis: 0.761 ± 0.261
2.092PheIle: 2.092 ± 0.475
3.709PheLys: 3.709 ± 0.781
2.948PheLeu: 2.948 ± 0.598
1.046PheMet: 1.046 ± 0.289
2.282PheAsn: 2.282 ± 0.387
0.666PhePro: 0.666 ± 0.317
1.046PheGln: 1.046 ± 0.317
1.426PheArg: 1.426 ± 0.267
2.568PheSer: 2.568 ± 0.442
1.807PheThr: 1.807 ± 0.358
2.187PheVal: 2.187 ± 0.326
0.761PheTrp: 0.761 ± 0.258
2.092PheTyr: 2.092 ± 0.396
0.0PheXaa: 0.0 ± 0.0
Gly
3.614GlyAla: 3.614 ± 0.508
0.095GlyCys: 0.095 ± 0.087
4.564GlyAsp: 4.564 ± 0.612
3.994GlyGlu: 3.994 ± 0.541
2.472GlyPhe: 2.472 ± 0.38
5.515GlyGly: 5.515 ± 0.899
1.997GlyHis: 1.997 ± 0.466
5.135GlyIle: 5.135 ± 0.675
5.23GlyLys: 5.23 ± 0.713
6.086GlyLeu: 6.086 ± 0.825
2.282GlyMet: 2.282 ± 0.391
3.328GlyAsn: 3.328 ± 0.553
0.761GlyPro: 0.761 ± 0.234
2.853GlyGln: 2.853 ± 0.557
3.899GlyArg: 3.899 ± 0.673
5.325GlySer: 5.325 ± 1.265
4.755GlyThr: 4.755 ± 0.572
4.564GlyVal: 4.564 ± 1.072
0.666GlyTrp: 0.666 ± 0.254
2.758GlyTyr: 2.758 ± 0.584
0.0GlyXaa: 0.0 ± 0.0
His
0.951HisAla: 0.951 ± 0.295
0.095HisCys: 0.095 ± 0.109
1.712HisAsp: 1.712 ± 0.406
0.856HisGlu: 0.856 ± 0.298
0.951HisPhe: 0.951 ± 0.374
1.426HisGly: 1.426 ± 0.383
0.951HisHis: 0.951 ± 0.263
0.856HisIle: 0.856 ± 0.262
0.666HisLys: 0.666 ± 0.225
1.997HisLeu: 1.997 ± 0.352
0.571HisMet: 0.571 ± 0.245
1.521HisAsn: 1.521 ± 0.329
1.236HisPro: 1.236 ± 0.476
0.475HisGln: 0.475 ± 0.216
1.236HisArg: 1.236 ± 0.369
1.236HisSer: 1.236 ± 0.267
0.856HisThr: 0.856 ± 0.358
1.141HisVal: 1.141 ± 0.361
0.095HisTrp: 0.095 ± 0.107
0.761HisTyr: 0.761 ± 0.317
0.0HisXaa: 0.0 ± 0.0
Ile
4.279IleAla: 4.279 ± 0.442
0.666IleCys: 0.666 ± 0.257
4.755IleAsp: 4.755 ± 0.599
3.709IleGlu: 3.709 ± 0.5
1.331IlePhe: 1.331 ± 0.49
4.469IleGly: 4.469 ± 0.694
0.951IleHis: 0.951 ± 0.274
3.518IleIle: 3.518 ± 0.631
4.564IleLys: 4.564 ± 0.94
4.85IleLeu: 4.85 ± 0.572
1.141IleMet: 1.141 ± 0.304
2.663IleAsn: 2.663 ± 0.448
1.902IlePro: 1.902 ± 0.419
1.997IleGln: 1.997 ± 0.484
1.902IleArg: 1.902 ± 0.49
3.709IleSer: 3.709 ± 0.62
4.945IleThr: 4.945 ± 0.792
3.328IleVal: 3.328 ± 0.741
0.951IleTrp: 0.951 ± 0.4
2.853IleTyr: 2.853 ± 0.583
0.0IleXaa: 0.0 ± 0.0
Lys
7.322LysAla: 7.322 ± 0.952
0.19LysCys: 0.19 ± 0.128
4.469LysAsp: 4.469 ± 0.856
4.85LysGlu: 4.85 ± 0.622
1.902LysPhe: 1.902 ± 0.332
4.66LysGly: 4.66 ± 0.625
1.807LysHis: 1.807 ± 0.476
3.804LysIle: 3.804 ± 0.519
4.564LysLys: 4.564 ± 0.862
6.466LysLeu: 6.466 ± 0.872
0.951LysMet: 0.951 ± 0.263
2.282LysAsn: 2.282 ± 0.394
2.377LysPro: 2.377 ± 0.408
3.518LysGln: 3.518 ± 0.637
3.899LysArg: 3.899 ± 0.69
3.233LysSer: 3.233 ± 0.462
4.945LysThr: 4.945 ± 0.75
5.04LysVal: 5.04 ± 0.932
1.141LysTrp: 1.141 ± 0.331
1.807LysTyr: 1.807 ± 0.432
0.0LysXaa: 0.0 ± 0.0
Leu
7.417LeuAla: 7.417 ± 0.969
0.38LeuCys: 0.38 ± 0.194
5.325LeuAsp: 5.325 ± 0.569
7.512LeuGlu: 7.512 ± 0.786
3.138LeuPhe: 3.138 ± 0.463
5.515LeuGly: 5.515 ± 0.805
1.141LeuHis: 1.141 ± 0.439
4.184LeuIle: 4.184 ± 0.537
8.463LeuLys: 8.463 ± 0.81
7.227LeuLeu: 7.227 ± 0.951
2.663LeuMet: 2.663 ± 0.454
4.374LeuAsn: 4.374 ± 0.583
3.709LeuPro: 3.709 ± 0.606
3.233LeuGln: 3.233 ± 0.69
3.804LeuArg: 3.804 ± 0.606
8.273LeuSer: 8.273 ± 1.136
6.752LeuThr: 6.752 ± 0.814
5.706LeuVal: 5.706 ± 0.833
0.951LeuTrp: 0.951 ± 0.206
4.089LeuTyr: 4.089 ± 0.847
0.0LeuXaa: 0.0 ± 0.0
Met
2.377MetAla: 2.377 ± 0.462
0.095MetCys: 0.095 ± 0.109
1.807MetAsp: 1.807 ± 0.414
1.521MetGlu: 1.521 ± 0.483
0.951MetPhe: 0.951 ± 0.297
2.092MetGly: 2.092 ± 0.529
0.0MetHis: 0.0 ± 0.0
0.856MetIle: 0.856 ± 0.206
1.902MetLys: 1.902 ± 0.388
1.236MetLeu: 1.236 ± 0.354
0.571MetMet: 0.571 ± 0.266
0.38MetAsn: 0.38 ± 0.161
0.475MetPro: 0.475 ± 0.147
0.571MetGln: 0.571 ± 0.184
1.141MetArg: 1.141 ± 0.279
1.902MetSer: 1.902 ± 0.446
3.043MetThr: 3.043 ± 0.519
1.902MetVal: 1.902 ± 0.405
0.19MetTrp: 0.19 ± 0.118
0.38MetTyr: 0.38 ± 0.199
0.0MetXaa: 0.0 ± 0.0
Asn
3.233AsnAla: 3.233 ± 0.486
0.285AsnCys: 0.285 ± 0.232
2.092AsnAsp: 2.092 ± 0.483
3.138AsnGlu: 3.138 ± 0.599
1.807AsnPhe: 1.807 ± 0.487
5.135AsnGly: 5.135 ± 0.823
1.141AsnHis: 1.141 ± 0.352
1.521AsnIle: 1.521 ± 0.425
2.472AsnLys: 2.472 ± 0.409
3.994AsnLeu: 3.994 ± 0.516
0.951AsnMet: 0.951 ± 0.264
1.807AsnAsn: 1.807 ± 0.53
1.807AsnPro: 1.807 ± 0.375
1.902AsnGln: 1.902 ± 0.423
1.807AsnArg: 1.807 ± 0.376
3.614AsnSer: 3.614 ± 0.5
2.377AsnThr: 2.377 ± 0.731
2.568AsnVal: 2.568 ± 0.458
1.046AsnTrp: 1.046 ± 0.368
1.141AsnTyr: 1.141 ± 0.323
0.0AsnXaa: 0.0 ± 0.0
Pro
1.331ProAla: 1.331 ± 0.358
0.285ProCys: 0.285 ± 0.125
1.997ProAsp: 1.997 ± 0.509
2.092ProGlu: 2.092 ± 0.571
1.141ProPhe: 1.141 ± 0.363
1.426ProGly: 1.426 ± 0.395
0.761ProHis: 0.761 ± 0.237
1.617ProIle: 1.617 ± 0.395
2.472ProLys: 2.472 ± 0.428
3.043ProLeu: 3.043 ± 0.38
0.475ProMet: 0.475 ± 0.199
1.426ProAsn: 1.426 ± 0.399
1.141ProPro: 1.141 ± 0.423
1.521ProGln: 1.521 ± 0.352
1.046ProArg: 1.046 ± 0.249
2.853ProSer: 2.853 ± 0.678
2.948ProThr: 2.948 ± 0.51
2.092ProVal: 2.092 ± 0.354
0.475ProTrp: 0.475 ± 0.194
1.426ProTyr: 1.426 ± 0.377
0.0ProXaa: 0.0 ± 0.0
Gln
4.564GlnAla: 4.564 ± 0.687
0.095GlnCys: 0.095 ± 0.099
1.426GlnAsp: 1.426 ± 0.332
3.043GlnGlu: 3.043 ± 0.524
2.282GlnPhe: 2.282 ± 0.48
2.187GlnGly: 2.187 ± 0.453
0.856GlnHis: 0.856 ± 0.231
2.948GlnIle: 2.948 ± 0.467
2.568GlnLys: 2.568 ± 0.408
4.089GlnLeu: 4.089 ± 0.594
0.666GlnMet: 0.666 ± 0.251
2.377GlnAsn: 2.377 ± 0.414
1.521GlnPro: 1.521 ± 0.459
1.712GlnGln: 1.712 ± 0.553
1.141GlnArg: 1.141 ± 0.299
2.663GlnSer: 2.663 ± 0.427
2.758GlnThr: 2.758 ± 0.455
3.233GlnVal: 3.233 ± 0.473
0.666GlnTrp: 0.666 ± 0.326
0.856GlnTyr: 0.856 ± 0.321
0.0GlnXaa: 0.0 ± 0.0
Arg
2.472ArgAla: 2.472 ± 0.563
0.571ArgCys: 0.571 ± 0.237
2.282ArgAsp: 2.282 ± 0.493
3.043ArgGlu: 3.043 ± 0.524
1.236ArgPhe: 1.236 ± 0.357
2.377ArgGly: 2.377 ± 0.342
1.046ArgHis: 1.046 ± 0.362
2.377ArgIle: 2.377 ± 0.425
3.518ArgLys: 3.518 ± 0.788
4.945ArgLeu: 4.945 ± 0.67
0.761ArgMet: 0.761 ± 0.252
1.997ArgAsn: 1.997 ± 0.433
1.141ArgPro: 1.141 ± 0.413
3.138ArgGln: 3.138 ± 0.488
1.807ArgArg: 1.807 ± 0.517
2.377ArgSer: 2.377 ± 0.413
1.712ArgThr: 1.712 ± 0.398
3.233ArgVal: 3.233 ± 0.528
0.666ArgTrp: 0.666 ± 0.276
1.807ArgTyr: 1.807 ± 0.394
0.0ArgXaa: 0.0 ± 0.0
Ser
5.325SerAla: 5.325 ± 1.14
0.38SerCys: 0.38 ± 0.185
4.469SerAsp: 4.469 ± 0.694
4.66SerGlu: 4.66 ± 0.791
2.758SerPhe: 2.758 ± 0.56
5.801SerGly: 5.801 ± 0.839
1.807SerHis: 1.807 ± 0.332
4.469SerIle: 4.469 ± 0.833
3.614SerLys: 3.614 ± 0.475
5.706SerLeu: 5.706 ± 0.839
1.141SerMet: 1.141 ± 0.377
2.187SerAsn: 2.187 ± 0.563
2.853SerPro: 2.853 ± 0.459
2.282SerGln: 2.282 ± 0.446
2.472SerArg: 2.472 ± 0.461
5.42SerSer: 5.42 ± 1.242
4.089SerThr: 4.089 ± 0.883
4.755SerVal: 4.755 ± 0.48
1.902SerTrp: 1.902 ± 0.334
2.092SerTyr: 2.092 ± 0.376
0.0SerXaa: 0.0 ± 0.0
Thr
4.85ThrAla: 4.85 ± 0.879
0.475ThrCys: 0.475 ± 0.226
2.853ThrAsp: 2.853 ± 0.494
4.755ThrGlu: 4.755 ± 0.525
3.804ThrPhe: 3.804 ± 0.842
4.66ThrGly: 4.66 ± 0.602
0.951ThrHis: 0.951 ± 0.296
5.515ThrIle: 5.515 ± 1.428
4.374ThrLys: 4.374 ± 0.494
6.466ThrLeu: 6.466 ± 0.871
1.046ThrMet: 1.046 ± 0.272
3.614ThrAsn: 3.614 ± 0.504
2.853ThrPro: 2.853 ± 0.681
2.282ThrGln: 2.282 ± 0.541
1.807ThrArg: 1.807 ± 0.406
5.42ThrSer: 5.42 ± 0.854
5.325ThrThr: 5.325 ± 0.525
5.42ThrVal: 5.42 ± 0.878
1.236ThrTrp: 1.236 ± 0.42
2.187ThrTyr: 2.187 ± 0.415
0.0ThrXaa: 0.0 ± 0.0
Val
3.614ValAla: 3.614 ± 0.608
0.475ValCys: 0.475 ± 0.209
3.899ValAsp: 3.899 ± 0.664
4.66ValGlu: 4.66 ± 0.835
2.377ValPhe: 2.377 ± 0.484
3.804ValGly: 3.804 ± 0.59
0.951ValHis: 0.951 ± 0.271
4.945ValIle: 4.945 ± 0.705
3.804ValLys: 3.804 ± 0.474
7.798ValLeu: 7.798 ± 0.71
1.331ValMet: 1.331 ± 0.35
2.092ValAsn: 2.092 ± 0.361
1.807ValPro: 1.807 ± 0.451
2.568ValGln: 2.568 ± 0.484
3.138ValArg: 3.138 ± 0.6
5.135ValSer: 5.135 ± 0.844
5.896ValThr: 5.896 ± 0.824
4.279ValVal: 4.279 ± 0.725
0.951ValTrp: 0.951 ± 0.388
2.472ValTyr: 2.472 ± 0.496
0.0ValXaa: 0.0 ± 0.0
Trp
1.141TrpAla: 1.141 ± 0.367
0.19TrpCys: 0.19 ± 0.156
0.761TrpAsp: 0.761 ± 0.24
1.426TrpGlu: 1.426 ± 0.287
0.761TrpPhe: 0.761 ± 0.286
0.475TrpGly: 0.475 ± 0.164
0.095TrpHis: 0.095 ± 0.088
0.666TrpIle: 0.666 ± 0.29
0.666TrpLys: 0.666 ± 0.237
1.331TrpLeu: 1.331 ± 0.256
0.475TrpMet: 0.475 ± 0.19
1.426TrpAsn: 1.426 ± 0.445
0.095TrpPro: 0.095 ± 0.084
0.856TrpGln: 0.856 ± 0.243
0.856TrpArg: 0.856 ± 0.259
0.761TrpSer: 0.761 ± 0.27
1.046TrpThr: 1.046 ± 0.304
0.856TrpVal: 0.856 ± 0.306
0.19TrpTrp: 0.19 ± 0.11
0.095TrpTyr: 0.095 ± 0.084
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.043TyrAla: 3.043 ± 0.437
0.666TyrCys: 0.666 ± 0.246
2.758TyrAsp: 2.758 ± 0.625
2.853TyrGlu: 2.853 ± 0.433
1.046TyrPhe: 1.046 ± 0.331
2.758TyrGly: 2.758 ± 0.419
1.141TyrHis: 1.141 ± 0.373
1.997TyrIle: 1.997 ± 0.45
1.331TyrLys: 1.331 ± 0.327
3.994TyrLeu: 3.994 ± 0.66
0.761TyrMet: 0.761 ± 0.255
1.236TyrAsn: 1.236 ± 0.323
1.331TyrPro: 1.331 ± 0.256
2.377TyrGln: 2.377 ± 0.422
1.807TyrArg: 1.807 ± 0.365
1.997TyrSer: 1.997 ± 0.44
2.853TyrThr: 2.853 ± 0.615
1.617TyrVal: 1.617 ± 0.363
0.38TyrTrp: 0.38 ± 0.172
1.046TyrTyr: 1.046 ± 0.306
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 35 proteins (10517 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski