Amino acid dipepetide frequency for Pectobacterium phage PP1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.291AlaAla: 10.291 ± 1.749
0.681AlaCys: 0.681 ± 0.263
6.507AlaAsp: 6.507 ± 0.609
6.81AlaGlu: 6.81 ± 0.936
3.481AlaPhe: 3.481 ± 0.597
6.81AlaGly: 6.81 ± 0.891
1.74AlaHis: 1.74 ± 0.345
5.297AlaIle: 5.297 ± 0.628
5.297AlaLys: 5.297 ± 0.583
8.399AlaLeu: 8.399 ± 0.597
4.389AlaMet: 4.389 ± 0.758
2.421AlaAsn: 2.421 ± 0.423
2.951AlaPro: 2.951 ± 0.528
4.389AlaGln: 4.389 ± 0.769
5.599AlaArg: 5.599 ± 0.671
4.616AlaSer: 4.616 ± 0.638
4.01AlaThr: 4.01 ± 0.719
6.81AlaVal: 6.81 ± 0.621
1.059AlaTrp: 1.059 ± 0.41
2.119AlaTyr: 2.119 ± 0.379
0.0AlaXaa: 0.0 ± 0.0
Cys
0.605CysAla: 0.605 ± 0.259
0.0CysCys: 0.0 ± 0.0
0.454CysAsp: 0.454 ± 0.195
0.227CysGlu: 0.227 ± 0.127
0.227CysPhe: 0.227 ± 0.137
0.681CysGly: 0.681 ± 0.283
0.227CysHis: 0.227 ± 0.128
0.378CysIle: 0.378 ± 0.154
0.984CysLys: 0.984 ± 0.26
0.681CysLeu: 0.681 ± 0.198
0.605CysMet: 0.605 ± 0.231
0.53CysAsn: 0.53 ± 0.238
0.605CysPro: 0.605 ± 0.187
0.227CysGln: 0.227 ± 0.133
0.605CysArg: 0.605 ± 0.235
0.378CysSer: 0.378 ± 0.146
0.227CysThr: 0.227 ± 0.155
0.984CysVal: 0.984 ± 0.315
0.151CysTrp: 0.151 ± 0.104
0.605CysTyr: 0.605 ± 0.235
0.0CysXaa: 0.0 ± 0.0
Asp
7.34AspAla: 7.34 ± 0.68
0.378AspCys: 0.378 ± 0.155
4.162AspAsp: 4.162 ± 0.464
4.01AspGlu: 4.01 ± 0.649
1.74AspPhe: 1.74 ± 0.3
4.918AspGly: 4.918 ± 0.956
1.059AspHis: 1.059 ± 0.276
4.389AspIle: 4.389 ± 0.628
3.632AspLys: 3.632 ± 0.461
5.221AspLeu: 5.221 ± 0.683
1.74AspMet: 1.74 ± 0.332
2.951AspAsn: 2.951 ± 0.722
2.27AspPro: 2.27 ± 0.432
0.832AspGln: 0.832 ± 0.251
2.497AspArg: 2.497 ± 0.407
3.935AspSer: 3.935 ± 0.467
4.086AspThr: 4.086 ± 0.58
4.086AspVal: 4.086 ± 0.657
1.135AspTrp: 1.135 ± 0.413
1.665AspTyr: 1.665 ± 0.401
0.0AspXaa: 0.0 ± 0.0
Glu
7.188GluAla: 7.188 ± 1.023
0.757GluCys: 0.757 ± 0.25
3.783GluAsp: 3.783 ± 0.501
4.691GluGlu: 4.691 ± 0.765
2.875GluPhe: 2.875 ± 0.419
5.675GluGly: 5.675 ± 0.671
1.816GluHis: 1.816 ± 0.361
2.27GluIle: 2.27 ± 0.401
3.102GluLys: 3.102 ± 0.452
5.826GluLeu: 5.826 ± 0.539
2.573GluMet: 2.573 ± 0.415
2.043GluAsn: 2.043 ± 0.426
2.194GluPro: 2.194 ± 0.475
3.178GluGln: 3.178 ± 0.48
3.935GluArg: 3.935 ± 0.422
2.951GluSer: 2.951 ± 0.384
3.178GluThr: 3.178 ± 0.522
4.994GluVal: 4.994 ± 0.603
1.438GluTrp: 1.438 ± 0.413
2.119GluTyr: 2.119 ± 0.578
0.0GluXaa: 0.0 ± 0.0
Phe
3.481PheAla: 3.481 ± 0.56
0.378PheCys: 0.378 ± 0.146
2.724PheAsp: 2.724 ± 0.405
2.724PheGlu: 2.724 ± 0.476
1.665PhePhe: 1.665 ± 0.387
3.254PheGly: 3.254 ± 0.648
0.53PheHis: 0.53 ± 0.179
1.967PheIle: 1.967 ± 0.219
2.043PheLys: 2.043 ± 0.443
2.875PheLeu: 2.875 ± 0.393
0.908PheMet: 0.908 ± 0.245
2.497PheAsn: 2.497 ± 0.352
1.513PhePro: 1.513 ± 0.31
0.984PheGln: 0.984 ± 0.212
1.665PheArg: 1.665 ± 0.361
2.497PheSer: 2.497 ± 0.392
2.119PheThr: 2.119 ± 0.364
1.967PheVal: 1.967 ± 0.419
0.227PheTrp: 0.227 ± 0.146
0.908PheTyr: 0.908 ± 0.273
0.0PheXaa: 0.0 ± 0.0
Gly
6.507GlyAla: 6.507 ± 0.709
1.211GlyCys: 1.211 ± 0.348
6.205GlyAsp: 6.205 ± 0.9
4.616GlyGlu: 4.616 ± 0.591
3.329GlyPhe: 3.329 ± 0.776
6.81GlyGly: 6.81 ± 0.87
2.497GlyHis: 2.497 ± 0.317
4.389GlyIle: 4.389 ± 0.506
6.583GlyLys: 6.583 ± 0.943
4.389GlyLeu: 4.389 ± 0.727
2.27GlyMet: 2.27 ± 0.426
3.481GlyAsn: 3.481 ± 0.527
0.757GlyPro: 0.757 ± 0.267
2.875GlyGln: 2.875 ± 0.327
4.994GlyArg: 4.994 ± 0.455
5.448GlySer: 5.448 ± 0.838
3.783GlyThr: 3.783 ± 0.5
5.297GlyVal: 5.297 ± 0.687
1.362GlyTrp: 1.362 ± 0.283
3.178GlyTyr: 3.178 ± 0.293
0.0GlyXaa: 0.0 ± 0.0
His
1.135HisAla: 1.135 ± 0.258
0.151HisCys: 0.151 ± 0.103
1.665HisAsp: 1.665 ± 0.324
0.984HisGlu: 0.984 ± 0.267
1.135HisPhe: 1.135 ± 0.262
1.816HisGly: 1.816 ± 0.331
0.454HisHis: 0.454 ± 0.172
1.438HisIle: 1.438 ± 0.419
0.908HisLys: 0.908 ± 0.235
2.346HisLeu: 2.346 ± 0.424
0.832HisMet: 0.832 ± 0.33
1.362HisAsn: 1.362 ± 0.307
0.757HisPro: 0.757 ± 0.259
0.681HisGln: 0.681 ± 0.192
1.665HisArg: 1.665 ± 0.305
1.135HisSer: 1.135 ± 0.327
0.908HisThr: 0.908 ± 0.21
1.665HisVal: 1.665 ± 0.312
0.303HisTrp: 0.303 ± 0.166
0.681HisTyr: 0.681 ± 0.206
0.0HisXaa: 0.0 ± 0.0
Ile
5.145IleAla: 5.145 ± 0.604
0.151IleCys: 0.151 ± 0.105
3.481IleAsp: 3.481 ± 0.522
4.313IleGlu: 4.313 ± 0.444
1.362IlePhe: 1.362 ± 0.469
4.01IleGly: 4.01 ± 0.645
0.984IleHis: 0.984 ± 0.201
1.967IleIle: 1.967 ± 0.502
3.935IleLys: 3.935 ± 0.646
2.951IleLeu: 2.951 ± 0.449
0.908IleMet: 0.908 ± 0.291
2.573IleAsn: 2.573 ± 0.44
1.892IlePro: 1.892 ± 0.319
1.967IleGln: 1.967 ± 0.342
1.967IleArg: 1.967 ± 0.317
2.875IleSer: 2.875 ± 0.445
4.313IleThr: 4.313 ± 0.528
3.027IleVal: 3.027 ± 0.406
0.605IleTrp: 0.605 ± 0.156
1.211IleTyr: 1.211 ± 0.355
0.0IleXaa: 0.0 ± 0.0
Lys
5.599LysAla: 5.599 ± 0.77
0.681LysCys: 0.681 ± 0.219
3.481LysAsp: 3.481 ± 0.512
4.237LysGlu: 4.237 ± 0.42
2.346LysPhe: 2.346 ± 0.422
4.464LysGly: 4.464 ± 0.735
1.059LysHis: 1.059 ± 0.371
2.648LysIle: 2.648 ± 0.38
3.254LysLys: 3.254 ± 0.547
5.145LysLeu: 5.145 ± 0.68
2.043LysMet: 2.043 ± 0.426
2.194LysAsn: 2.194 ± 0.481
3.027LysPro: 3.027 ± 0.303
2.194LysGln: 2.194 ± 0.384
3.556LysArg: 3.556 ± 0.471
3.556LysSer: 3.556 ± 0.591
2.421LysThr: 2.421 ± 0.367
4.616LysVal: 4.616 ± 0.541
0.53LysTrp: 0.53 ± 0.183
1.816LysTyr: 1.816 ± 0.344
0.0LysXaa: 0.0 ± 0.0
Leu
6.583LeuAla: 6.583 ± 0.666
0.908LeuCys: 0.908 ± 0.284
4.691LeuAsp: 4.691 ± 0.634
4.616LeuGlu: 4.616 ± 0.515
2.194LeuPhe: 2.194 ± 0.497
6.886LeuGly: 6.886 ± 0.705
1.362LeuHis: 1.362 ± 0.325
3.405LeuIle: 3.405 ± 0.641
4.54LeuLys: 4.54 ± 0.603
5.221LeuLeu: 5.221 ± 0.624
2.724LeuMet: 2.724 ± 0.477
4.01LeuAsn: 4.01 ± 0.616
3.481LeuPro: 3.481 ± 0.457
3.859LeuGln: 3.859 ± 0.719
4.616LeuArg: 4.616 ± 0.634
5.221LeuSer: 5.221 ± 0.651
4.162LeuThr: 4.162 ± 0.488
5.145LeuVal: 5.145 ± 0.532
1.135LeuTrp: 1.135 ± 0.297
2.043LeuTyr: 2.043 ± 0.373
0.0LeuXaa: 0.0 ± 0.0
Met
3.556MetAla: 3.556 ± 0.448
0.303MetCys: 0.303 ± 0.15
1.362MetAsp: 1.362 ± 0.366
1.816MetGlu: 1.816 ± 0.406
1.135MetPhe: 1.135 ± 0.292
2.724MetGly: 2.724 ± 0.741
0.908MetHis: 0.908 ± 0.311
1.362MetIle: 1.362 ± 0.294
1.74MetLys: 1.74 ± 0.298
2.875MetLeu: 2.875 ± 0.471
1.665MetMet: 1.665 ± 0.405
1.74MetAsn: 1.74 ± 0.355
1.211MetPro: 1.211 ± 0.213
1.589MetGln: 1.589 ± 0.362
1.967MetArg: 1.967 ± 0.358
3.254MetSer: 3.254 ± 0.526
2.951MetThr: 2.951 ± 0.546
1.513MetVal: 1.513 ± 0.325
0.227MetTrp: 0.227 ± 0.148
0.984MetTyr: 0.984 ± 0.324
0.0MetXaa: 0.0 ± 0.0
Asn
2.875AsnAla: 2.875 ± 0.401
0.53AsnCys: 0.53 ± 0.241
2.421AsnAsp: 2.421 ± 0.327
2.875AsnGlu: 2.875 ± 0.368
1.513AsnPhe: 1.513 ± 0.3
3.708AsnGly: 3.708 ± 0.423
1.211AsnHis: 1.211 ± 0.292
1.967AsnIle: 1.967 ± 0.458
2.573AsnLys: 2.573 ± 0.461
3.405AsnLeu: 3.405 ± 0.439
2.421AsnMet: 2.421 ± 0.482
1.211AsnAsn: 1.211 ± 0.233
2.043AsnPro: 2.043 ± 0.29
2.119AsnGln: 2.119 ± 0.457
3.481AsnArg: 3.481 ± 0.513
2.27AsnSer: 2.27 ± 0.592
2.8AsnThr: 2.8 ± 0.507
2.27AsnVal: 2.27 ± 0.484
0.605AsnTrp: 0.605 ± 0.183
1.438AsnTyr: 1.438 ± 0.314
0.0AsnXaa: 0.0 ± 0.0
Pro
3.329ProAla: 3.329 ± 0.549
0.454ProCys: 0.454 ± 0.183
2.875ProAsp: 2.875 ± 0.486
2.875ProGlu: 2.875 ± 0.437
1.438ProPhe: 1.438 ± 0.325
1.892ProGly: 1.892 ± 0.469
0.53ProHis: 0.53 ± 0.3
1.059ProIle: 1.059 ± 0.353
2.346ProLys: 2.346 ± 0.316
2.043ProLeu: 2.043 ± 0.33
0.757ProMet: 0.757 ± 0.228
1.816ProAsn: 1.816 ± 0.471
1.438ProPro: 1.438 ± 0.273
1.438ProGln: 1.438 ± 0.266
0.984ProArg: 0.984 ± 0.279
2.346ProSer: 2.346 ± 0.488
2.648ProThr: 2.648 ± 0.432
3.632ProVal: 3.632 ± 0.554
0.757ProTrp: 0.757 ± 0.237
1.74ProTyr: 1.74 ± 0.341
0.0ProXaa: 0.0 ± 0.0
Gln
4.464GlnAla: 4.464 ± 0.501
0.454GlnCys: 0.454 ± 0.176
2.27GlnAsp: 2.27 ± 0.408
3.027GlnGlu: 3.027 ± 0.571
1.513GlnPhe: 1.513 ± 0.366
3.329GlnGly: 3.329 ± 0.508
0.832GlnHis: 0.832 ± 0.229
1.967GlnIle: 1.967 ± 0.345
1.286GlnLys: 1.286 ± 0.289
2.724GlnLeu: 2.724 ± 0.42
1.438GlnMet: 1.438 ± 0.455
1.665GlnAsn: 1.665 ± 0.381
0.984GlnPro: 0.984 ± 0.281
2.27GlnGln: 2.27 ± 0.52
1.816GlnArg: 1.816 ± 0.505
3.102GlnSer: 3.102 ± 0.488
1.74GlnThr: 1.74 ± 0.308
2.8GlnVal: 2.8 ± 0.404
0.53GlnTrp: 0.53 ± 0.251
2.119GlnTyr: 2.119 ± 0.427
0.0GlnXaa: 0.0 ± 0.0
Arg
5.145ArgAla: 5.145 ± 0.584
0.605ArgCys: 0.605 ± 0.236
3.708ArgAsp: 3.708 ± 0.375
4.01ArgGlu: 4.01 ± 0.445
1.665ArgPhe: 1.665 ± 0.299
4.086ArgGly: 4.086 ± 0.629
1.362ArgHis: 1.362 ± 0.394
3.027ArgIle: 3.027 ± 0.391
2.875ArgLys: 2.875 ± 0.498
5.145ArgLeu: 5.145 ± 0.5
2.497ArgMet: 2.497 ± 0.403
2.119ArgAsn: 2.119 ± 0.327
1.513ArgPro: 1.513 ± 0.372
2.043ArgGln: 2.043 ± 0.231
3.556ArgArg: 3.556 ± 0.405
3.178ArgSer: 3.178 ± 0.563
2.951ArgThr: 2.951 ± 0.501
4.313ArgVal: 4.313 ± 0.6
0.757ArgTrp: 0.757 ± 0.3
1.286ArgTyr: 1.286 ± 0.261
0.0ArgXaa: 0.0 ± 0.0
Ser
4.54SerAla: 4.54 ± 0.634
0.378SerCys: 0.378 ± 0.162
2.875SerAsp: 2.875 ± 0.456
3.783SerGlu: 3.783 ± 0.65
2.573SerPhe: 2.573 ± 0.287
6.129SerGly: 6.129 ± 0.78
1.438SerHis: 1.438 ± 0.352
3.254SerIle: 3.254 ± 0.432
3.102SerLys: 3.102 ± 0.423
4.616SerLeu: 4.616 ± 0.628
2.119SerMet: 2.119 ± 0.412
3.178SerAsn: 3.178 ± 0.568
1.892SerPro: 1.892 ± 0.41
2.421SerGln: 2.421 ± 0.429
4.086SerArg: 4.086 ± 0.589
3.102SerSer: 3.102 ± 0.519
4.237SerThr: 4.237 ± 0.486
5.524SerVal: 5.524 ± 0.602
0.757SerTrp: 0.757 ± 0.253
1.816SerTyr: 1.816 ± 0.557
0.0SerXaa: 0.0 ± 0.0
Thr
5.07ThrAla: 5.07 ± 0.76
0.454ThrCys: 0.454 ± 0.171
3.102ThrAsp: 3.102 ± 0.584
3.783ThrGlu: 3.783 ± 0.645
3.102ThrPhe: 3.102 ± 0.648
4.389ThrGly: 4.389 ± 0.577
1.286ThrHis: 1.286 ± 0.25
3.178ThrIle: 3.178 ± 0.459
3.405ThrLys: 3.405 ± 0.467
4.237ThrLeu: 4.237 ± 0.592
1.211ThrMet: 1.211 ± 0.315
2.119ThrAsn: 2.119 ± 0.438
2.573ThrPro: 2.573 ± 0.348
2.043ThrGln: 2.043 ± 0.417
2.648ThrArg: 2.648 ± 0.429
4.01ThrSer: 4.01 ± 0.537
3.783ThrThr: 3.783 ± 0.891
3.783ThrVal: 3.783 ± 0.694
0.681ThrTrp: 0.681 ± 0.258
2.497ThrTyr: 2.497 ± 0.572
0.0ThrXaa: 0.0 ± 0.0
Val
6.659ValAla: 6.659 ± 0.699
0.53ValCys: 0.53 ± 0.217
3.329ValAsp: 3.329 ± 0.616
4.464ValGlu: 4.464 ± 0.498
2.573ValPhe: 2.573 ± 0.467
4.918ValGly: 4.918 ± 0.712
1.816ValHis: 1.816 ± 0.368
3.481ValIle: 3.481 ± 0.42
4.162ValLys: 4.162 ± 0.685
4.691ValLeu: 4.691 ± 0.475
2.421ValMet: 2.421 ± 0.367
3.708ValAsn: 3.708 ± 0.572
3.405ValPro: 3.405 ± 0.515
3.329ValGln: 3.329 ± 0.566
4.237ValArg: 4.237 ± 0.557
4.616ValSer: 4.616 ± 0.552
4.162ValThr: 4.162 ± 0.54
4.162ValVal: 4.162 ± 0.626
0.757ValTrp: 0.757 ± 0.194
1.816ValTyr: 1.816 ± 0.395
0.0ValXaa: 0.0 ± 0.0
Trp
0.984TrpAla: 0.984 ± 0.248
0.151TrpCys: 0.151 ± 0.106
0.454TrpAsp: 0.454 ± 0.17
1.362TrpGlu: 1.362 ± 0.287
0.303TrpPhe: 0.303 ± 0.172
0.757TrpGly: 0.757 ± 0.149
0.303TrpHis: 0.303 ± 0.163
0.303TrpIle: 0.303 ± 0.146
1.211TrpLys: 1.211 ± 0.282
1.135TrpLeu: 1.135 ± 0.353
0.303TrpMet: 0.303 ± 0.168
0.757TrpAsn: 0.757 ± 0.213
0.832TrpPro: 0.832 ± 0.239
0.681TrpGln: 0.681 ± 0.201
0.53TrpArg: 0.53 ± 0.244
1.286TrpSer: 1.286 ± 0.416
0.378TrpThr: 0.378 ± 0.166
1.211TrpVal: 1.211 ± 0.342
0.151TrpTrp: 0.151 ± 0.091
0.303TrpTyr: 0.303 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.254TyrAla: 3.254 ± 0.506
0.227TyrCys: 0.227 ± 0.127
2.497TyrAsp: 2.497 ± 0.395
1.362TyrGlu: 1.362 ± 0.282
0.757TyrPhe: 0.757 ± 0.2
2.724TyrGly: 2.724 ± 0.495
0.605TyrHis: 0.605 ± 0.223
1.892TyrIle: 1.892 ± 0.292
1.892TyrLys: 1.892 ± 0.385
2.875TyrLeu: 2.875 ± 0.467
0.984TyrMet: 0.984 ± 0.192
1.362TyrAsn: 1.362 ± 0.358
1.059TyrPro: 1.059 ± 0.314
1.211TyrGln: 1.211 ± 0.243
1.438TyrArg: 1.438 ± 0.284
2.043TyrSer: 2.043 ± 0.439
2.497TyrThr: 2.497 ± 0.355
1.438TyrVal: 1.438 ± 0.409
0.227TyrTrp: 0.227 ± 0.127
0.757TyrTyr: 0.757 ± 0.277
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (13217 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski