Amino acid dipepetide frequency for Pseudomonas phage oldone

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.063AlaAla: 8.063 ± 1.535
0.864AlaCys: 0.864 ± 0.273
4.032AlaAsp: 4.032 ± 0.55
6.407AlaGlu: 6.407 ± 0.837
2.952AlaPhe: 2.952 ± 0.503
7.271AlaGly: 7.271 ± 1.355
1.152AlaHis: 1.152 ± 0.296
4.68AlaIle: 4.68 ± 0.683
5.112AlaLys: 5.112 ± 0.73
7.847AlaLeu: 7.847 ± 0.802
2.664AlaMet: 2.664 ± 0.416
3.456AlaAsn: 3.456 ± 0.573
3.168AlaPro: 3.168 ± 0.573
4.896AlaGln: 4.896 ± 1.053
3.24AlaArg: 3.24 ± 0.488
5.112AlaSer: 5.112 ± 0.694
3.384AlaThr: 3.384 ± 0.695
6.335AlaVal: 6.335 ± 0.928
0.864AlaTrp: 0.864 ± 0.255
2.88AlaTyr: 2.88 ± 0.56
0.0AlaXaa: 0.0 ± 0.0
Cys
0.864CysAla: 0.864 ± 0.288
0.144CysCys: 0.144 ± 0.107
0.648CysAsp: 0.648 ± 0.237
1.008CysGlu: 1.008 ± 0.272
0.288CysPhe: 0.288 ± 0.123
0.936CysGly: 0.936 ± 0.297
0.432CysHis: 0.432 ± 0.154
0.576CysIle: 0.576 ± 0.244
0.504CysLys: 0.504 ± 0.199
0.432CysLeu: 0.432 ± 0.145
0.288CysMet: 0.288 ± 0.131
0.576CysAsn: 0.576 ± 0.196
0.504CysPro: 0.504 ± 0.233
0.432CysGln: 0.432 ± 0.159
0.432CysArg: 0.432 ± 0.188
0.864CysSer: 0.864 ± 0.261
0.504CysThr: 0.504 ± 0.183
0.432CysVal: 0.432 ± 0.233
0.216CysTrp: 0.216 ± 0.112
0.288CysTyr: 0.288 ± 0.138
0.0CysXaa: 0.0 ± 0.0
Asp
5.544AspAla: 5.544 ± 0.474
0.288AspCys: 0.288 ± 0.151
3.168AspAsp: 3.168 ± 0.456
4.104AspGlu: 4.104 ± 0.702
2.088AspPhe: 2.088 ± 0.375
5.256AspGly: 5.256 ± 0.671
1.728AspHis: 1.728 ± 0.427
3.456AspIle: 3.456 ± 0.384
2.808AspLys: 2.808 ± 0.375
6.048AspLeu: 6.048 ± 0.637
1.584AspMet: 1.584 ± 0.309
1.944AspAsn: 1.944 ± 0.32
3.672AspPro: 3.672 ± 0.609
1.368AspGln: 1.368 ± 0.345
3.672AspArg: 3.672 ± 0.533
3.024AspSer: 3.024 ± 0.477
3.6AspThr: 3.6 ± 0.568
3.096AspVal: 3.096 ± 0.348
1.872AspTrp: 1.872 ± 0.347
2.088AspTyr: 2.088 ± 0.306
0.0AspXaa: 0.0 ± 0.0
Glu
6.623GluAla: 6.623 ± 0.902
0.648GluCys: 0.648 ± 0.209
4.608GluAsp: 4.608 ± 0.599
6.695GluGlu: 6.695 ± 1.105
2.664GluPhe: 2.664 ± 0.379
5.328GluGly: 5.328 ± 0.722
1.224GluHis: 1.224 ± 0.29
3.528GluIle: 3.528 ± 0.449
3.456GluLys: 3.456 ± 0.502
6.695GluLeu: 6.695 ± 0.757
2.376GluMet: 2.376 ± 0.472
2.52GluAsn: 2.52 ± 0.368
1.944GluPro: 1.944 ± 0.359
2.16GluGln: 2.16 ± 0.449
5.112GluArg: 5.112 ± 0.824
3.24GluSer: 3.24 ± 0.55
2.952GluThr: 2.952 ± 0.411
6.623GluVal: 6.623 ± 0.795
0.72GluTrp: 0.72 ± 0.202
2.304GluTyr: 2.304 ± 0.532
0.0GluXaa: 0.0 ± 0.0
Phe
1.872PheAla: 1.872 ± 0.423
0.36PheCys: 0.36 ± 0.149
2.376PheAsp: 2.376 ± 0.508
2.088PheGlu: 2.088 ± 0.399
1.296PhePhe: 1.296 ± 0.351
4.392PheGly: 4.392 ± 0.542
0.576PheHis: 0.576 ± 0.189
2.808PheIle: 2.808 ± 0.502
1.584PheLys: 1.584 ± 0.335
3.888PheLeu: 3.888 ± 0.455
1.08PheMet: 1.08 ± 0.242
1.512PheAsn: 1.512 ± 0.347
1.728PhePro: 1.728 ± 0.359
1.512PheGln: 1.512 ± 0.348
1.584PheArg: 1.584 ± 0.364
2.448PheSer: 2.448 ± 0.387
2.16PheThr: 2.16 ± 0.322
2.592PheVal: 2.592 ± 0.43
0.864PheTrp: 0.864 ± 0.291
1.224PheTyr: 1.224 ± 0.336
0.0PheXaa: 0.0 ± 0.0
Gly
7.559GlyAla: 7.559 ± 1.335
1.08GlyCys: 1.08 ± 0.258
5.328GlyAsp: 5.328 ± 0.616
4.968GlyGlu: 4.968 ± 0.698
2.88GlyPhe: 2.88 ± 0.503
6.911GlyGly: 6.911 ± 1.292
1.944GlyHis: 1.944 ± 0.392
4.824GlyIle: 4.824 ± 0.469
5.184GlyLys: 5.184 ± 0.613
5.688GlyLeu: 5.688 ± 0.61
2.736GlyMet: 2.736 ± 0.449
3.456GlyAsn: 3.456 ± 0.641
2.736GlyPro: 2.736 ± 0.416
3.168GlyGln: 3.168 ± 0.513
4.392GlyArg: 4.392 ± 0.551
5.832GlySer: 5.832 ± 0.897
5.472GlyThr: 5.472 ± 0.655
5.976GlyVal: 5.976 ± 0.562
1.368GlyTrp: 1.368 ± 0.303
2.52GlyTyr: 2.52 ± 0.407
0.0GlyXaa: 0.0 ± 0.0
His
1.44HisAla: 1.44 ± 0.452
0.36HisCys: 0.36 ± 0.168
0.864HisAsp: 0.864 ± 0.256
1.152HisGlu: 1.152 ± 0.424
1.224HisPhe: 1.224 ± 0.311
1.728HisGly: 1.728 ± 0.391
0.288HisHis: 0.288 ± 0.137
1.368HisIle: 1.368 ± 0.32
1.152HisLys: 1.152 ± 0.328
3.024HisLeu: 3.024 ± 0.504
0.432HisMet: 0.432 ± 0.159
0.792HisAsn: 0.792 ± 0.227
0.576HisPro: 0.576 ± 0.178
0.648HisGln: 0.648 ± 0.199
1.512HisArg: 1.512 ± 0.294
1.152HisSer: 1.152 ± 0.31
0.792HisThr: 0.792 ± 0.259
1.296HisVal: 1.296 ± 0.268
0.36HisTrp: 0.36 ± 0.226
0.792HisTyr: 0.792 ± 0.296
0.0HisXaa: 0.0 ± 0.0
Ile
4.464IleAla: 4.464 ± 0.516
0.648IleCys: 0.648 ± 0.235
2.952IleAsp: 2.952 ± 0.503
3.168IleGlu: 3.168 ± 0.438
2.016IlePhe: 2.016 ± 0.398
5.04IleGly: 5.04 ± 0.53
1.008IleHis: 1.008 ± 0.303
3.024IleIle: 3.024 ± 0.479
3.384IleLys: 3.384 ± 0.558
4.032IleLeu: 4.032 ± 0.5
0.936IleMet: 0.936 ± 0.274
2.088IleAsn: 2.088 ± 0.385
3.456IlePro: 3.456 ± 0.531
2.376IleGln: 2.376 ± 0.452
3.168IleArg: 3.168 ± 0.525
2.52IleSer: 2.52 ± 0.384
2.808IleThr: 2.808 ± 0.501
2.808IleVal: 2.808 ± 0.502
0.504IleTrp: 0.504 ± 0.185
2.16IleTyr: 2.16 ± 0.349
0.0IleXaa: 0.0 ± 0.0
Lys
5.472LysAla: 5.472 ± 0.742
0.432LysCys: 0.432 ± 0.147
3.888LysAsp: 3.888 ± 0.518
3.96LysGlu: 3.96 ± 0.46
2.088LysPhe: 2.088 ± 0.354
3.96LysGly: 3.96 ± 0.662
1.584LysHis: 1.584 ± 0.274
3.312LysIle: 3.312 ± 0.425
3.384LysLys: 3.384 ± 0.472
4.248LysLeu: 4.248 ± 0.561
1.08LysMet: 1.08 ± 0.255
2.16LysAsn: 2.16 ± 0.391
2.52LysPro: 2.52 ± 0.568
1.296LysGln: 1.296 ± 0.293
3.384LysArg: 3.384 ± 0.645
3.096LysSer: 3.096 ± 0.413
2.736LysThr: 2.736 ± 0.482
4.536LysVal: 4.536 ± 0.675
1.296LysTrp: 1.296 ± 0.323
2.376LysTyr: 2.376 ± 0.32
0.0LysXaa: 0.0 ± 0.0
Leu
6.983LeuAla: 6.983 ± 0.633
0.936LeuCys: 0.936 ± 0.234
6.048LeuAsp: 6.048 ± 0.612
6.623LeuGlu: 6.623 ± 0.89
3.096LeuPhe: 3.096 ± 0.503
7.127LeuGly: 7.127 ± 0.971
1.08LeuHis: 1.08 ± 0.285
2.808LeuIle: 2.808 ± 0.383
6.263LeuLys: 6.263 ± 0.582
7.127LeuLeu: 7.127 ± 0.78
2.088LeuMet: 2.088 ± 0.474
3.24LeuAsn: 3.24 ± 0.466
3.24LeuPro: 3.24 ± 0.523
3.312LeuGln: 3.312 ± 0.548
6.263LeuArg: 6.263 ± 0.734
4.104LeuSer: 4.104 ± 0.638
3.672LeuThr: 3.672 ± 0.515
4.968LeuVal: 4.968 ± 0.586
0.936LeuTrp: 0.936 ± 0.277
2.088LeuTyr: 2.088 ± 0.466
0.0LeuXaa: 0.0 ± 0.0
Met
3.312MetAla: 3.312 ± 0.514
0.216MetCys: 0.216 ± 0.117
1.656MetAsp: 1.656 ± 0.355
1.296MetGlu: 1.296 ± 0.277
1.08MetPhe: 1.08 ± 0.27
2.16MetGly: 2.16 ± 0.517
0.504MetHis: 0.504 ± 0.181
1.584MetIle: 1.584 ± 0.305
1.8MetLys: 1.8 ± 0.358
1.728MetLeu: 1.728 ± 0.315
0.504MetMet: 0.504 ± 0.148
1.224MetAsn: 1.224 ± 0.297
1.152MetPro: 1.152 ± 0.239
1.224MetGln: 1.224 ± 0.288
1.368MetArg: 1.368 ± 0.266
2.376MetSer: 2.376 ± 0.452
1.728MetThr: 1.728 ± 0.417
1.368MetVal: 1.368 ± 0.261
0.504MetTrp: 0.504 ± 0.179
0.648MetTyr: 0.648 ± 0.194
0.0MetXaa: 0.0 ± 0.0
Asn
3.672AsnAla: 3.672 ± 0.583
0.72AsnCys: 0.72 ± 0.254
2.232AsnAsp: 2.232 ± 0.386
3.024AsnGlu: 3.024 ± 0.41
1.656AsnPhe: 1.656 ± 0.426
3.168AsnGly: 3.168 ± 0.63
1.08AsnHis: 1.08 ± 0.309
2.88AsnIle: 2.88 ± 0.511
1.44AsnLys: 1.44 ± 0.277
3.024AsnLeu: 3.024 ± 0.393
1.008AsnMet: 1.008 ± 0.282
1.512AsnAsn: 1.512 ± 0.382
2.592AsnPro: 2.592 ± 0.507
2.016AsnGln: 2.016 ± 0.406
2.808AsnArg: 2.808 ± 0.359
1.8AsnSer: 1.8 ± 0.363
2.592AsnThr: 2.592 ± 0.462
2.52AsnVal: 2.52 ± 0.413
0.72AsnTrp: 0.72 ± 0.174
1.296AsnTyr: 1.296 ± 0.312
0.0AsnXaa: 0.0 ± 0.0
Pro
3.312ProAla: 3.312 ± 0.541
0.36ProCys: 0.36 ± 0.153
3.168ProAsp: 3.168 ± 0.452
4.68ProGlu: 4.68 ± 0.576
2.52ProPhe: 2.52 ± 0.372
3.528ProGly: 3.528 ± 0.48
0.864ProHis: 0.864 ± 0.244
1.44ProIle: 1.44 ± 0.33
2.664ProLys: 2.664 ± 0.399
2.376ProLeu: 2.376 ± 0.374
0.792ProMet: 0.792 ± 0.233
1.872ProAsn: 1.872 ± 0.489
1.584ProPro: 1.584 ± 0.503
2.016ProGln: 2.016 ± 0.434
1.296ProArg: 1.296 ± 0.32
2.016ProSer: 2.016 ± 0.484
2.736ProThr: 2.736 ± 0.429
2.736ProVal: 2.736 ± 0.45
0.864ProTrp: 0.864 ± 0.216
1.728ProTyr: 1.728 ± 0.387
0.0ProXaa: 0.0 ± 0.0
Gln
4.176GlnAla: 4.176 ± 0.73
0.216GlnCys: 0.216 ± 0.148
2.088GlnAsp: 2.088 ± 0.393
3.24GlnGlu: 3.24 ± 0.45
1.224GlnPhe: 1.224 ± 0.255
2.952GlnGly: 2.952 ± 0.549
0.36GlnHis: 0.36 ± 0.151
1.44GlnIle: 1.44 ± 0.342
1.8GlnLys: 1.8 ± 0.404
3.024GlnLeu: 3.024 ± 0.506
1.872GlnMet: 1.872 ± 0.307
1.944GlnAsn: 1.944 ± 0.495
1.08GlnPro: 1.08 ± 0.278
2.232GlnGln: 2.232 ± 0.504
3.456GlnArg: 3.456 ± 0.617
2.088GlnSer: 2.088 ± 0.389
1.872GlnThr: 1.872 ± 0.502
3.312GlnVal: 3.312 ± 0.483
0.504GlnTrp: 0.504 ± 0.182
0.936GlnTyr: 0.936 ± 0.277
0.0GlnXaa: 0.0 ± 0.0
Arg
4.608ArgAla: 4.608 ± 0.691
0.576ArgCys: 0.576 ± 0.214
3.456ArgAsp: 3.456 ± 0.523
3.384ArgGlu: 3.384 ± 0.478
1.944ArgPhe: 1.944 ± 0.401
4.536ArgGly: 4.536 ± 0.571
1.152ArgHis: 1.152 ± 0.258
3.744ArgIle: 3.744 ± 0.481
3.672ArgLys: 3.672 ± 0.648
5.184ArgLeu: 5.184 ± 0.649
1.944ArgMet: 1.944 ± 0.371
3.024ArgAsn: 3.024 ± 0.418
2.016ArgPro: 2.016 ± 0.419
2.736ArgGln: 2.736 ± 0.423
3.888ArgArg: 3.888 ± 0.51
3.24ArgSer: 3.24 ± 0.462
2.16ArgThr: 2.16 ± 0.451
5.4ArgVal: 5.4 ± 0.703
0.936ArgTrp: 0.936 ± 0.286
1.296ArgTyr: 1.296 ± 0.304
0.0ArgXaa: 0.0 ± 0.0
Ser
4.464SerAla: 4.464 ± 0.621
0.576SerCys: 0.576 ± 0.235
3.096SerAsp: 3.096 ± 0.528
4.536SerGlu: 4.536 ± 0.581
1.944SerPhe: 1.944 ± 0.452
5.76SerGly: 5.76 ± 0.658
1.512SerHis: 1.512 ± 0.331
3.528SerIle: 3.528 ± 0.422
3.384SerLys: 3.384 ± 0.539
4.248SerLeu: 4.248 ± 0.634
1.224SerMet: 1.224 ± 0.329
2.808SerAsn: 2.808 ± 0.414
2.376SerPro: 2.376 ± 0.454
2.232SerGln: 2.232 ± 0.402
2.376SerArg: 2.376 ± 0.399
3.672SerSer: 3.672 ± 0.624
2.304SerThr: 2.304 ± 0.443
4.032SerVal: 4.032 ± 0.596
1.224SerTrp: 1.224 ± 0.449
1.944SerTyr: 1.944 ± 0.465
0.0SerXaa: 0.0 ± 0.0
Thr
4.248ThrAla: 4.248 ± 0.575
0.432ThrCys: 0.432 ± 0.166
2.088ThrAsp: 2.088 ± 0.422
3.024ThrGlu: 3.024 ± 0.393
2.16ThrPhe: 2.16 ± 0.502
4.752ThrGly: 4.752 ± 0.704
1.44ThrHis: 1.44 ± 0.327
2.664ThrIle: 2.664 ± 0.554
2.376ThrLys: 2.376 ± 0.476
4.752ThrLeu: 4.752 ± 0.544
1.512ThrMet: 1.512 ± 0.306
2.016ThrAsn: 2.016 ± 0.407
3.672ThrPro: 3.672 ± 0.727
2.664ThrGln: 2.664 ± 0.425
2.952ThrArg: 2.952 ± 0.466
2.592ThrSer: 2.592 ± 0.422
2.592ThrThr: 2.592 ± 0.569
3.456ThrVal: 3.456 ± 0.466
0.864ThrTrp: 0.864 ± 0.335
1.224ThrTyr: 1.224 ± 0.256
0.0ThrXaa: 0.0 ± 0.0
Val
4.968ValAla: 4.968 ± 0.698
0.792ValCys: 0.792 ± 0.235
4.608ValAsp: 4.608 ± 0.553
5.04ValGlu: 5.04 ± 0.638
3.168ValPhe: 3.168 ± 0.508
5.328ValGly: 5.328 ± 0.581
2.088ValHis: 2.088 ± 0.432
2.736ValIle: 2.736 ± 0.565
3.672ValLys: 3.672 ± 0.348
4.968ValLeu: 4.968 ± 0.667
2.016ValMet: 2.016 ± 0.416
3.528ValAsn: 3.528 ± 0.423
2.16ValPro: 2.16 ± 0.385
2.016ValGln: 2.016 ± 0.357
4.536ValArg: 4.536 ± 0.492
5.04ValSer: 5.04 ± 0.696
4.68ValThr: 4.68 ± 0.573
5.832ValVal: 5.832 ± 0.763
0.936ValTrp: 0.936 ± 0.261
3.024ValTyr: 3.024 ± 0.5
0.0ValXaa: 0.0 ± 0.0
Trp
1.08TrpAla: 1.08 ± 0.221
0.288TrpCys: 0.288 ± 0.134
1.296TrpAsp: 1.296 ± 0.327
1.152TrpGlu: 1.152 ± 0.273
0.576TrpPhe: 0.576 ± 0.2
1.296TrpGly: 1.296 ± 0.303
0.0TrpHis: 0.0 ± 0.0
0.504TrpIle: 0.504 ± 0.198
1.44TrpLys: 1.44 ± 0.323
1.152TrpLeu: 1.152 ± 0.282
0.504TrpMet: 0.504 ± 0.15
0.432TrpAsn: 0.432 ± 0.151
0.72TrpPro: 0.72 ± 0.272
0.504TrpGln: 0.504 ± 0.202
1.224TrpArg: 1.224 ± 0.292
0.864TrpSer: 0.864 ± 0.359
1.08TrpThr: 1.08 ± 0.251
1.008TrpVal: 1.008 ± 0.3
0.36TrpTrp: 0.36 ± 0.18
0.72TrpTyr: 0.72 ± 0.21
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.656TyrAla: 1.656 ± 0.337
0.432TyrCys: 0.432 ± 0.15
2.808TyrAsp: 2.808 ± 0.441
1.872TyrGlu: 1.872 ± 0.332
1.08TyrPhe: 1.08 ± 0.214
2.592TyrGly: 2.592 ± 0.36
0.936TyrHis: 0.936 ± 0.274
1.656TyrIle: 1.656 ± 0.355
1.728TyrLys: 1.728 ± 0.317
2.52TyrLeu: 2.52 ± 0.439
0.72TyrMet: 0.72 ± 0.281
1.656TyrAsn: 1.656 ± 0.341
1.656TyrPro: 1.656 ± 0.406
1.008TyrGln: 1.008 ± 0.239
2.232TyrArg: 2.232 ± 0.336
2.232TyrSer: 2.232 ± 0.4
1.656TyrThr: 1.656 ± 0.386
2.808TyrVal: 2.808 ± 0.436
0.288TyrTrp: 0.288 ± 0.172
1.152TyrTyr: 1.152 ± 0.353
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (13891 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski