Amino acid dipeptide frequency for Pseudomonas virus PaP3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
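As a rough illustration of what such a calculation might look like, the Python sketch below counts overlapping adjacent residue pairs within each protein and converts the counts to per mille. The function names, the one-letter alphabet, the toy sequences and the choice to normalize by the total number of counted pairs are all assumptions made for this example; the database may normalize differently.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs inside each protein."""
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # zip() pairs residue i with residue i+1, so pairs never span two proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    return counts


def dipeptide_per_mille(proteins):
    """Per mille frequency of all 441 dipeptides.

    Assumption: the denominator is the total number of counted pairs.
    """
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not taken from the PaP3 proteome.
toy = ["MAAGL", "AAK"]
freqs = dipeptide_per_mille(toy)
print(round(freqs["AA"], 1))
```

Dipeptides that never occur still get an entry with frequency 0, which mirrors the all-zero Xaa rows in the table.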

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
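The comparison against the uniform expectation can be sketched in a few lines of Python. The threshold used here (exactly 1/400 of all dipeptides) and the helper name classify are assumptions for illustration; the page does not state the precise rule behind the red and blue marking. The three example values are taken from the table.

```python
# Uniform expectation over the 20 x 20 standard dipeptides, in per mille.
N_STANDARD = 20
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD**2  # 2.5 per mille, i.e. 0.25 %


def classify(per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 9.215, "CysCys": 0.142, "AlaMet": 2.552}
for name, value in examples.items():
    ratio = value / EXPECTED_PER_MILLE
    print(f"{name}: {classify(value)} ({ratio:.2f}x expected)")
```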

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
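As a worked conversion, using the AlaAla value from the table:

```latex
9.215\ \text{per mille} = 9.215 \times 10^{-3} = 0.009215 = 0.9215\,\%
```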

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.215 ± 1.743
AlaCys: 1.134 ± 0.297
AlaAsp: 4.537 ± 0.505
AlaGlu: 5.175 ± 0.854
AlaPhe: 2.765 ± 0.536
AlaGly: 6.734 ± 1.037
AlaHis: 1.347 ± 0.278
AlaIle: 4.466 ± 0.596
AlaLys: 4.82 ± 0.644
AlaLeu: 8.152 ± 0.781
AlaMet: 2.552 ± 0.49
AlaAsn: 3.261 ± 0.444
AlaPro: 2.906 ± 0.6
AlaGln: 4.182 ± 0.849
AlaArg: 3.403 ± 0.482
AlaSer: 5.529 ± 0.777
AlaThr: 4.182 ± 0.67
AlaVal: 6.876 ± 0.82
AlaTrp: 0.922 ± 0.312
AlaTyr: 2.623 ± 0.472
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.851 ± 0.208
CysCys: 0.142 ± 0.096
CysAsp: 0.425 ± 0.177
CysGlu: 0.638 ± 0.189
CysPhe: 0.142 ± 0.103
CysGly: 1.276 ± 0.327
CysHis: 0.496 ± 0.152
CysIle: 0.496 ± 0.199
CysLys: 0.284 ± 0.138
CysLeu: 0.78 ± 0.215
CysMet: 0.213 ± 0.133
CysAsn: 0.709 ± 0.228
CysPro: 0.496 ± 0.181
CysGln: 0.354 ± 0.153
CysArg: 0.851 ± 0.215
CysSer: 0.425 ± 0.15
CysThr: 0.354 ± 0.158
CysVal: 0.709 ± 0.256
CysTrp: 0.071 ± 0.07
CysTyr: 0.78 ± 0.268
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.529 ± 0.554
AspCys: 0.425 ± 0.202
AspAsp: 3.757 ± 0.541
AspGlu: 4.608 ± 0.768
AspPhe: 2.41 ± 0.45
AspGly: 4.395 ± 0.637
AspHis: 1.772 ± 0.48
AspIle: 4.324 ± 0.498
AspLys: 2.552 ± 0.534
AspLeu: 6.025 ± 0.502
AspMet: 1.843 ± 0.273
AspAsn: 2.056 ± 0.383
AspPro: 3.686 ± 0.587
AspGln: 1.56 ± 0.321
AspArg: 3.615 ± 0.476
AspSer: 2.906 ± 0.535
AspThr: 3.261 ± 0.461
AspVal: 2.835 ± 0.45
AspTrp: 1.843 ± 0.301
AspTyr: 1.843 ± 0.357
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.522 ± 0.954
GluCys: 0.567 ± 0.159
GluAsp: 5.6 ± 0.759
GluGlu: 6.38 ± 1.054
GluPhe: 2.056 ± 0.364
GluGly: 6.167 ± 0.686
GluHis: 1.134 ± 0.268
GluIle: 2.41 ± 0.383
GluLys: 3.261 ± 0.507
GluLeu: 6.38 ± 0.697
GluMet: 2.268 ± 0.408
GluAsn: 2.552 ± 0.31
GluPro: 1.914 ± 0.383
GluGln: 2.41 ± 0.505
GluArg: 5.387 ± 0.748
GluSer: 2.835 ± 0.521
GluThr: 3.473 ± 0.542
GluVal: 6.805 ± 0.752
GluTrp: 0.78 ± 0.219
GluTyr: 1.914 ± 0.39
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.339 ± 0.417
PheCys: 0.425 ± 0.172
PheAsp: 2.339 ± 0.46
PheGlu: 1.914 ± 0.395
PhePhe: 1.772 ± 0.365
PheGly: 4.111 ± 0.56
PheHis: 0.922 ± 0.249
PheIle: 2.339 ± 0.41
PheLys: 2.127 ± 0.368
PheLeu: 3.757 ± 0.473
PheMet: 0.992 ± 0.198
PheAsn: 1.418 ± 0.305
PhePro: 1.134 ± 0.345
PheGln: 1.418 ± 0.358
PheArg: 1.63 ± 0.363
PheSer: 2.623 ± 0.318
PheThr: 2.339 ± 0.331
PheVal: 3.119 ± 0.479
PheTrp: 0.851 ± 0.278
PheTyr: 1.489 ± 0.36
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.301 ± 1.196
GlyCys: 0.922 ± 0.205
GlyAsp: 4.962 ± 0.658
GlyGlu: 4.82 ± 0.623
GlyPhe: 2.906 ± 0.522
GlyGly: 7.16 ± 1.286
GlyHis: 2.481 ± 0.459
GlyIle: 5.175 ± 0.571
GlyLys: 4.537 ± 0.595
GlyLeu: 6.522 ± 0.711
GlyMet: 2.552 ± 0.41
GlyAsn: 4.608 ± 0.621
GlyPro: 2.765 ± 0.48
GlyGln: 3.615 ± 0.551
GlyArg: 4.111 ± 0.577
GlySer: 6.238 ± 0.917
GlyThr: 4.253 ± 0.544
GlyVal: 5.529 ± 0.604
GlyTrp: 1.134 ± 0.32
GlyTyr: 2.056 ± 0.344
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.205 ± 0.286
HisCys: 0.354 ± 0.153
HisAsp: 0.78 ± 0.306
HisGlu: 1.701 ± 0.422
HisPhe: 1.063 ± 0.285
HisGly: 1.347 ± 0.276
HisHis: 0.496 ± 0.181
HisIle: 1.276 ± 0.329
HisLys: 1.205 ± 0.287
HisLeu: 2.552 ± 0.481
HisMet: 0.567 ± 0.192
HisAsn: 0.638 ± 0.231
HisPro: 0.851 ± 0.218
HisGln: 0.638 ± 0.202
HisArg: 1.276 ± 0.306
HisSer: 0.851 ± 0.226
HisThr: 1.205 ± 0.323
HisVal: 1.205 ± 0.315
HisTrp: 0.709 ± 0.269
HisTyr: 0.567 ± 0.233
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.828 ± 0.473
IleCys: 0.425 ± 0.17
IleAsp: 2.481 ± 0.433
IleGlu: 3.119 ± 0.502
IlePhe: 2.339 ± 0.468
IleGly: 4.962 ± 0.45
IleHis: 1.276 ± 0.296
IleIle: 2.552 ± 0.41
IleLys: 3.119 ± 0.484
IleLeu: 3.828 ± 0.497
IleMet: 0.78 ± 0.21
IleAsn: 1.985 ± 0.395
IlePro: 3.615 ± 0.566
IleGln: 2.552 ± 0.45
IleArg: 3.403 ± 0.599
IleSer: 2.481 ± 0.47
IleThr: 2.552 ± 0.416
IleVal: 3.403 ± 0.618
IleTrp: 0.496 ± 0.176
IleTyr: 2.339 ± 0.431
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.387 ± 0.656
LysCys: 0.354 ± 0.169
LysAsp: 4.395 ± 0.509
LysGlu: 3.97 ± 0.467
LysPhe: 1.914 ± 0.462
LysGly: 3.757 ± 0.585
LysHis: 1.347 ± 0.305
LysIle: 1.985 ± 0.399
LysLys: 3.048 ± 0.537
LysLeu: 4.182 ± 0.66
LysMet: 1.347 ± 0.354
LysAsn: 1.985 ± 0.41
LysPro: 2.694 ± 0.496
LysGln: 1.843 ± 0.318
LysArg: 3.119 ± 0.527
LysSer: 2.835 ± 0.427
LysThr: 2.623 ± 0.39
LysVal: 4.111 ± 0.734
LysTrp: 1.205 ± 0.321
LysTyr: 2.339 ± 0.436
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.018 ± 0.653
LeuCys: 0.851 ± 0.224
LeuAsp: 5.387 ± 0.469
LeuGlu: 6.309 ± 0.805
LeuPhe: 3.261 ± 0.62
LeuGly: 6.734 ± 1.039
LeuHis: 1.134 ± 0.318
LeuIle: 3.544 ± 0.434
LeuLys: 5.6 ± 0.51
LeuLeu: 6.947 ± 0.614
LeuMet: 3.544 ± 0.602
LeuAsn: 3.97 ± 0.448
LeuPro: 3.757 ± 0.495
LeuGln: 3.686 ± 0.522
LeuArg: 5.954 ± 0.679
LeuSer: 5.104 ± 0.692
LeuThr: 3.899 ± 0.482
LeuVal: 5.6 ± 0.682
LeuTrp: 0.709 ± 0.217
LeuTyr: 2.41 ± 0.459
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.977 ± 0.513
MetCys: 0.213 ± 0.12
MetAsp: 1.63 ± 0.274
MetGlu: 1.772 ± 0.384
MetPhe: 0.922 ± 0.276
MetGly: 2.339 ± 0.586
MetHis: 0.354 ± 0.168
MetIle: 1.63 ± 0.333
MetLys: 1.985 ± 0.324
MetLeu: 1.985 ± 0.398
MetMet: 0.567 ± 0.231
MetAsn: 1.205 ± 0.333
MetPro: 1.134 ± 0.231
MetGln: 0.851 ± 0.236
MetArg: 1.489 ± 0.283
MetSer: 2.197 ± 0.454
MetThr: 1.914 ± 0.382
MetVal: 1.843 ± 0.481
MetTrp: 0.425 ± 0.139
MetTyr: 0.638 ± 0.194
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.835 ± 0.529
AsnCys: 0.567 ± 0.185
AsnAsp: 2.268 ± 0.304
AsnGlu: 2.977 ± 0.449
AsnPhe: 1.843 ± 0.373
AsnGly: 4.111 ± 0.59
AsnHis: 1.134 ± 0.326
AsnIle: 3.261 ± 0.496
AsnLys: 1.489 ± 0.326
AsnLeu: 3.403 ± 0.49
AsnMet: 0.922 ± 0.245
AsnAsn: 1.772 ± 0.404
AsnPro: 2.41 ± 0.485
AsnGln: 1.772 ± 0.36
AsnArg: 2.694 ± 0.416
AsnSer: 1.772 ± 0.334
AsnThr: 2.552 ± 0.493
AsnVal: 2.835 ± 0.541
AsnTrp: 0.851 ± 0.183
AsnTyr: 1.772 ± 0.393
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.977 ± 0.519
ProCys: 0.354 ± 0.156
ProAsp: 2.906 ± 0.442
ProGlu: 4.679 ± 0.701
ProPhe: 2.41 ± 0.318
ProGly: 3.048 ± 0.405
ProHis: 0.922 ± 0.251
ProIle: 1.63 ± 0.327
ProLys: 2.977 ± 0.42
ProLeu: 2.765 ± 0.406
ProMet: 0.992 ± 0.271
ProAsn: 1.914 ± 0.403
ProPro: 1.489 ± 0.482
ProGln: 1.985 ± 0.354
ProArg: 1.56 ± 0.381
ProSer: 1.985 ± 0.401
ProThr: 2.056 ± 0.416
ProVal: 2.835 ± 0.453
ProTrp: 0.851 ± 0.194
ProTyr: 1.56 ± 0.375
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.104 ± 0.669
GlnCys: 0.213 ± 0.118
GlnAsp: 2.339 ± 0.401
GlnGlu: 3.544 ± 0.451
GlnPhe: 0.992 ± 0.257
GlnGly: 3.261 ± 0.636
GlnHis: 0.425 ± 0.201
GlnIle: 1.418 ± 0.269
GlnLys: 1.701 ± 0.379
GlnLeu: 3.403 ± 0.589
GlnMet: 1.347 ± 0.375
GlnAsn: 1.63 ± 0.429
GlnPro: 0.78 ± 0.256
GlnGln: 1.63 ± 0.393
GlnArg: 2.906 ± 0.467
GlnSer: 2.127 ± 0.302
GlnThr: 1.701 ± 0.405
GlnVal: 2.481 ± 0.358
GlnTrp: 0.638 ± 0.186
GlnTyr: 1.205 ± 0.273
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.82 ± 0.697
ArgCys: 0.425 ± 0.169
ArgAsp: 3.048 ± 0.476
ArgGlu: 3.119 ± 0.597
ArgPhe: 2.41 ± 0.348
ArgGly: 4.679 ± 0.617
ArgHis: 1.205 ± 0.309
ArgIle: 3.473 ± 0.406
ArgLys: 3.473 ± 0.558
ArgLeu: 5.671 ± 0.714
ArgMet: 1.914 ± 0.41
ArgAsn: 3.19 ± 0.494
ArgPro: 2.339 ± 0.363
ArgGln: 2.552 ± 0.432
ArgArg: 3.332 ± 0.547
ArgSer: 2.623 ± 0.425
ArgThr: 2.339 ± 0.355
ArgVal: 4.749 ± 0.659
ArgTrp: 1.063 ± 0.33
ArgTyr: 1.63 ± 0.343
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.395 ± 0.626
SerCys: 0.922 ± 0.235
SerAsp: 3.261 ± 0.413
SerGlu: 4.253 ± 0.531
SerPhe: 2.127 ± 0.466
SerGly: 5.954 ± 0.561
SerHis: 0.992 ± 0.212
SerIle: 3.544 ± 0.521
SerLys: 3.048 ± 0.521
SerLeu: 3.97 ± 0.651
SerMet: 1.63 ± 0.376
SerAsn: 2.694 ± 0.457
SerPro: 2.197 ± 0.473
SerGln: 2.056 ± 0.335
SerArg: 2.765 ± 0.495
SerSer: 3.757 ± 0.557
SerThr: 2.056 ± 0.443
SerVal: 3.828 ± 0.589
SerTrp: 1.063 ± 0.314
SerTyr: 1.914 ± 0.443
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.253 ± 0.739
ThrCys: 0.284 ± 0.119
ThrAsp: 2.481 ± 0.412
ThrGlu: 3.332 ± 0.482
ThrPhe: 2.481 ± 0.354
ThrGly: 3.757 ± 0.582
ThrHis: 0.78 ± 0.225
ThrIle: 2.197 ± 0.445
ThrLys: 2.339 ± 0.351
ThrLeu: 5.458 ± 0.594
ThrMet: 1.276 ± 0.263
ThrAsn: 1.985 ± 0.353
ThrPro: 3.19 ± 0.565
ThrGln: 2.197 ± 0.492
ThrArg: 2.977 ± 0.375
ThrSer: 3.332 ± 0.59
ThrThr: 2.694 ± 0.454
ThrVal: 3.544 ± 0.528
ThrTrp: 0.851 ± 0.316
ThrTyr: 1.418 ± 0.34
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.742 ± 0.666
ValCys: 1.134 ± 0.335
ValAsp: 4.537 ± 0.556
ValGlu: 4.749 ± 0.572
ValPhe: 3.332 ± 0.572
ValGly: 5.742 ± 0.598
ValHis: 1.63 ± 0.384
ValIle: 3.19 ± 0.501
ValLys: 3.757 ± 0.435
ValLeu: 5.529 ± 0.766
ValMet: 1.772 ± 0.359
ValAsn: 3.544 ± 0.574
ValPro: 2.127 ± 0.371
ValGln: 1.914 ± 0.292
ValArg: 4.041 ± 0.509
ValSer: 4.324 ± 0.515
ValThr: 4.395 ± 0.462
ValVal: 5.671 ± 0.568
ValTrp: 0.992 ± 0.26
ValTyr: 2.906 ± 0.484
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.205 ± 0.242
TrpCys: 0.284 ± 0.139
TrpAsp: 1.347 ± 0.285
TrpGlu: 1.347 ± 0.277
TrpPhe: 0.638 ± 0.231
TrpGly: 1.489 ± 0.328
TrpHis: 0.0 ± 0.0
TrpIle: 0.709 ± 0.229
TrpLys: 1.56 ± 0.275
TrpLeu: 1.205 ± 0.262
TrpMet: 0.496 ± 0.14
TrpAsn: 0.638 ± 0.196
TrpPro: 0.567 ± 0.208
TrpGln: 0.354 ± 0.159
TrpArg: 1.347 ± 0.259
TrpSer: 0.78 ± 0.334
TrpThr: 0.851 ± 0.266
TrpVal: 0.567 ± 0.203
TrpTrp: 0.284 ± 0.14
TrpTyr: 0.567 ± 0.2
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.276 ± 0.289
TyrCys: 0.496 ± 0.186
TyrAsp: 2.765 ± 0.405
TyrGlu: 2.41 ± 0.444
TyrPhe: 1.276 ± 0.241
TyrGly: 2.623 ± 0.351
TyrHis: 0.638 ± 0.199
TyrIle: 1.914 ± 0.375
TyrLys: 1.63 ± 0.338
TyrLeu: 2.906 ± 0.567
TyrMet: 0.425 ± 0.19
TyrAsn: 1.489 ± 0.348
TyrPro: 1.843 ± 0.35
TyrGln: 1.276 ± 0.245
TyrArg: 2.197 ± 0.295
TyrSer: 1.701 ± 0.363
TyrThr: 2.127 ± 0.37
TyrVal: 2.481 ± 0.538
TyrTrp: 0.496 ± 0.193
TyrTyr: 1.489 ± 0.313
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (14108 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
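A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is taken as the error. Reporting the standard deviation, normalizing by the number of counted pairs, the function names and the toy sequences are assumptions for this example; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide (assumed denominator: all counted pairs)."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code, not the PaP3 sequences.
toy = ["MAAGLK", "MKAAV", "MGLLAA", "MSTAK"]
print(bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so pairs are never formed across protein boundaries in a replicate.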

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski