Amino acid dipepetide frequency for Pseudoalteromonas phage H103

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.974AlaAla: 7.974 ± 1.249
1.655AlaCys: 1.655 ± 0.299
4.513AlaAsp: 4.513 ± 0.631
4.438AlaGlu: 4.438 ± 0.636
2.633AlaPhe: 2.633 ± 0.404
4.965AlaGly: 4.965 ± 0.71
1.73AlaHis: 1.73 ± 0.442
4.664AlaIle: 4.664 ± 0.583
5.867AlaLys: 5.867 ± 0.589
6.695AlaLeu: 6.695 ± 0.848
2.708AlaMet: 2.708 ± 0.582
5.491AlaAsn: 5.491 ± 0.647
2.934AlaPro: 2.934 ± 0.509
3.084AlaGln: 3.084 ± 0.508
2.633AlaArg: 2.633 ± 0.511
4.889AlaSer: 4.889 ± 0.646
5.642AlaThr: 5.642 ± 0.806
5.642AlaVal: 5.642 ± 0.689
0.752AlaTrp: 0.752 ± 0.215
2.934AlaTyr: 2.934 ± 0.483
0.0AlaXaa: 0.0 ± 0.0
Cys
1.128CysAla: 1.128 ± 0.256
0.527CysCys: 0.527 ± 0.239
0.752CysAsp: 0.752 ± 0.232
0.752CysGlu: 0.752 ± 0.216
0.677CysPhe: 0.677 ± 0.21
1.128CysGly: 1.128 ± 0.263
0.301CysHis: 0.301 ± 0.165
0.903CysIle: 0.903 ± 0.266
0.827CysLys: 0.827 ± 0.257
1.504CysLeu: 1.504 ± 0.399
0.0CysMet: 0.0 ± 0.0
1.053CysAsn: 1.053 ± 0.276
0.677CysPro: 0.677 ± 0.315
0.677CysGln: 0.677 ± 0.209
0.226CysArg: 0.226 ± 0.118
0.978CysSer: 0.978 ± 0.242
0.827CysThr: 0.827 ± 0.257
0.677CysVal: 0.677 ± 0.232
0.075CysTrp: 0.075 ± 0.08
0.752CysTyr: 0.752 ± 0.249
0.0CysXaa: 0.0 ± 0.0
Asp
4.513AspAla: 4.513 ± 0.567
1.354AspCys: 1.354 ± 0.347
4.589AspAsp: 4.589 ± 0.667
4.814AspGlu: 4.814 ± 0.795
3.084AspPhe: 3.084 ± 0.575
5.642AspGly: 5.642 ± 0.689
0.752AspHis: 0.752 ± 0.237
4.513AspIle: 4.513 ± 0.585
3.686AspLys: 3.686 ± 0.551
5.04AspLeu: 5.04 ± 0.652
1.429AspMet: 1.429 ± 0.312
2.482AspAsn: 2.482 ± 0.505
1.881AspPro: 1.881 ± 0.363
2.106AspGln: 2.106 ± 0.318
1.429AspArg: 1.429 ± 0.278
4.814AspSer: 4.814 ± 0.552
2.934AspThr: 2.934 ± 0.432
4.212AspVal: 4.212 ± 0.591
0.827AspTrp: 0.827 ± 0.293
3.385AspTyr: 3.385 ± 0.496
0.0AspXaa: 0.0 ± 0.0
Glu
5.943GluAla: 5.943 ± 0.539
0.827GluCys: 0.827 ± 0.234
2.558GluAsp: 2.558 ± 0.545
3.987GluGlu: 3.987 ± 0.681
2.558GluPhe: 2.558 ± 0.451
3.987GluGly: 3.987 ± 0.516
1.429GluHis: 1.429 ± 0.316
4.814GluIle: 4.814 ± 0.523
4.062GluLys: 4.062 ± 0.518
6.92GluLeu: 6.92 ± 0.719
1.504GluMet: 1.504 ± 0.313
2.858GluAsn: 2.858 ± 0.423
1.655GluPro: 1.655 ± 0.427
3.385GluGln: 3.385 ± 0.542
3.611GluArg: 3.611 ± 0.562
4.513GluSer: 4.513 ± 0.567
2.482GluThr: 2.482 ± 0.408
4.739GluVal: 4.739 ± 0.622
1.053GluTrp: 1.053 ± 0.295
3.836GluTyr: 3.836 ± 0.599
0.0GluXaa: 0.0 ± 0.0
Phe
2.407PheAla: 2.407 ± 0.453
1.053PheCys: 1.053 ± 0.284
3.535PheAsp: 3.535 ± 0.582
2.558PheGlu: 2.558 ± 0.499
0.827PhePhe: 0.827 ± 0.193
2.332PheGly: 2.332 ± 0.359
0.827PheHis: 0.827 ± 0.265
2.783PheIle: 2.783 ± 0.412
2.934PheLys: 2.934 ± 0.497
2.181PheLeu: 2.181 ± 0.414
0.978PheMet: 0.978 ± 0.307
3.535PheAsn: 3.535 ± 0.567
0.978PhePro: 0.978 ± 0.329
0.903PheGln: 0.903 ± 0.21
0.602PheArg: 0.602 ± 0.189
2.783PheSer: 2.783 ± 0.438
2.633PheThr: 2.633 ± 0.56
2.407PheVal: 2.407 ± 0.33
0.451PheTrp: 0.451 ± 0.216
1.956PheTyr: 1.956 ± 0.385
0.0PheXaa: 0.0 ± 0.0
Gly
6.018GlyAla: 6.018 ± 0.907
0.527GlyCys: 0.527 ± 0.196
3.987GlyAsp: 3.987 ± 0.59
5.491GlyGlu: 5.491 ± 0.673
3.235GlyPhe: 3.235 ± 0.443
5.867GlyGly: 5.867 ± 0.89
0.602GlyHis: 0.602 ± 0.218
3.987GlyIle: 3.987 ± 0.525
4.363GlyLys: 4.363 ± 0.662
6.394GlyLeu: 6.394 ± 0.715
1.956GlyMet: 1.956 ± 0.413
3.611GlyAsn: 3.611 ± 0.547
1.429GlyPro: 1.429 ± 0.361
2.482GlyGln: 2.482 ± 0.515
2.708GlyArg: 2.708 ± 0.385
4.889GlySer: 4.889 ± 0.885
4.814GlyThr: 4.814 ± 0.96
5.566GlyVal: 5.566 ± 0.619
0.978GlyTrp: 0.978 ± 0.38
3.084GlyTyr: 3.084 ± 0.511
0.0GlyXaa: 0.0 ± 0.0
His
1.279HisAla: 1.279 ± 0.279
0.752HisCys: 0.752 ± 0.273
0.903HisAsp: 0.903 ± 0.304
1.128HisGlu: 1.128 ± 0.318
0.827HisPhe: 0.827 ± 0.276
1.655HisGly: 1.655 ± 0.38
0.15HisHis: 0.15 ± 0.1
1.053HisIle: 1.053 ± 0.256
1.128HisLys: 1.128 ± 0.315
1.279HisLeu: 1.279 ± 0.3
0.15HisMet: 0.15 ± 0.105
0.602HisAsn: 0.602 ± 0.168
0.602HisPro: 0.602 ± 0.198
0.451HisGln: 0.451 ± 0.166
0.752HisArg: 0.752 ± 0.245
1.128HisSer: 1.128 ± 0.298
0.376HisThr: 0.376 ± 0.189
0.978HisVal: 0.978 ± 0.264
0.15HisTrp: 0.15 ± 0.119
0.602HisTyr: 0.602 ± 0.191
0.0HisXaa: 0.0 ± 0.0
Ile
5.491IleAla: 5.491 ± 0.643
0.602IleCys: 0.602 ± 0.215
4.889IleAsp: 4.889 ± 0.532
5.04IleGlu: 5.04 ± 0.675
2.181IlePhe: 2.181 ± 0.327
4.212IleGly: 4.212 ± 0.686
1.204IleHis: 1.204 ± 0.324
3.31IleIle: 3.31 ± 0.546
5.416IleLys: 5.416 ± 0.578
3.385IleLeu: 3.385 ± 0.492
1.354IleMet: 1.354 ± 0.363
5.266IleAsn: 5.266 ± 0.718
2.783IlePro: 2.783 ± 0.376
1.655IleGln: 1.655 ± 0.444
2.407IleArg: 2.407 ± 0.442
3.987IleSer: 3.987 ± 0.526
3.235IleThr: 3.235 ± 0.502
3.611IleVal: 3.611 ± 0.571
0.376IleTrp: 0.376 ± 0.176
2.181IleTyr: 2.181 ± 0.49
0.0IleXaa: 0.0 ± 0.0
Lys
5.943LysAla: 5.943 ± 0.764
0.903LysCys: 0.903 ± 0.25
2.708LysAsp: 2.708 ± 0.463
4.814LysGlu: 4.814 ± 0.675
2.106LysPhe: 2.106 ± 0.443
4.814LysGly: 4.814 ± 0.552
1.204LysHis: 1.204 ± 0.302
3.987LysIle: 3.987 ± 0.528
4.664LysLys: 4.664 ± 0.662
5.867LysLeu: 5.867 ± 0.781
2.031LysMet: 2.031 ± 0.53
3.009LysAsn: 3.009 ± 0.629
3.159LysPro: 3.159 ± 0.484
2.708LysGln: 2.708 ± 0.466
2.934LysArg: 2.934 ± 0.51
4.965LysSer: 4.965 ± 0.69
3.535LysThr: 3.535 ± 0.609
3.46LysVal: 3.46 ± 0.538
0.903LysTrp: 0.903 ± 0.269
3.084LysTyr: 3.084 ± 0.527
0.0LysXaa: 0.0 ± 0.0
Leu
5.266LeuAla: 5.266 ± 0.742
0.978LeuCys: 0.978 ± 0.251
5.566LeuAsp: 5.566 ± 0.541
5.341LeuGlu: 5.341 ± 0.752
2.181LeuPhe: 2.181 ± 0.378
5.19LeuGly: 5.19 ± 0.671
1.204LeuHis: 1.204 ± 0.371
4.589LeuIle: 4.589 ± 0.626
5.19LeuLys: 5.19 ± 0.693
4.889LeuLeu: 4.889 ± 0.876
2.106LeuMet: 2.106 ± 0.423
4.212LeuAsn: 4.212 ± 0.56
2.482LeuPro: 2.482 ± 0.422
4.062LeuGln: 4.062 ± 0.482
3.761LeuArg: 3.761 ± 0.546
6.845LeuSer: 6.845 ± 0.666
6.018LeuThr: 6.018 ± 0.822
5.341LeuVal: 5.341 ± 0.608
0.752LeuTrp: 0.752 ± 0.264
1.805LeuTyr: 1.805 ± 0.32
0.0LeuXaa: 0.0 ± 0.0
Met
2.558MetAla: 2.558 ± 0.426
0.075MetCys: 0.075 ± 0.067
1.354MetAsp: 1.354 ± 0.333
1.58MetGlu: 1.58 ± 0.299
1.128MetPhe: 1.128 ± 0.296
1.354MetGly: 1.354 ± 0.373
0.602MetHis: 0.602 ± 0.241
1.58MetIle: 1.58 ± 0.359
1.881MetLys: 1.881 ± 0.409
1.956MetLeu: 1.956 ± 0.459
0.827MetMet: 0.827 ± 0.298
1.354MetAsn: 1.354 ± 0.417
1.429MetPro: 1.429 ± 0.33
1.053MetGln: 1.053 ± 0.268
1.429MetArg: 1.429 ± 0.342
2.181MetSer: 2.181 ± 0.469
1.655MetThr: 1.655 ± 0.442
1.053MetVal: 1.053 ± 0.329
0.226MetTrp: 0.226 ± 0.123
0.602MetTyr: 0.602 ± 0.208
0.0MetXaa: 0.0 ± 0.0
Asn
5.266AsnAla: 5.266 ± 0.535
0.602AsnCys: 0.602 ± 0.232
4.363AsnAsp: 4.363 ± 0.603
3.535AsnGlu: 3.535 ± 0.489
2.257AsnPhe: 2.257 ± 0.382
5.115AsnGly: 5.115 ± 0.574
1.128AsnHis: 1.128 ± 0.269
2.407AsnIle: 2.407 ± 0.412
3.535AsnLys: 3.535 ± 0.499
4.814AsnLeu: 4.814 ± 0.645
1.279AsnMet: 1.279 ± 0.348
4.137AsnAsn: 4.137 ± 0.685
3.084AsnPro: 3.084 ± 0.517
1.805AsnGln: 1.805 ± 0.395
2.031AsnArg: 2.031 ± 0.362
3.159AsnSer: 3.159 ± 0.475
3.686AsnThr: 3.686 ± 0.532
2.708AsnVal: 2.708 ± 0.626
0.827AsnTrp: 0.827 ± 0.209
2.257AsnTyr: 2.257 ± 0.451
0.0AsnXaa: 0.0 ± 0.0
Pro
3.159ProAla: 3.159 ± 0.467
0.451ProCys: 0.451 ± 0.191
2.407ProAsp: 2.407 ± 0.508
2.783ProGlu: 2.783 ± 0.53
0.903ProPhe: 0.903 ± 0.257
0.677ProGly: 0.677 ± 0.204
0.903ProHis: 0.903 ± 0.258
2.257ProIle: 2.257 ± 0.485
2.031ProLys: 2.031 ± 0.477
3.235ProLeu: 3.235 ± 0.567
0.677ProMet: 0.677 ± 0.221
1.805ProAsn: 1.805 ± 0.354
1.354ProPro: 1.354 ± 0.309
1.053ProGln: 1.053 ± 0.25
1.204ProArg: 1.204 ± 0.345
2.332ProSer: 2.332 ± 0.43
2.482ProThr: 2.482 ± 0.633
2.708ProVal: 2.708 ± 0.405
0.602ProTrp: 0.602 ± 0.26
1.053ProTyr: 1.053 ± 0.284
0.0ProXaa: 0.0 ± 0.0
Gln
3.611GlnAla: 3.611 ± 0.528
0.451GlnCys: 0.451 ± 0.151
2.106GlnAsp: 2.106 ± 0.415
2.332GlnGlu: 2.332 ± 0.427
1.053GlnPhe: 1.053 ± 0.313
1.881GlnGly: 1.881 ± 0.314
0.752GlnHis: 0.752 ± 0.236
2.558GlnIle: 2.558 ± 0.6
2.482GlnLys: 2.482 ± 0.514
3.084GlnLeu: 3.084 ± 0.434
0.978GlnMet: 0.978 ± 0.279
2.031GlnAsn: 2.031 ± 0.323
0.451GlnPro: 0.451 ± 0.213
1.956GlnGln: 1.956 ± 0.321
1.956GlnArg: 1.956 ± 0.451
2.708GlnSer: 2.708 ± 0.505
1.58GlnThr: 1.58 ± 0.337
3.761GlnVal: 3.761 ± 0.611
0.602GlnTrp: 0.602 ± 0.226
1.354GlnTyr: 1.354 ± 0.32
0.0GlnXaa: 0.0 ± 0.0
Arg
2.633ArgAla: 2.633 ± 0.43
0.752ArgCys: 0.752 ± 0.226
2.181ArgAsp: 2.181 ± 0.36
3.009ArgGlu: 3.009 ± 0.487
1.956ArgPhe: 1.956 ± 0.493
2.106ArgGly: 2.106 ± 0.461
0.451ArgHis: 0.451 ± 0.17
2.106ArgIle: 2.106 ± 0.406
3.31ArgLys: 3.31 ± 0.631
3.535ArgLeu: 3.535 ± 0.564
1.279ArgMet: 1.279 ± 0.276
1.73ArgAsn: 1.73 ± 0.366
1.128ArgPro: 1.128 ± 0.299
1.504ArgGln: 1.504 ± 0.341
1.58ArgArg: 1.58 ± 0.386
2.482ArgSer: 2.482 ± 0.346
1.805ArgThr: 1.805 ± 0.372
3.535ArgVal: 3.535 ± 0.456
0.827ArgTrp: 0.827 ± 0.225
0.903ArgTyr: 0.903 ± 0.241
0.0ArgXaa: 0.0 ± 0.0
Ser
5.491SerAla: 5.491 ± 0.662
0.752SerCys: 0.752 ± 0.22
4.814SerAsp: 4.814 ± 0.549
3.987SerGlu: 3.987 ± 0.506
3.235SerPhe: 3.235 ± 0.418
6.319SerGly: 6.319 ± 0.803
0.602SerHis: 0.602 ± 0.183
5.19SerIle: 5.19 ± 0.845
4.513SerLys: 4.513 ± 0.71
5.115SerLeu: 5.115 ± 0.722
1.805SerMet: 1.805 ± 0.379
3.31SerAsn: 3.31 ± 0.518
1.956SerPro: 1.956 ± 0.374
2.482SerGln: 2.482 ± 0.463
2.332SerArg: 2.332 ± 0.51
4.664SerSer: 4.664 ± 0.589
3.912SerThr: 3.912 ± 0.669
4.739SerVal: 4.739 ± 0.595
0.677SerTrp: 0.677 ± 0.246
3.084SerTyr: 3.084 ± 0.489
0.0SerXaa: 0.0 ± 0.0
Thr
4.814ThrAla: 4.814 ± 0.688
0.752ThrCys: 0.752 ± 0.234
3.009ThrAsp: 3.009 ± 0.496
4.288ThrGlu: 4.288 ± 0.512
2.181ThrPhe: 2.181 ± 0.395
4.664ThrGly: 4.664 ± 0.759
1.128ThrHis: 1.128 ± 0.319
5.115ThrIle: 5.115 ± 0.689
3.761ThrLys: 3.761 ± 0.52
4.438ThrLeu: 4.438 ± 0.561
1.504ThrMet: 1.504 ± 0.38
3.009ThrAsn: 3.009 ± 0.56
2.257ThrPro: 2.257 ± 0.341
1.881ThrGln: 1.881 ± 0.418
2.031ThrArg: 2.031 ± 0.378
4.212ThrSer: 4.212 ± 0.757
3.686ThrThr: 3.686 ± 0.732
3.836ThrVal: 3.836 ± 0.587
0.827ThrTrp: 0.827 ± 0.289
1.805ThrTyr: 1.805 ± 0.375
0.0ThrXaa: 0.0 ± 0.0
Val
5.266ValAla: 5.266 ± 0.59
0.451ValCys: 0.451 ± 0.2
4.965ValAsp: 4.965 ± 0.53
3.535ValGlu: 3.535 ± 0.498
3.159ValPhe: 3.159 ± 0.354
6.394ValGly: 6.394 ± 0.681
0.677ValHis: 0.677 ± 0.246
3.535ValIle: 3.535 ± 0.483
3.912ValLys: 3.912 ± 0.651
3.836ValLeu: 3.836 ± 0.505
2.181ValMet: 2.181 ± 0.499
5.341ValAsn: 5.341 ± 0.705
1.58ValPro: 1.58 ± 0.366
1.805ValGln: 1.805 ± 0.36
2.332ValArg: 2.332 ± 0.387
4.137ValSer: 4.137 ± 0.448
4.513ValThr: 4.513 ± 0.64
3.611ValVal: 3.611 ± 0.547
0.752ValTrp: 0.752 ± 0.208
3.009ValTyr: 3.009 ± 0.572
0.0ValXaa: 0.0 ± 0.0
Trp
0.602TrpAla: 0.602 ± 0.197
0.15TrpCys: 0.15 ± 0.128
0.827TrpAsp: 0.827 ± 0.25
0.827TrpGlu: 0.827 ± 0.283
0.677TrpPhe: 0.677 ± 0.256
0.451TrpGly: 0.451 ± 0.193
0.226TrpHis: 0.226 ± 0.122
1.053TrpIle: 1.053 ± 0.296
0.752TrpLys: 0.752 ± 0.283
1.053TrpLeu: 1.053 ± 0.318
0.376TrpMet: 0.376 ± 0.195
0.527TrpAsn: 0.527 ± 0.256
0.451TrpPro: 0.451 ± 0.181
0.827TrpGln: 0.827 ± 0.24
0.903TrpArg: 0.903 ± 0.289
0.752TrpSer: 0.752 ± 0.27
0.677TrpThr: 0.677 ± 0.253
0.752TrpVal: 0.752 ± 0.251
0.075TrpTrp: 0.075 ± 0.084
0.527TrpTyr: 0.527 ± 0.169
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.482TyrAla: 2.482 ± 0.511
0.827TyrCys: 0.827 ± 0.228
3.46TyrAsp: 3.46 ± 0.575
2.482TyrGlu: 2.482 ± 0.484
1.805TyrPhe: 1.805 ± 0.319
3.235TyrGly: 3.235 ± 0.404
0.075TyrHis: 0.075 ± 0.073
2.181TyrIle: 2.181 ± 0.334
2.558TyrLys: 2.558 ± 0.463
2.558TyrLeu: 2.558 ± 0.389
0.677TyrMet: 0.677 ± 0.263
2.633TyrAsn: 2.633 ± 0.413
1.805TyrPro: 1.805 ± 0.359
1.805TyrGln: 1.805 ± 0.402
1.956TyrArg: 1.956 ± 0.426
2.558TyrSer: 2.558 ± 0.369
2.708TyrThr: 2.708 ± 0.36
1.655TyrVal: 1.655 ± 0.399
0.752TyrTrp: 0.752 ± 0.186
1.429TyrTyr: 1.429 ± 0.38
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (13295 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski