Amino acid dipepetide frequency for Pseudoalteromonas phage XC

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.73AlaAla: 6.73 ± 1.364
0.902AlaCys: 0.902 ± 0.241
4.718AlaAsp: 4.718 ± 0.657
5.55AlaGlu: 5.55 ± 0.722
2.636AlaPhe: 2.636 ± 0.496
3.954AlaGly: 3.954 ± 0.497
1.457AlaHis: 1.457 ± 0.303
5.62AlaIle: 5.62 ± 0.618
6.799AlaLys: 6.799 ± 0.959
7.562AlaLeu: 7.562 ± 0.9
2.081AlaMet: 2.081 ± 0.417
4.856AlaAsn: 4.856 ± 0.58
2.636AlaPro: 2.636 ± 0.415
2.636AlaGln: 2.636 ± 0.533
2.706AlaArg: 2.706 ± 0.426
4.232AlaSer: 4.232 ± 0.509
4.301AlaThr: 4.301 ± 0.609
4.787AlaVal: 4.787 ± 0.58
0.971AlaTrp: 0.971 ± 0.314
3.33AlaTyr: 3.33 ± 0.46
0.0AlaXaa: 0.0 ± 0.0
Cys
0.763CysAla: 0.763 ± 0.243
0.416CysCys: 0.416 ± 0.154
1.11CysAsp: 1.11 ± 0.302
1.11CysGlu: 1.11 ± 0.298
0.624CysPhe: 0.624 ± 0.206
0.694CysGly: 0.694 ± 0.223
0.347CysHis: 0.347 ± 0.155
0.763CysIle: 0.763 ± 0.181
0.971CysLys: 0.971 ± 0.265
1.11CysLeu: 1.11 ± 0.276
0.486CysMet: 0.486 ± 0.175
0.694CysAsn: 0.694 ± 0.213
0.208CysPro: 0.208 ± 0.101
0.486CysGln: 0.486 ± 0.154
0.486CysArg: 0.486 ± 0.194
0.694CysSer: 0.694 ± 0.238
0.694CysThr: 0.694 ± 0.219
0.971CysVal: 0.971 ± 0.3
0.208CysTrp: 0.208 ± 0.105
0.624CysTyr: 0.624 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
4.995AspAla: 4.995 ± 0.556
0.902AspCys: 0.902 ± 0.278
5.203AspAsp: 5.203 ± 0.563
4.44AspGlu: 4.44 ± 0.437
2.844AspPhe: 2.844 ± 0.371
4.995AspGly: 4.995 ± 0.604
1.041AspHis: 1.041 ± 0.213
4.787AspIle: 4.787 ± 0.622
4.718AspLys: 4.718 ± 0.514
6.105AspLeu: 6.105 ± 0.828
1.804AspMet: 1.804 ± 0.342
3.469AspAsn: 3.469 ± 0.506
1.596AspPro: 1.596 ± 0.457
1.041AspGln: 1.041 ± 0.308
1.734AspArg: 1.734 ± 0.304
4.301AspSer: 4.301 ± 0.459
3.954AspThr: 3.954 ± 0.527
3.816AspVal: 3.816 ± 0.594
1.388AspTrp: 1.388 ± 0.304
2.914AspTyr: 2.914 ± 0.42
0.0AspXaa: 0.0 ± 0.0
Glu
4.301GluAla: 4.301 ± 0.539
0.624GluCys: 0.624 ± 0.216
3.538GluAsp: 3.538 ± 0.539
3.885GluGlu: 3.885 ± 0.602
3.469GluPhe: 3.469 ± 0.437
4.856GluGly: 4.856 ± 0.505
1.804GluHis: 1.804 ± 0.355
4.995GluIle: 4.995 ± 0.553
4.163GluLys: 4.163 ± 0.612
8.464GluLeu: 8.464 ± 0.853
1.734GluMet: 1.734 ± 0.337
2.983GluAsn: 2.983 ± 0.418
1.873GluPro: 1.873 ± 0.412
3.122GluGln: 3.122 ± 0.529
3.191GluArg: 3.191 ± 0.373
4.44GluSer: 4.44 ± 0.555
3.33GluThr: 3.33 ± 0.543
4.718GluVal: 4.718 ± 0.662
1.665GluTrp: 1.665 ± 0.422
3.191GluTyr: 3.191 ± 0.376
0.0GluXaa: 0.0 ± 0.0
Phe
2.706PheAla: 2.706 ± 0.385
0.833PheCys: 0.833 ± 0.2
3.122PheAsp: 3.122 ± 0.398
2.567PheGlu: 2.567 ± 0.448
1.526PhePhe: 1.526 ± 0.245
2.983PheGly: 2.983 ± 0.506
0.486PheHis: 0.486 ± 0.16
3.469PheIle: 3.469 ± 0.543
2.775PheLys: 2.775 ± 0.441
2.151PheLeu: 2.151 ± 0.316
0.833PheMet: 0.833 ± 0.266
3.122PheAsn: 3.122 ± 0.424
1.179PhePro: 1.179 ± 0.326
0.555PheGln: 0.555 ± 0.148
1.249PheArg: 1.249 ± 0.316
3.538PheSer: 3.538 ± 0.422
3.469PheThr: 3.469 ± 0.484
2.081PheVal: 2.081 ± 0.375
0.278PheTrp: 0.278 ± 0.12
1.457PheTyr: 1.457 ± 0.267
0.0PheXaa: 0.0 ± 0.0
Gly
3.885GlyAla: 3.885 ± 0.585
0.971GlyCys: 0.971 ± 0.3
4.995GlyAsp: 4.995 ± 0.583
5.203GlyGlu: 5.203 ± 0.482
2.775GlyPhe: 2.775 ± 0.424
4.648GlyGly: 4.648 ± 0.627
0.971GlyHis: 0.971 ± 0.307
3.261GlyIle: 3.261 ± 0.5
5.411GlyLys: 5.411 ± 0.656
4.51GlyLeu: 4.51 ± 0.638
1.734GlyMet: 1.734 ± 0.377
4.301GlyAsn: 4.301 ± 0.529
0.624GlyPro: 0.624 ± 0.207
1.943GlyGln: 1.943 ± 0.396
2.22GlyArg: 2.22 ± 0.367
4.44GlySer: 4.44 ± 0.509
3.954GlyThr: 3.954 ± 0.509
5.758GlyVal: 5.758 ± 0.538
0.833GlyTrp: 0.833 ± 0.252
2.428GlyTyr: 2.428 ± 0.418
0.0GlyXaa: 0.0 ± 0.0
His
1.041HisAla: 1.041 ± 0.26
0.624HisCys: 0.624 ± 0.198
0.902HisAsp: 0.902 ± 0.252
1.041HisGlu: 1.041 ± 0.212
0.763HisPhe: 0.763 ± 0.242
1.179HisGly: 1.179 ± 0.336
0.555HisHis: 0.555 ± 0.248
0.902HisIle: 0.902 ± 0.231
1.457HisLys: 1.457 ± 0.329
1.943HisLeu: 1.943 ± 0.35
0.208HisMet: 0.208 ± 0.124
1.11HisAsn: 1.11 ± 0.266
0.971HisPro: 0.971 ± 0.227
0.694HisGln: 0.694 ± 0.205
0.694HisArg: 0.694 ± 0.202
1.179HisSer: 1.179 ± 0.335
0.555HisThr: 0.555 ± 0.168
0.902HisVal: 0.902 ± 0.248
0.069HisTrp: 0.069 ± 0.073
0.902HisTyr: 0.902 ± 0.281
0.0HisXaa: 0.0 ± 0.0
Ile
5.828IleAla: 5.828 ± 0.508
0.833IleCys: 0.833 ± 0.244
4.995IleAsp: 4.995 ± 0.674
6.105IleGlu: 6.105 ± 0.573
1.596IlePhe: 1.596 ± 0.279
3.677IleGly: 3.677 ± 0.455
1.041IleHis: 1.041 ± 0.246
3.608IleIle: 3.608 ± 0.465
5.273IleLys: 5.273 ± 0.37
3.399IleLeu: 3.399 ± 0.433
1.388IleMet: 1.388 ± 0.304
4.718IleAsn: 4.718 ± 0.606
2.081IlePro: 2.081 ± 0.394
1.596IleGln: 1.596 ± 0.245
1.526IleArg: 1.526 ± 0.328
3.954IleSer: 3.954 ± 0.689
4.024IleThr: 4.024 ± 0.473
3.608IleVal: 3.608 ± 0.407
0.347IleTrp: 0.347 ± 0.156
1.943IleTyr: 1.943 ± 0.315
0.0IleXaa: 0.0 ± 0.0
Lys
6.799LysAla: 6.799 ± 0.715
0.971LysCys: 0.971 ± 0.273
3.608LysAsp: 3.608 ± 0.475
5.065LysGlu: 5.065 ± 0.743
2.983LysPhe: 2.983 ± 0.548
4.024LysGly: 4.024 ± 0.453
1.943LysHis: 1.943 ± 0.365
3.746LysIle: 3.746 ± 0.541
4.718LysLys: 4.718 ± 0.744
5.55LysLeu: 5.55 ± 0.756
1.943LysMet: 1.943 ± 0.454
3.122LysAsn: 3.122 ± 0.59
3.261LysPro: 3.261 ± 0.422
3.261LysGln: 3.261 ± 0.536
3.053LysArg: 3.053 ± 0.497
5.411LysSer: 5.411 ± 0.646
3.954LysThr: 3.954 ± 0.632
4.648LysVal: 4.648 ± 0.649
1.526LysTrp: 1.526 ± 0.325
3.885LysTyr: 3.885 ± 0.522
0.0LysXaa: 0.0 ± 0.0
Leu
6.73LeuAla: 6.73 ± 0.86
1.318LeuCys: 1.318 ± 0.294
5.758LeuAsp: 5.758 ± 0.593
4.995LeuGlu: 4.995 ± 0.659
2.151LeuPhe: 2.151 ± 0.348
5.134LeuGly: 5.134 ± 0.459
1.457LeuHis: 1.457 ± 0.342
4.787LeuIle: 4.787 ± 0.501
5.897LeuLys: 5.897 ± 0.826
5.481LeuLeu: 5.481 ± 0.661
2.706LeuMet: 2.706 ± 0.492
4.926LeuAsn: 4.926 ± 0.696
2.567LeuPro: 2.567 ± 0.517
3.191LeuGln: 3.191 ± 0.419
3.399LeuArg: 3.399 ± 0.457
4.718LeuSer: 4.718 ± 0.592
5.411LeuThr: 5.411 ± 0.766
4.232LeuVal: 4.232 ± 0.701
1.179LeuTrp: 1.179 ± 0.291
2.706LeuTyr: 2.706 ± 0.474
0.0LeuXaa: 0.0 ± 0.0
Met
3.053MetAla: 3.053 ± 0.452
0.139MetCys: 0.139 ± 0.094
1.041MetAsp: 1.041 ± 0.25
1.179MetGlu: 1.179 ± 0.266
0.416MetPhe: 0.416 ± 0.147
2.151MetGly: 2.151 ± 0.457
0.278MetHis: 0.278 ± 0.151
1.873MetIle: 1.873 ± 0.428
2.359MetLys: 2.359 ± 0.442
1.665MetLeu: 1.665 ± 0.388
0.347MetMet: 0.347 ± 0.147
1.179MetAsn: 1.179 ± 0.265
0.763MetPro: 0.763 ± 0.215
1.388MetGln: 1.388 ± 0.311
1.526MetArg: 1.526 ± 0.278
2.498MetSer: 2.498 ± 0.439
2.012MetThr: 2.012 ± 0.392
1.596MetVal: 1.596 ± 0.296
0.139MetTrp: 0.139 ± 0.101
0.902MetTyr: 0.902 ± 0.168
0.0MetXaa: 0.0 ± 0.0
Asn
4.856AsnAla: 4.856 ± 0.756
0.624AsnCys: 0.624 ± 0.183
4.301AsnAsp: 4.301 ± 0.515
3.954AsnGlu: 3.954 ± 0.456
2.151AsnPhe: 2.151 ± 0.365
4.579AsnGly: 4.579 ± 0.598
1.179AsnHis: 1.179 ± 0.284
2.775AsnIle: 2.775 ± 0.483
4.371AsnLys: 4.371 ± 0.588
4.51AsnLeu: 4.51 ± 0.62
1.11AsnMet: 1.11 ± 0.283
3.053AsnAsn: 3.053 ± 0.497
3.469AsnPro: 3.469 ± 0.457
1.734AsnGln: 1.734 ± 0.337
2.012AsnArg: 2.012 ± 0.403
3.677AsnSer: 3.677 ± 0.458
3.122AsnThr: 3.122 ± 0.478
2.844AsnVal: 2.844 ± 0.495
0.902AsnTrp: 0.902 ± 0.232
2.22AsnTyr: 2.22 ± 0.347
0.0AsnXaa: 0.0 ± 0.0
Pro
2.636ProAla: 2.636 ± 0.514
0.486ProCys: 0.486 ± 0.219
2.081ProAsp: 2.081 ± 0.447
3.122ProGlu: 3.122 ± 0.464
1.388ProPhe: 1.388 ± 0.256
0.902ProGly: 0.902 ± 0.342
0.486ProHis: 0.486 ± 0.174
2.081ProIle: 2.081 ± 0.378
2.983ProLys: 2.983 ± 0.487
2.359ProLeu: 2.359 ± 0.407
0.833ProMet: 0.833 ± 0.228
1.388ProAsn: 1.388 ± 0.278
0.555ProPro: 0.555 ± 0.16
1.318ProGln: 1.318 ± 0.301
0.833ProArg: 0.833 ± 0.252
2.151ProSer: 2.151 ± 0.374
2.081ProThr: 2.081 ± 0.336
1.804ProVal: 1.804 ± 0.312
0.416ProTrp: 0.416 ± 0.139
1.11ProTyr: 1.11 ± 0.265
0.0ProXaa: 0.0 ± 0.0
Gln
3.053GlnAla: 3.053 ± 0.524
0.416GlnCys: 0.416 ± 0.163
2.081GlnAsp: 2.081 ± 0.274
2.012GlnGlu: 2.012 ± 0.362
1.665GlnPhe: 1.665 ± 0.349
2.289GlnGly: 2.289 ± 0.303
0.278GlnHis: 0.278 ± 0.15
2.844GlnIle: 2.844 ± 0.376
1.804GlnLys: 1.804 ± 0.297
2.289GlnLeu: 2.289 ± 0.467
0.902GlnMet: 0.902 ± 0.223
1.665GlnAsn: 1.665 ± 0.305
0.833GlnPro: 0.833 ± 0.266
1.388GlnGln: 1.388 ± 0.303
1.596GlnArg: 1.596 ± 0.353
3.399GlnSer: 3.399 ± 0.528
1.804GlnThr: 1.804 ± 0.326
3.122GlnVal: 3.122 ± 0.574
0.624GlnTrp: 0.624 ± 0.236
1.179GlnTyr: 1.179 ± 0.268
0.0GlnXaa: 0.0 ± 0.0
Arg
2.775ArgAla: 2.775 ± 0.461
0.416ArgCys: 0.416 ± 0.178
1.943ArgAsp: 1.943 ± 0.335
3.122ArgGlu: 3.122 ± 0.433
2.012ArgPhe: 2.012 ± 0.357
2.22ArgGly: 2.22 ± 0.478
0.624ArgHis: 0.624 ± 0.23
2.289ArgIle: 2.289 ± 0.497
3.191ArgLys: 3.191 ± 0.594
2.983ArgLeu: 2.983 ± 0.564
1.179ArgMet: 1.179 ± 0.349
2.012ArgAsn: 2.012 ± 0.383
0.833ArgPro: 0.833 ± 0.206
1.318ArgGln: 1.318 ± 0.319
2.012ArgArg: 2.012 ± 0.45
2.636ArgSer: 2.636 ± 0.523
2.151ArgThr: 2.151 ± 0.376
3.399ArgVal: 3.399 ± 0.639
0.833ArgTrp: 0.833 ± 0.207
1.318ArgTyr: 1.318 ± 0.244
0.0ArgXaa: 0.0 ± 0.0
Ser
4.995SerAla: 4.995 ± 0.658
0.347SerCys: 0.347 ± 0.158
4.718SerAsp: 4.718 ± 0.507
4.718SerGlu: 4.718 ± 0.618
2.706SerPhe: 2.706 ± 0.35
4.787SerGly: 4.787 ± 0.453
1.11SerHis: 1.11 ± 0.258
3.954SerIle: 3.954 ± 0.57
5.342SerLys: 5.342 ± 0.619
4.787SerLeu: 4.787 ± 0.47
1.943SerMet: 1.943 ± 0.326
3.746SerAsn: 3.746 ± 0.482
1.457SerPro: 1.457 ± 0.295
2.983SerGln: 2.983 ± 0.452
3.469SerArg: 3.469 ± 0.525
5.481SerSer: 5.481 ± 0.642
3.816SerThr: 3.816 ± 0.526
4.787SerVal: 4.787 ± 0.524
0.694SerTrp: 0.694 ± 0.186
2.636SerTyr: 2.636 ± 0.463
0.0SerXaa: 0.0 ± 0.0
Thr
5.273ThrAla: 5.273 ± 0.789
0.694ThrCys: 0.694 ± 0.243
3.677ThrAsp: 3.677 ± 0.387
3.954ThrGlu: 3.954 ± 0.545
2.775ThrPhe: 2.775 ± 0.534
4.44ThrGly: 4.44 ± 0.489
0.763ThrHis: 0.763 ± 0.246
3.191ThrIle: 3.191 ± 0.635
3.053ThrLys: 3.053 ± 0.612
4.787ThrLeu: 4.787 ± 0.554
1.318ThrMet: 1.318 ± 0.258
4.51ThrAsn: 4.51 ± 0.477
2.428ThrPro: 2.428 ± 0.416
2.22ThrGln: 2.22 ± 0.365
2.22ThrArg: 2.22 ± 0.343
3.191ThrSer: 3.191 ± 0.498
3.261ThrThr: 3.261 ± 0.505
3.816ThrVal: 3.816 ± 0.476
0.763ThrTrp: 0.763 ± 0.199
2.636ThrTyr: 2.636 ± 0.529
0.0ThrXaa: 0.0 ± 0.0
Val
4.579ValAla: 4.579 ± 0.542
0.833ValCys: 0.833 ± 0.233
4.232ValAsp: 4.232 ± 0.609
4.856ValGlu: 4.856 ± 0.558
3.261ValPhe: 3.261 ± 0.425
3.677ValGly: 3.677 ± 0.476
0.694ValHis: 0.694 ± 0.262
3.538ValIle: 3.538 ± 0.408
4.718ValLys: 4.718 ± 0.614
4.44ValLeu: 4.44 ± 0.564
2.359ValMet: 2.359 ± 0.455
3.816ValAsn: 3.816 ± 0.523
1.943ValPro: 1.943 ± 0.334
2.359ValGln: 2.359 ± 0.36
2.914ValArg: 2.914 ± 0.419
4.718ValSer: 4.718 ± 0.57
4.371ValThr: 4.371 ± 0.848
3.191ValVal: 3.191 ± 0.428
1.11ValTrp: 1.11 ± 0.27
2.498ValTyr: 2.498 ± 0.419
0.0ValXaa: 0.0 ± 0.0
Trp
1.041TrpAla: 1.041 ± 0.281
0.278TrpCys: 0.278 ± 0.121
0.833TrpAsp: 0.833 ± 0.224
1.11TrpGlu: 1.11 ± 0.289
1.041TrpPhe: 1.041 ± 0.317
0.833TrpGly: 0.833 ± 0.259
0.624TrpHis: 0.624 ± 0.204
0.763TrpIle: 0.763 ± 0.273
0.486TrpLys: 0.486 ± 0.166
1.318TrpLeu: 1.318 ± 0.368
0.624TrpMet: 0.624 ± 0.201
0.763TrpAsn: 0.763 ± 0.226
0.486TrpPro: 0.486 ± 0.186
0.624TrpGln: 0.624 ± 0.206
0.902TrpArg: 0.902 ± 0.222
0.624TrpSer: 0.624 ± 0.187
0.347TrpThr: 0.347 ± 0.134
1.179TrpVal: 1.179 ± 0.265
0.139TrpTrp: 0.139 ± 0.085
0.416TrpTyr: 0.416 ± 0.196
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.428TyrAla: 2.428 ± 0.407
0.833TyrCys: 0.833 ± 0.244
3.191TyrAsp: 3.191 ± 0.377
2.359TyrGlu: 2.359 ± 0.423
1.596TyrPhe: 1.596 ± 0.359
2.844TyrGly: 2.844 ± 0.466
0.555TyrHis: 0.555 ± 0.211
2.289TyrIle: 2.289 ± 0.401
2.983TyrLys: 2.983 ± 0.463
3.33TyrLeu: 3.33 ± 0.455
0.902TyrMet: 0.902 ± 0.258
2.012TyrAsn: 2.012 ± 0.338
1.249TyrPro: 1.249 ± 0.226
1.457TyrGln: 1.457 ± 0.292
1.526TyrArg: 1.526 ± 0.288
3.191TyrSer: 3.191 ± 0.453
2.359TyrThr: 2.359 ± 0.432
2.844TyrVal: 2.844 ± 0.369
0.347TyrTrp: 0.347 ± 0.131
1.249TyrTyr: 1.249 ± 0.284
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (14415 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski