Amino acid dipepetide frequency for Moraxella phage Mcat22

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.477AlaAla: 4.477 ± 0.971
1.436AlaCys: 1.436 ± 0.43
5.068AlaAsp: 5.068 ± 0.576
5.913AlaGlu: 5.913 ± 0.798
2.872AlaPhe: 2.872 ± 0.489
5.491AlaGly: 5.491 ± 0.688
1.352AlaHis: 1.352 ± 0.338
4.477AlaIle: 4.477 ± 0.526
7.265AlaLys: 7.265 ± 0.914
8.363AlaLeu: 8.363 ± 1.107
2.027AlaMet: 2.027 ± 0.553
3.632AlaAsn: 3.632 ± 0.657
2.196AlaPro: 2.196 ± 0.497
4.562AlaGln: 4.562 ± 0.731
3.886AlaArg: 3.886 ± 0.465
6.167AlaSer: 6.167 ± 0.817
5.406AlaThr: 5.406 ± 1.256
4.899AlaVal: 4.899 ± 0.545
0.845AlaTrp: 0.845 ± 0.308
2.872AlaTyr: 2.872 ± 0.468
0.0AlaXaa: 0.0 ± 0.0
Cys
0.507CysAla: 0.507 ± 0.231
0.253CysCys: 0.253 ± 0.143
0.676CysAsp: 0.676 ± 0.243
0.76CysGlu: 0.76 ± 0.304
0.422CysPhe: 0.422 ± 0.166
0.76CysGly: 0.76 ± 0.347
0.676CysHis: 0.676 ± 0.251
0.929CysIle: 0.929 ± 0.33
0.845CysLys: 0.845 ± 0.339
1.183CysLeu: 1.183 ± 0.365
0.253CysMet: 0.253 ± 0.141
0.169CysAsn: 0.169 ± 0.114
0.338CysPro: 0.338 ± 0.162
0.507CysGln: 0.507 ± 0.199
0.845CysArg: 0.845 ± 0.252
0.845CysSer: 0.845 ± 0.261
0.676CysThr: 0.676 ± 0.277
0.845CysVal: 0.845 ± 0.299
0.0CysTrp: 0.0 ± 0.0
0.422CysTyr: 0.422 ± 0.205
0.0CysXaa: 0.0 ± 0.0
Asp
4.815AspAla: 4.815 ± 0.63
0.676AspCys: 0.676 ± 0.254
4.393AspAsp: 4.393 ± 0.713
5.237AspGlu: 5.237 ± 0.721
3.126AspPhe: 3.126 ± 0.604
5.744AspGly: 5.744 ± 0.652
1.014AspHis: 1.014 ± 0.281
4.646AspIle: 4.646 ± 0.692
4.477AspLys: 4.477 ± 0.557
4.731AspLeu: 4.731 ± 0.649
1.436AspMet: 1.436 ± 0.313
3.379AspAsn: 3.379 ± 0.555
1.689AspPro: 1.689 ± 0.355
1.352AspGln: 1.352 ± 0.392
2.45AspArg: 2.45 ± 0.472
3.126AspSer: 3.126 ± 0.488
2.957AspThr: 2.957 ± 0.446
3.801AspVal: 3.801 ± 0.59
1.014AspTrp: 1.014 ± 0.296
2.112AspTyr: 2.112 ± 0.515
0.0AspXaa: 0.0 ± 0.0
Glu
3.21GluAla: 3.21 ± 0.551
1.014GluCys: 1.014 ± 0.273
2.534GluAsp: 2.534 ± 0.632
2.281GluGlu: 2.281 ± 0.592
2.703GluPhe: 2.703 ± 0.466
2.112GluGly: 2.112 ± 0.392
1.943GluHis: 1.943 ± 0.475
4.393GluIle: 4.393 ± 0.533
5.153GluLys: 5.153 ± 0.806
7.687GluLeu: 7.687 ± 0.888
2.365GluMet: 2.365 ± 0.461
2.957GluAsn: 2.957 ± 0.494
2.534GluPro: 2.534 ± 0.512
3.294GluGln: 3.294 ± 0.558
3.463GluArg: 3.463 ± 0.733
2.196GluSer: 2.196 ± 0.384
2.788GluThr: 2.788 ± 0.426
3.21GluVal: 3.21 ± 0.491
0.76GluTrp: 0.76 ± 0.283
2.534GluTyr: 2.534 ± 0.41
0.0GluXaa: 0.0 ± 0.0
Phe
2.45PheAla: 2.45 ± 0.638
0.929PheCys: 0.929 ± 0.327
3.21PheAsp: 3.21 ± 0.521
3.041PheGlu: 3.041 ± 0.438
0.76PhePhe: 0.76 ± 0.262
3.21PheGly: 3.21 ± 0.611
0.929PheHis: 0.929 ± 0.314
2.534PheIle: 2.534 ± 0.305
1.689PheLys: 1.689 ± 0.339
2.703PheLeu: 2.703 ± 0.586
0.676PheMet: 0.676 ± 0.27
1.943PheAsn: 1.943 ± 0.33
0.929PhePro: 0.929 ± 0.248
0.676PheGln: 0.676 ± 0.275
1.436PheArg: 1.436 ± 0.375
2.112PheSer: 2.112 ± 0.474
2.45PheThr: 2.45 ± 0.388
2.027PheVal: 2.027 ± 0.421
0.676PheTrp: 0.676 ± 0.233
1.352PheTyr: 1.352 ± 0.332
0.0PheXaa: 0.0 ± 0.0
Gly
4.224GlyAla: 4.224 ± 0.859
0.591GlyCys: 0.591 ± 0.227
3.97GlyAsp: 3.97 ± 0.664
4.646GlyGlu: 4.646 ± 0.684
3.126GlyPhe: 3.126 ± 0.578
5.66GlyGly: 5.66 ± 1.181
1.183GlyHis: 1.183 ± 0.328
3.886GlyIle: 3.886 ± 0.502
6.842GlyLys: 6.842 ± 0.652
5.913GlyLeu: 5.913 ± 0.605
1.943GlyMet: 1.943 ± 0.389
3.379GlyAsn: 3.379 ± 0.549
0.169GlyPro: 0.169 ± 0.155
1.774GlyGln: 1.774 ± 0.313
3.717GlyArg: 3.717 ± 0.559
2.872GlySer: 2.872 ± 0.424
3.041GlyThr: 3.041 ± 0.577
4.899GlyVal: 4.899 ± 0.666
1.098GlyTrp: 1.098 ± 0.31
1.605GlyTyr: 1.605 ± 0.463
0.0GlyXaa: 0.0 ± 0.0
His
2.619HisAla: 2.619 ± 0.485
0.169HisCys: 0.169 ± 0.097
1.521HisAsp: 1.521 ± 0.357
1.183HisGlu: 1.183 ± 0.279
0.76HisPhe: 0.76 ± 0.252
1.858HisGly: 1.858 ± 0.367
1.352HisHis: 1.352 ± 0.459
1.605HisIle: 1.605 ± 0.382
1.098HisLys: 1.098 ± 0.303
2.957HisLeu: 2.957 ± 0.527
0.507HisMet: 0.507 ± 0.198
1.183HisAsn: 1.183 ± 0.309
1.183HisPro: 1.183 ± 0.374
0.929HisGln: 0.929 ± 0.228
0.676HisArg: 0.676 ± 0.198
1.014HisSer: 1.014 ± 0.285
1.689HisThr: 1.689 ± 0.34
0.845HisVal: 0.845 ± 0.268
0.507HisTrp: 0.507 ± 0.205
1.098HisTyr: 1.098 ± 0.373
0.0HisXaa: 0.0 ± 0.0
Ile
5.491IleAla: 5.491 ± 0.523
0.676IleCys: 0.676 ± 0.282
5.322IleAsp: 5.322 ± 0.608
3.97IleGlu: 3.97 ± 0.573
1.774IlePhe: 1.774 ± 0.469
4.477IleGly: 4.477 ± 0.587
1.267IleHis: 1.267 ± 0.288
5.66IleIle: 5.66 ± 0.604
5.66IleLys: 5.66 ± 0.804
4.815IleLeu: 4.815 ± 0.711
2.027IleMet: 2.027 ± 0.466
3.886IleAsn: 3.886 ± 0.647
1.605IlePro: 1.605 ± 0.358
2.788IleGln: 2.788 ± 0.607
1.943IleArg: 1.943 ± 0.315
3.294IleSer: 3.294 ± 0.617
4.562IleThr: 4.562 ± 0.734
2.788IleVal: 2.788 ± 0.578
0.338IleTrp: 0.338 ± 0.149
2.534IleTyr: 2.534 ± 0.396
0.0IleXaa: 0.0 ± 0.0
Lys
8.194LysAla: 8.194 ± 1.262
0.76LysCys: 0.76 ± 0.314
3.463LysAsp: 3.463 ± 0.529
2.534LysGlu: 2.534 ± 0.509
2.534LysPhe: 2.534 ± 0.494
3.886LysGly: 3.886 ± 0.567
1.267LysHis: 1.267 ± 0.408
4.815LysIle: 4.815 ± 0.593
5.406LysLys: 5.406 ± 0.88
6.167LysLeu: 6.167 ± 0.658
2.534LysMet: 2.534 ± 0.472
4.308LysAsn: 4.308 ± 0.627
2.872LysPro: 2.872 ± 0.536
3.463LysGln: 3.463 ± 0.637
4.055LysArg: 4.055 ± 0.643
5.66LysSer: 5.66 ± 0.763
4.477LysThr: 4.477 ± 0.703
3.463LysVal: 3.463 ± 0.521
1.098LysTrp: 1.098 ± 0.333
2.027LysTyr: 2.027 ± 0.445
0.0LysXaa: 0.0 ± 0.0
Leu
8.701LeuAla: 8.701 ± 0.932
0.929LeuCys: 0.929 ± 0.362
6.589LeuAsp: 6.589 ± 0.692
5.153LeuGlu: 5.153 ± 0.799
2.619LeuPhe: 2.619 ± 0.334
5.406LeuGly: 5.406 ± 0.776
2.196LeuHis: 2.196 ± 0.456
4.899LeuIle: 4.899 ± 0.604
7.687LeuLys: 7.687 ± 0.986
7.434LeuLeu: 7.434 ± 1.128
1.858LeuMet: 1.858 ± 0.431
4.731LeuAsn: 4.731 ± 0.619
3.97LeuPro: 3.97 ± 0.61
3.97LeuGln: 3.97 ± 0.452
4.899LeuArg: 4.899 ± 0.539
7.096LeuSer: 7.096 ± 0.83
4.731LeuThr: 4.731 ± 0.858
4.984LeuVal: 4.984 ± 0.616
1.352LeuTrp: 1.352 ± 0.228
3.126LeuTyr: 3.126 ± 0.366
0.0LeuXaa: 0.0 ± 0.0
Met
2.027MetAla: 2.027 ± 0.488
0.169MetCys: 0.169 ± 0.116
1.605MetAsp: 1.605 ± 0.404
1.098MetGlu: 1.098 ± 0.352
1.098MetPhe: 1.098 ± 0.33
1.943MetGly: 1.943 ± 0.447
0.76MetHis: 0.76 ± 0.26
2.112MetIle: 2.112 ± 0.474
1.352MetLys: 1.352 ± 0.319
2.365MetLeu: 2.365 ± 0.444
0.76MetMet: 0.76 ± 0.307
1.436MetAsn: 1.436 ± 0.409
1.098MetPro: 1.098 ± 0.277
1.605MetGln: 1.605 ± 0.365
0.929MetArg: 0.929 ± 0.308
1.858MetSer: 1.858 ± 0.375
1.689MetThr: 1.689 ± 0.311
2.365MetVal: 2.365 ± 0.662
0.169MetTrp: 0.169 ± 0.109
0.591MetTyr: 0.591 ± 0.221
0.0MetXaa: 0.0 ± 0.0
Asn
4.224AsnAla: 4.224 ± 0.58
0.507AsnCys: 0.507 ± 0.232
2.872AsnAsp: 2.872 ± 0.582
3.126AsnGlu: 3.126 ± 0.607
2.027AsnPhe: 2.027 ± 0.508
2.872AsnGly: 2.872 ± 0.506
1.605AsnHis: 1.605 ± 0.506
3.632AsnIle: 3.632 ± 0.46
3.126AsnLys: 3.126 ± 0.529
5.322AsnLeu: 5.322 ± 0.689
1.352AsnMet: 1.352 ± 0.474
2.872AsnAsn: 2.872 ± 0.617
2.703AsnPro: 2.703 ± 0.47
1.436AsnGln: 1.436 ± 0.368
2.45AsnArg: 2.45 ± 0.46
3.041AsnSer: 3.041 ± 0.683
2.619AsnThr: 2.619 ± 0.478
2.788AsnVal: 2.788 ± 0.458
0.422AsnTrp: 0.422 ± 0.187
1.689AsnTyr: 1.689 ± 0.384
0.0AsnXaa: 0.0 ± 0.0
Pro
2.281ProAla: 2.281 ± 0.558
0.676ProCys: 0.676 ± 0.225
2.112ProAsp: 2.112 ± 0.431
1.605ProGlu: 1.605 ± 0.396
1.436ProPhe: 1.436 ± 0.467
0.76ProGly: 0.76 ± 0.208
0.929ProHis: 0.929 ± 0.278
2.703ProIle: 2.703 ± 0.429
2.196ProLys: 2.196 ± 0.442
2.027ProLeu: 2.027 ± 0.416
1.436ProMet: 1.436 ± 0.322
2.872ProAsn: 2.872 ± 0.5
1.774ProPro: 1.774 ± 0.432
1.352ProGln: 1.352 ± 0.364
1.352ProArg: 1.352 ± 0.344
2.872ProSer: 2.872 ± 0.524
2.027ProThr: 2.027 ± 0.351
2.619ProVal: 2.619 ± 0.434
0.507ProTrp: 0.507 ± 0.214
0.591ProTyr: 0.591 ± 0.253
0.0ProXaa: 0.0 ± 0.0
Gln
4.899GlnAla: 4.899 ± 0.635
0.169GlnCys: 0.169 ± 0.124
2.703GlnAsp: 2.703 ± 0.593
2.703GlnGlu: 2.703 ± 0.489
1.014GlnPhe: 1.014 ± 0.356
2.281GlnGly: 2.281 ± 0.484
0.676GlnHis: 0.676 ± 0.23
2.957GlnIle: 2.957 ± 0.545
3.717GlnLys: 3.717 ± 0.501
3.463GlnLeu: 3.463 ± 0.555
1.267GlnMet: 1.267 ± 0.26
1.943GlnAsn: 1.943 ± 0.506
1.436GlnPro: 1.436 ± 0.377
1.943GlnGln: 1.943 ± 0.472
1.689GlnArg: 1.689 ± 0.431
2.872GlnSer: 2.872 ± 0.444
3.379GlnThr: 3.379 ± 0.651
2.45GlnVal: 2.45 ± 0.614
0.591GlnTrp: 0.591 ± 0.236
1.689GlnTyr: 1.689 ± 0.307
0.0GlnXaa: 0.0 ± 0.0
Arg
4.899ArgAla: 4.899 ± 0.641
0.338ArgCys: 0.338 ± 0.169
2.703ArgAsp: 2.703 ± 0.313
2.703ArgGlu: 2.703 ± 0.536
2.196ArgPhe: 2.196 ± 0.417
2.957ArgGly: 2.957 ± 0.435
1.605ArgHis: 1.605 ± 0.316
2.365ArgIle: 2.365 ± 0.438
2.027ArgLys: 2.027 ± 0.413
5.744ArgLeu: 5.744 ± 0.76
0.845ArgMet: 0.845 ± 0.296
1.858ArgAsn: 1.858 ± 0.424
1.521ArgPro: 1.521 ± 0.368
2.365ArgGln: 2.365 ± 0.482
2.281ArgArg: 2.281 ± 0.477
2.534ArgSer: 2.534 ± 0.505
2.619ArgThr: 2.619 ± 0.407
2.619ArgVal: 2.619 ± 0.498
1.014ArgTrp: 1.014 ± 0.275
2.703ArgTyr: 2.703 ± 0.482
0.0ArgXaa: 0.0 ± 0.0
Ser
4.562SerAla: 4.562 ± 0.727
0.422SerCys: 0.422 ± 0.185
2.957SerAsp: 2.957 ± 0.636
4.646SerGlu: 4.646 ± 0.524
2.112SerPhe: 2.112 ± 0.458
2.534SerGly: 2.534 ± 0.337
2.027SerHis: 2.027 ± 0.376
4.055SerIle: 4.055 ± 0.729
3.801SerLys: 3.801 ± 0.628
6.251SerLeu: 6.251 ± 0.807
1.098SerMet: 1.098 ± 0.308
3.548SerAsn: 3.548 ± 0.551
1.858SerPro: 1.858 ± 0.477
3.21SerGln: 3.21 ± 0.754
3.801SerArg: 3.801 ± 0.559
2.196SerSer: 2.196 ± 0.494
3.126SerThr: 3.126 ± 0.502
4.899SerVal: 4.899 ± 0.78
0.676SerTrp: 0.676 ± 0.181
1.858SerTyr: 1.858 ± 0.347
0.0SerXaa: 0.0 ± 0.0
Thr
7.18ThrAla: 7.18 ± 0.986
0.338ThrCys: 0.338 ± 0.224
3.97ThrAsp: 3.97 ± 0.581
3.041ThrGlu: 3.041 ± 0.599
1.352ThrPhe: 1.352 ± 0.354
4.646ThrGly: 4.646 ± 0.657
1.098ThrHis: 1.098 ± 0.313
2.788ThrIle: 2.788 ± 0.44
3.632ThrLys: 3.632 ± 0.739
4.984ThrLeu: 4.984 ± 0.651
1.605ThrMet: 1.605 ± 0.332
2.788ThrAsn: 2.788 ± 0.538
2.957ThrPro: 2.957 ± 0.485
2.45ThrGln: 2.45 ± 0.413
1.267ThrArg: 1.267 ± 0.369
2.788ThrSer: 2.788 ± 0.516
3.886ThrThr: 3.886 ± 0.755
4.308ThrVal: 4.308 ± 0.691
0.929ThrTrp: 0.929 ± 0.262
1.436ThrTyr: 1.436 ± 0.26
0.0ThrXaa: 0.0 ± 0.0
Val
5.153ValAla: 5.153 ± 0.761
0.845ValCys: 0.845 ± 0.292
3.632ValAsp: 3.632 ± 0.548
2.703ValGlu: 2.703 ± 0.544
2.281ValPhe: 2.281 ± 0.456
4.308ValGly: 4.308 ± 0.685
1.352ValHis: 1.352 ± 0.366
3.97ValIle: 3.97 ± 0.62
4.224ValLys: 4.224 ± 0.795
5.66ValLeu: 5.66 ± 0.772
1.858ValMet: 1.858 ± 0.408
2.788ValAsn: 2.788 ± 0.42
1.521ValPro: 1.521 ± 0.255
2.788ValGln: 2.788 ± 0.43
3.548ValArg: 3.548 ± 0.529
4.308ValSer: 4.308 ± 0.607
3.041ValThr: 3.041 ± 0.377
3.041ValVal: 3.041 ± 0.461
0.845ValTrp: 0.845 ± 0.265
1.858ValTyr: 1.858 ± 0.41
0.0ValXaa: 0.0 ± 0.0
Trp
1.014TrpAla: 1.014 ± 0.323
0.338TrpCys: 0.338 ± 0.171
0.845TrpAsp: 0.845 ± 0.305
0.845TrpGlu: 0.845 ± 0.273
0.422TrpPhe: 0.422 ± 0.192
0.591TrpGly: 0.591 ± 0.217
0.338TrpHis: 0.338 ± 0.148
0.507TrpIle: 0.507 ± 0.194
0.845TrpLys: 0.845 ± 0.394
1.436TrpLeu: 1.436 ± 0.453
0.169TrpMet: 0.169 ± 0.117
0.253TrpAsn: 0.253 ± 0.175
0.253TrpPro: 0.253 ± 0.123
1.521TrpGln: 1.521 ± 0.363
0.676TrpArg: 0.676 ± 0.208
0.676TrpSer: 0.676 ± 0.255
0.422TrpThr: 0.422 ± 0.195
1.183TrpVal: 1.183 ± 0.308
0.253TrpTrp: 0.253 ± 0.132
0.76TrpTyr: 0.76 ± 0.231
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.281TyrAla: 2.281 ± 0.607
0.676TyrCys: 0.676 ± 0.247
2.027TyrAsp: 2.027 ± 0.458
1.943TyrGlu: 1.943 ± 0.412
1.098TyrPhe: 1.098 ± 0.386
3.041TyrGly: 3.041 ± 0.529
1.183TyrHis: 1.183 ± 0.335
1.943TyrIle: 1.943 ± 0.373
1.943TyrLys: 1.943 ± 0.429
3.126TyrLeu: 3.126 ± 0.675
0.929TyrMet: 0.929 ± 0.339
0.845TyrAsn: 0.845 ± 0.245
1.521TyrPro: 1.521 ± 0.406
1.858TyrGln: 1.858 ± 0.386
2.534TyrArg: 2.534 ± 0.425
2.112TyrSer: 2.112 ± 0.454
1.858TyrThr: 1.858 ± 0.351
1.689TyrVal: 1.689 ± 0.338
0.253TyrTrp: 0.253 ± 0.146
1.774TyrTyr: 1.774 ± 0.385
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (11839 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski