Amino acid dipeptide frequency for Aeromonas phage AsXd-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
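As a rough illustration of what such a calculation could look like, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter sequence strings, and the choice of denominator (total number of dipeptides) are assumptions made for this example; the exact procedure used by Proteome-pI is not described on this page.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein strings.

    Hypothetical helper: sequences are one-letter-code strings, and the
    denominator is the total number of dipeptides (an assumption).
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real phage proteins.
toy = ["MAAK", "AAC"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"])
```

In the toy input the pairs are MA, AA, AK, AA and AC, so AA accounts for two of the five dipeptides.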

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
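The expected value under this uniform model, and the over/under comparison, can be sketched as follows. The helper names are hypothetical, and comparing against the bare expected value (ignoring the reported error) is an assumption; the page does not state the exact rule behind the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, alphabet_size=20):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


print(expected_per_mille())  # 20 x 20 = 400 dipeptides
print(classify(11.143))      # AlaAla value from the table
print(classify(0.168))       # CysCys value from the table
```

With 21 symbols (including Xaa) the same function gives 1000/441, roughly 2.27‰, instead of 2.5‰.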

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.143 ± 1.619
AlaCys: 1.089 ± 0.413
AlaAsp: 5.948 ± 0.75
AlaGlu: 4.859 ± 0.639
AlaPhe: 3.184 ± 0.553
AlaGly: 8.378 ± 0.868
AlaHis: 1.843 ± 0.56
AlaIle: 6.367 ± 0.576
AlaLys: 5.194 ± 0.678
AlaLeu: 9.048 ± 1.087
AlaMet: 2.765 ± 0.516
AlaAsn: 4.44 ± 0.741
AlaPro: 2.262 ± 0.414
AlaGln: 5.194 ± 0.888
AlaArg: 5.194 ± 0.763
AlaSer: 6.535 ± 0.998
AlaThr: 5.446 ± 0.913
AlaVal: 8.21 ± 1.137
AlaTrp: 1.676 ± 0.393
AlaTyr: 1.759 ± 0.47
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.503 ± 0.238
CysCys: 0.168 ± 0.12
CysAsp: 0.586 ± 0.225
CysGlu: 1.005 ± 0.324
CysPhe: 0.335 ± 0.166
CysGly: 1.424 ± 0.429
CysHis: 0.251 ± 0.154
CysIle: 0.503 ± 0.233
CysLys: 0.503 ± 0.208
CysLeu: 0.335 ± 0.175
CysMet: 0.419 ± 0.204
CysAsn: 0.67 ± 0.229
CysPro: 0.251 ± 0.164
CysGln: 0.335 ± 0.189
CysArg: 0.503 ± 0.17
CysSer: 0.67 ± 0.302
CysThr: 0.754 ± 0.233
CysVal: 0.67 ± 0.258
CysTrp: 0.335 ± 0.153
CysTyr: 0.168 ± 0.15
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.032 ± 0.755
AspCys: 0.419 ± 0.196
AspAsp: 3.854 ± 0.655
AspGlu: 3.938 ± 0.59
AspPhe: 1.592 ± 0.524
AspGly: 4.943 ± 0.841
AspHis: 1.005 ± 0.326
AspIle: 3.016 ± 0.503
AspLys: 2.849 ± 0.466
AspLeu: 5.529 ± 0.682
AspMet: 1.676 ± 0.504
AspAsn: 3.184 ± 0.392
AspPro: 1.843 ± 0.321
AspGln: 1.927 ± 0.344
AspArg: 3.184 ± 0.451
AspSer: 3.77 ± 0.619
AspThr: 2.262 ± 0.472
AspVal: 4.943 ± 0.809
AspTrp: 0.586 ± 0.207
AspTyr: 1.843 ± 0.382
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.116 ± 0.778
GluCys: 0.754 ± 0.243
GluAsp: 1.927 ± 0.442
GluGlu: 4.44 ± 0.898
GluPhe: 2.513 ± 0.321
GluGly: 3.1 ± 0.496
GluHis: 1.005 ± 0.336
GluIle: 4.189 ± 0.589
GluLys: 3.519 ± 0.523
GluLeu: 6.032 ± 0.872
GluMet: 1.424 ± 0.332
GluAsn: 2.765 ± 0.51
GluPro: 2.346 ± 0.457
GluGln: 3.686 ± 0.628
GluArg: 4.273 ± 0.586
GluSer: 3.351 ± 0.583
GluThr: 2.346 ± 0.458
GluVal: 4.021 ± 0.695
GluTrp: 0.67 ± 0.227
GluTyr: 1.759 ± 0.323
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.681 ± 0.488
PheCys: 0.168 ± 0.123
PheAsp: 1.927 ± 0.537
PheGlu: 2.262 ± 0.418
PhePhe: 1.34 ± 0.442
PheGly: 3.435 ± 0.611
PheHis: 0.251 ± 0.142
PheIle: 2.262 ± 0.563
PheLys: 1.34 ± 0.326
PheLeu: 1.508 ± 0.333
PheMet: 0.754 ± 0.246
PheAsn: 1.508 ± 0.328
PhePro: 1.759 ± 0.439
PheGln: 1.173 ± 0.3
PheArg: 2.346 ± 0.451
PheSer: 3.1 ± 0.497
PheThr: 2.178 ± 0.442
PheVal: 2.262 ± 0.573
PheTrp: 0.503 ± 0.184
PheTyr: 0.754 ± 0.235
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.954 ± 0.834
GlyCys: 0.586 ± 0.231
GlyAsp: 4.524 ± 0.726
GlyGlu: 4.273 ± 0.566
GlyPhe: 2.849 ± 0.421
GlyGly: 5.865 ± 0.585
GlyHis: 1.005 ± 0.328
GlyIle: 4.943 ± 0.563
GlyLys: 4.273 ± 0.597
GlyLeu: 7.121 ± 0.891
GlyMet: 2.43 ± 0.386
GlyAsn: 4.524 ± 0.741
GlyPro: 1.843 ± 0.451
GlyGln: 3.351 ± 0.48
GlyArg: 5.027 ± 0.547
GlySer: 4.859 ± 0.797
GlyThr: 5.529 ± 0.777
GlyVal: 5.613 ± 0.783
GlyTrp: 1.257 ± 0.292
GlyTyr: 2.095 ± 0.393
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.759 ± 0.478
HisCys: 0.0 ± 0.0
HisAsp: 0.922 ± 0.325
HisGlu: 1.508 ± 0.47
HisPhe: 0.419 ± 0.176
HisGly: 1.843 ± 0.48
HisHis: 0.503 ± 0.208
HisIle: 1.676 ± 0.349
HisLys: 0.67 ± 0.279
HisLeu: 1.257 ± 0.466
HisMet: 0.168 ± 0.113
HisAsn: 0.922 ± 0.301
HisPro: 1.34 ± 0.373
HisGln: 0.67 ± 0.268
HisArg: 1.089 ± 0.364
HisSer: 1.257 ± 0.303
HisThr: 0.754 ± 0.253
HisVal: 0.67 ± 0.282
HisTrp: 0.335 ± 0.193
HisTyr: 0.503 ± 0.172
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.535 ± 0.73
IleCys: 0.335 ± 0.163
IleAsp: 4.357 ± 0.424
IleGlu: 3.686 ± 0.585
IlePhe: 1.927 ± 0.397
IleGly: 2.849 ± 0.508
IleHis: 1.34 ± 0.418
IleIle: 3.519 ± 0.5
IleLys: 3.016 ± 0.481
IleLeu: 3.351 ± 0.688
IleMet: 1.005 ± 0.373
IleAsn: 2.932 ± 0.431
IlePro: 1.508 ± 0.325
IleGln: 3.016 ± 0.495
IleArg: 2.849 ± 0.393
IleSer: 4.524 ± 0.754
IleThr: 3.854 ± 0.659
IleVal: 2.513 ± 0.638
IleTrp: 1.005 ± 0.311
IleTyr: 1.843 ± 0.474
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.697 ± 0.826
LysCys: 0.168 ± 0.116
LysAsp: 3.1 ± 0.64
LysGlu: 2.597 ± 0.536
LysPhe: 0.922 ± 0.256
LysGly: 3.267 ± 0.434
LysHis: 1.173 ± 0.373
LysIle: 2.262 ± 0.425
LysLys: 3.267 ± 0.652
LysLeu: 3.016 ± 0.518
LysMet: 1.508 ± 0.371
LysAsn: 2.849 ± 0.517
LysPro: 2.178 ± 0.59
LysGln: 2.597 ± 0.418
LysArg: 3.267 ± 0.555
LysSer: 3.77 ± 0.615
LysThr: 3.686 ± 0.548
LysVal: 3.686 ± 0.473
LysTrp: 0.754 ± 0.268
LysTyr: 1.34 ± 0.335
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.97 ± 0.99
LeuCys: 1.592 ± 0.414
LeuAsp: 4.357 ± 0.59
LeuGlu: 3.686 ± 0.532
LeuPhe: 2.178 ± 0.337
LeuGly: 3.938 ± 0.524
LeuHis: 0.838 ± 0.292
LeuIle: 4.859 ± 0.561
LeuLys: 4.189 ± 0.6
LeuLeu: 5.362 ± 0.669
LeuMet: 1.676 ± 0.41
LeuAsn: 4.189 ± 0.648
LeuPro: 2.932 ± 0.445
LeuGln: 3.1 ± 0.591
LeuArg: 7.54 ± 0.734
LeuSer: 6.032 ± 0.896
LeuThr: 4.775 ± 0.717
LeuVal: 4.775 ± 0.688
LeuTrp: 0.754 ± 0.257
LeuTyr: 1.676 ± 0.44
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.765 ± 0.43
MetCys: 0.503 ± 0.184
MetAsp: 1.257 ± 0.355
MetGlu: 1.424 ± 0.32
MetPhe: 1.005 ± 0.384
MetGly: 1.592 ± 0.328
MetHis: 0.67 ± 0.277
MetIle: 0.586 ± 0.182
MetLys: 1.34 ± 0.356
MetLeu: 2.513 ± 0.49
MetMet: 0.586 ± 0.205
MetAsn: 1.257 ± 0.327
MetPro: 1.508 ± 0.322
MetGln: 1.173 ± 0.271
MetArg: 1.508 ± 0.331
MetSer: 1.676 ± 0.397
MetThr: 2.346 ± 0.517
MetVal: 0.67 ± 0.287
MetTrp: 0.251 ± 0.155
MetTyr: 0.922 ± 0.199
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.775 ± 0.578
AsnCys: 0.503 ± 0.208
AsnAsp: 2.765 ± 0.517
AsnGlu: 2.597 ± 0.469
AsnPhe: 1.424 ± 0.339
AsnGly: 5.697 ± 0.942
AsnHis: 1.089 ± 0.312
AsnIle: 2.262 ± 0.459
AsnLys: 2.262 ± 0.479
AsnLeu: 4.105 ± 0.795
AsnMet: 0.67 ± 0.249
AsnAsn: 2.011 ± 0.358
AsnPro: 2.932 ± 0.504
AsnGln: 1.927 ± 0.415
AsnArg: 2.513 ± 0.377
AsnSer: 3.1 ± 0.571
AsnThr: 2.765 ± 0.572
AsnVal: 2.765 ± 0.473
AsnTrp: 1.089 ± 0.257
AsnTyr: 0.838 ± 0.262
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.435 ± 0.565
ProCys: 0.586 ± 0.271
ProAsp: 2.765 ± 0.604
ProGlu: 2.765 ± 0.518
ProPhe: 1.089 ± 0.374
ProGly: 3.519 ± 0.52
ProHis: 0.838 ± 0.296
ProIle: 0.838 ± 0.283
ProLys: 2.011 ± 0.363
ProLeu: 3.016 ± 0.632
ProMet: 0.922 ± 0.34
ProAsn: 1.34 ± 0.397
ProPro: 1.257 ± 0.318
ProGln: 2.178 ± 0.414
ProArg: 1.508 ± 0.343
ProSer: 2.681 ± 0.436
ProThr: 1.927 ± 0.572
ProVal: 3.351 ± 0.515
ProTrp: 0.503 ± 0.219
ProTyr: 1.005 ± 0.328
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.948 ± 1.129
GlnCys: 0.419 ± 0.185
GlnAsp: 1.759 ± 0.341
GlnGlu: 1.927 ± 0.448
GlnPhe: 1.424 ± 0.321
GlnGly: 3.184 ± 0.686
GlnHis: 0.67 ± 0.227
GlnIle: 2.011 ± 0.317
GlnLys: 2.597 ± 0.552
GlnLeu: 3.77 ± 0.682
GlnMet: 1.257 ± 0.456
GlnAsn: 2.43 ± 0.553
GlnPro: 2.011 ± 0.462
GlnGln: 2.681 ± 0.508
GlnArg: 3.016 ± 0.525
GlnSer: 3.351 ± 0.417
GlnThr: 2.095 ± 0.447
GlnVal: 3.686 ± 0.636
GlnTrp: 0.251 ± 0.16
GlnTyr: 1.173 ± 0.407
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.357 ± 0.792
ArgCys: 0.838 ± 0.372
ArgAsp: 3.435 ± 0.533
ArgGlu: 4.189 ± 0.531
ArgPhe: 2.765 ± 0.424
ArgGly: 3.854 ± 0.507
ArgHis: 1.508 ± 0.376
ArgIle: 3.854 ± 0.703
ArgLys: 3.1 ± 0.653
ArgLeu: 6.116 ± 0.937
ArgMet: 1.676 ± 0.378
ArgAsn: 3.603 ± 0.475
ArgPro: 1.592 ± 0.419
ArgGln: 3.016 ± 0.502
ArgArg: 3.686 ± 0.515
ArgSer: 3.016 ± 0.437
ArgThr: 2.932 ± 0.399
ArgVal: 4.021 ± 0.654
ArgTrp: 1.34 ± 0.397
ArgTyr: 1.843 ± 0.404
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.535 ± 1.039
SerCys: 0.419 ± 0.247
SerAsp: 4.273 ± 0.742
SerGlu: 4.273 ± 0.587
SerPhe: 2.346 ± 0.464
SerGly: 7.959 ± 0.981
SerHis: 1.34 ± 0.338
SerIle: 3.184 ± 0.573
SerLys: 2.932 ± 0.662
SerLeu: 5.697 ± 0.695
SerMet: 1.759 ± 0.432
SerAsn: 2.43 ± 0.412
SerPro: 2.932 ± 0.55
SerGln: 2.765 ± 0.609
SerArg: 3.603 ± 0.747
SerSer: 5.027 ± 0.795
SerThr: 3.603 ± 0.639
SerVal: 4.859 ± 0.671
SerTrp: 1.508 ± 0.286
SerTyr: 1.592 ± 0.396
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.529 ± 0.883
ThrCys: 0.335 ± 0.273
ThrAsp: 3.77 ± 0.668
ThrGlu: 3.686 ± 0.657
ThrPhe: 2.932 ± 0.514
ThrGly: 6.702 ± 0.684
ThrHis: 0.586 ± 0.223
ThrIle: 3.854 ± 0.609
ThrLys: 1.592 ± 0.406
ThrLeu: 3.519 ± 0.49
ThrMet: 1.508 ± 0.282
ThrAsn: 1.592 ± 0.408
ThrPro: 2.597 ± 0.528
ThrGln: 2.262 ± 0.393
ThrArg: 2.597 ± 0.457
ThrSer: 3.686 ± 0.584
ThrThr: 3.1 ± 0.55
ThrVal: 4.775 ± 0.616
ThrTrp: 0.838 ± 0.244
ThrTyr: 1.257 ± 0.328
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.446 ± 0.553
ValCys: 0.754 ± 0.253
ValAsp: 4.608 ± 0.715
ValGlu: 4.357 ± 0.543
ValPhe: 2.095 ± 0.36
ValGly: 4.105 ± 0.648
ValHis: 1.508 ± 0.461
ValIle: 4.021 ± 0.541
ValLys: 4.189 ± 0.754
ValLeu: 4.943 ± 0.744
ValMet: 2.011 ± 0.402
ValAsn: 4.105 ± 0.65
ValPro: 3.016 ± 0.424
ValGln: 2.178 ± 0.334
ValArg: 3.184 ± 0.557
ValSer: 5.446 ± 0.603
ValThr: 4.44 ± 0.696
ValVal: 4.105 ± 0.607
ValTrp: 1.34 ± 0.323
ValTyr: 3.016 ± 0.451
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.005 ± 0.286
TrpCys: 0.503 ± 0.215
TrpAsp: 0.754 ± 0.226
TrpGlu: 1.005 ± 0.225
TrpPhe: 0.586 ± 0.241
TrpGly: 0.922 ± 0.277
TrpHis: 0.503 ± 0.197
TrpIle: 0.754 ± 0.21
TrpLys: 1.257 ± 0.339
TrpLeu: 0.754 ± 0.239
TrpMet: 0.754 ± 0.237
TrpAsn: 0.503 ± 0.171
TrpPro: 0.419 ± 0.146
TrpGln: 1.005 ± 0.272
TrpArg: 1.34 ± 0.308
TrpSer: 1.089 ± 0.328
TrpThr: 0.838 ± 0.242
TrpVal: 0.838 ± 0.225
TrpTrp: 0.251 ± 0.153
TrpTyr: 0.419 ± 0.147
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.351 ± 0.502
TyrCys: 0.251 ± 0.142
TyrAsp: 1.508 ± 0.303
TyrGlu: 1.759 ± 0.462
TyrPhe: 0.586 ± 0.255
TyrGly: 2.346 ± 0.449
TyrHis: 0.335 ± 0.159
TyrIle: 1.005 ± 0.32
TyrLys: 1.005 ± 0.235
TyrLeu: 1.424 ± 0.354
TyrMet: 0.503 ± 0.198
TyrAsn: 0.922 ± 0.342
TyrPro: 1.005 ± 0.273
TyrGln: 1.424 ± 0.31
TyrArg: 2.513 ± 0.474
TyrSer: 2.095 ± 0.541
TyrThr: 1.089 ± 0.321
TyrVal: 2.43 ± 0.562
TyrTrp: 0.251 ± 0.136
TyrTyr: 0.754 ± 0.278
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (11937 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
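A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread across replicates is taken as the error. The function names, the one-letter sequence strings, the fixed seed, and the use of the sample standard deviation are all assumptions; the note above only states that 100 bootstrap rounds at the protein level were used.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy = ["MAAK", "AAC", "MKKL", "ACAA"]
print(bootstrap_error(toy, "AA"))
```

A dipeptide that never occurs in any protein gets an error of 0.0 under this scheme, which matches the 0.0 ± 0.0 entries for Xaa in the table.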

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski