Amino acid dipeptide frequency for Xenohaliotis phage pCXc-HR2015

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
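The uniform baseline described above can be sketched in a few lines. This is only an illustration: the function names are hypothetical, and it assumes that "expected" means the uniform value 1/N² for an alphabet of N residues, which is how the text above defines it.

```python
def expected_uniform_per_mille(alphabet_size: int = 20) -> float:
    """Frequency (in per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille: float, expected_per_mille: float) -> str:
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"    # would be marked red in the table
    if observed_per_mille < expected_per_mille:
        return "under"   # would be marked blue in the table
    return "as expected"


baseline = expected_uniform_per_mille(20)   # 400 dipeptides -> 2.5 per mille
print(baseline)
print(classify(10.305, baseline))  # LeuLeu value from the table
print(classify(0.583, baseline))   # AlaCys value from the table
```

With 21 residue types (Xaa included) the same function gives 1000/441, roughly 2.27‰.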

All values are given in per mille (‰); to obtain fractions, multiply them by 10^-3.
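As a rough sketch of how such per mille values could be obtained, the snippet below counts overlapping residue pairs within each protein and scales the result by 1000. It assumes one-letter sequence strings, never pairs residues across protein boundaries, and normalises by the total number of counted pairs. The page does not state how the database itself normalises, so treat both the function names and the normalisation as assumptions.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 inside a single protein.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


freqs = dipeptide_per_mille(["AAC", "AC"])  # pairs: AA, AC, AC
print(freqs)
print(per_mille_to_fraction(8.555))  # AlaAla value from the table
```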

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.555 ± 1.172
AlaCys: 0.583 ± 0.329
AlaAsp: 5.542 ± 1.302
AlaGlu: 3.889 ± 0.668
AlaPhe: 1.944 ± 0.377
AlaGly: 5.347 ± 0.805
AlaHis: 1.069 ± 0.381
AlaIle: 4.472 ± 0.587
AlaLys: 5.055 ± 0.679
AlaLeu: 7.291 ± 0.846
AlaMet: 1.556 ± 0.421
AlaAsn: 5.153 ± 0.635
AlaPro: 2.625 ± 0.584
AlaGln: 3.111 ± 0.639
AlaArg: 3.208 ± 0.671
AlaSer: 6.125 ± 0.843
AlaThr: 6.125 ± 1.209
AlaVal: 5.444 ± 0.823
AlaTrp: 0.486 ± 0.191
AlaTyr: 2.43 ± 0.387
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.389 ± 0.273
CysCys: 0.194 ± 0.125
CysAsp: 0.292 ± 0.177
CysGlu: 0.681 ± 0.269
CysPhe: 0.583 ± 0.237
CysGly: 0.486 ± 0.202
CysHis: 0.097 ± 0.109
CysIle: 0.389 ± 0.222
CysLys: 0.486 ± 0.21
CysLeu: 0.875 ± 0.258
CysMet: 0.194 ± 0.181
CysAsn: 0.389 ± 0.313
CysPro: 0.194 ± 0.13
CysGln: 0.486 ± 0.219
CysArg: 0.292 ± 0.193
CysSer: 0.583 ± 0.248
CysThr: 0.389 ± 0.218
CysVal: 0.097 ± 0.095
CysTrp: 0.0 ± 0.0
CysTyr: 1.069 ± 0.376
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.958 ± 0.857
AspCys: 0.875 ± 0.305
AspAsp: 3.403 ± 0.474
AspGlu: 2.43 ± 0.43
AspPhe: 4.764 ± 0.65
AspGly: 5.444 ± 1.126
AspHis: 0.681 ± 0.263
AspIle: 4.861 ± 0.643
AspLys: 2.528 ± 0.448
AspLeu: 3.694 ± 0.678
AspMet: 1.361 ± 0.449
AspAsn: 2.43 ± 0.442
AspPro: 2.528 ± 0.57
AspGln: 1.556 ± 0.398
AspArg: 2.43 ± 0.485
AspSer: 4.375 ± 0.928
AspThr: 3.111 ± 0.691
AspVal: 4.667 ± 0.818
AspTrp: 0.486 ± 0.291
AspTyr: 2.917 ± 0.464
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.597 ± 0.699
GluCys: 0.681 ± 0.276
GluAsp: 3.111 ± 0.556
GluGlu: 4.18 ± 1.079
GluPhe: 2.917 ± 0.506
GluGly: 2.139 ± 0.465
GluHis: 1.264 ± 0.444
GluIle: 4.278 ± 0.673
GluLys: 4.18 ± 0.636
GluLeu: 5.153 ± 0.601
GluMet: 1.653 ± 0.404
GluAsn: 3.597 ± 0.595
GluPro: 2.042 ± 0.488
GluGln: 2.625 ± 0.566
GluArg: 2.819 ± 0.465
GluSer: 2.917 ± 0.632
GluThr: 2.042 ± 0.406
GluVal: 2.43 ± 0.513
GluTrp: 0.583 ± 0.204
GluTyr: 1.361 ± 0.377
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.625 ± 0.568
PheCys: 0.292 ± 0.166
PheAsp: 2.236 ± 0.543
PheGlu: 1.458 ± 0.305
PhePhe: 3.014 ± 0.585
PheGly: 2.236 ± 0.53
PheHis: 0.875 ± 0.306
PheIle: 3.889 ± 0.626
PheLys: 3.111 ± 0.508
PheLeu: 4.472 ± 0.775
PheMet: 1.069 ± 0.315
PheAsn: 2.722 ± 0.476
PhePro: 1.653 ± 0.449
PheGln: 1.264 ± 0.336
PheArg: 1.847 ± 0.49
PheSer: 3.986 ± 0.483
PheThr: 2.819 ± 0.542
PheVal: 2.333 ± 0.471
PheTrp: 0.194 ± 0.14
PheTyr: 2.528 ± 0.731
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.542 ± 0.892
GlyCys: 0.681 ± 0.317
GlyAsp: 4.278 ± 0.611
GlyGlu: 2.236 ± 0.448
GlyPhe: 2.819 ± 0.516
GlyGly: 3.792 ± 0.586
GlyHis: 0.681 ± 0.311
GlyIle: 5.444 ± 0.603
GlyLys: 3.5 ± 0.56
GlyLeu: 4.375 ± 0.531
GlyMet: 1.458 ± 0.326
GlyAsn: 3.694 ± 0.677
GlyPro: 0.486 ± 0.216
GlyGln: 1.847 ± 0.426
GlyArg: 2.625 ± 0.618
GlySer: 4.667 ± 0.508
GlyThr: 5.833 ± 0.968
GlyVal: 4.958 ± 0.961
GlyTrp: 0.389 ± 0.171
GlyTyr: 2.819 ± 0.52
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.583 ± 0.261
HisCys: 0.389 ± 0.21
HisAsp: 0.875 ± 0.369
HisGlu: 0.583 ± 0.25
HisPhe: 0.486 ± 0.226
HisGly: 0.486 ± 0.255
HisHis: 0.292 ± 0.168
HisIle: 1.069 ± 0.282
HisLys: 0.972 ± 0.393
HisLeu: 1.653 ± 0.483
HisMet: 0.292 ± 0.188
HisAsn: 0.875 ± 0.304
HisPro: 0.583 ± 0.241
HisGln: 0.583 ± 0.193
HisArg: 0.778 ± 0.29
HisSer: 1.361 ± 0.406
HisThr: 0.486 ± 0.231
HisVal: 0.972 ± 0.276
HisTrp: 0.194 ± 0.151
HisTyr: 0.681 ± 0.294
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.736 ± 0.717
IleCys: 0.681 ± 0.255
IleAsp: 4.278 ± 0.683
IleGlu: 4.472 ± 0.728
IlePhe: 2.43 ± 0.437
IleGly: 3.014 ± 0.428
IleHis: 1.458 ± 0.516
IleIle: 4.569 ± 0.682
IleLys: 4.278 ± 0.644
IleLeu: 5.542 ± 0.674
IleMet: 1.167 ± 0.28
IleAsn: 3.792 ± 0.566
IlePro: 3.305 ± 0.693
IleGln: 2.042 ± 0.359
IleArg: 3.986 ± 0.632
IleSer: 6.708 ± 0.828
IleThr: 5.93 ± 0.721
IleVal: 6.514 ± 1.035
IleTrp: 0.194 ± 0.124
IleTyr: 3.305 ± 0.456
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.986 ± 0.556
LysCys: 0.194 ± 0.14
LysAsp: 3.597 ± 0.489
LysGlu: 3.986 ± 0.825
LysPhe: 2.625 ± 0.543
LysGly: 2.43 ± 0.509
LysHis: 1.167 ± 0.387
LysIle: 5.153 ± 0.627
LysLys: 6.125 ± 1.096
LysLeu: 5.542 ± 0.85
LysMet: 1.653 ± 0.528
LysAsn: 5.055 ± 0.919
LysPro: 2.528 ± 0.691
LysGln: 1.556 ± 0.433
LysArg: 2.528 ± 0.39
LysSer: 3.986 ± 0.526
LysThr: 4.375 ± 0.939
LysVal: 2.917 ± 0.698
LysTrp: 0.292 ± 0.151
LysTyr: 3.111 ± 0.482
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.611 ± 0.748
LeuCys: 0.681 ± 0.288
LeuAsp: 5.153 ± 0.757
LeuGlu: 5.444 ± 0.85
LeuPhe: 4.083 ± 0.577
LeuGly: 4.083 ± 0.656
LeuHis: 1.458 ± 0.295
LeuIle: 5.25 ± 0.803
LeuLys: 6.125 ± 0.703
LeuLeu: 10.305 ± 1.109
LeuMet: 3.014 ± 0.739
LeuAsn: 4.667 ± 0.634
LeuPro: 3.014 ± 0.56
LeuGln: 2.625 ± 0.377
LeuArg: 3.694 ± 0.679
LeuSer: 7.583 ± 1.195
LeuThr: 5.444 ± 0.63
LeuVal: 4.472 ± 0.492
LeuTrp: 0.583 ± 0.184
LeuTyr: 2.625 ± 0.557
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.236 ± 0.597
MetCys: 0.486 ± 0.231
MetAsp: 1.361 ± 0.33
MetGlu: 0.875 ± 0.499
MetPhe: 0.583 ± 0.188
MetGly: 1.264 ± 0.415
MetHis: 0.389 ± 0.171
MetIle: 1.361 ± 0.312
MetLys: 1.167 ± 0.411
MetLeu: 2.917 ± 0.599
MetMet: 0.486 ± 0.161
MetAsn: 1.069 ± 0.387
MetPro: 0.778 ± 0.205
MetGln: 0.972 ± 0.296
MetArg: 0.875 ± 0.288
MetSer: 2.819 ± 0.538
MetThr: 1.361 ± 0.358
MetVal: 1.847 ± 0.482
MetTrp: 0.0 ± 0.0
MetTyr: 0.486 ± 0.204
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.444 ± 0.924
AsnCys: 0.194 ± 0.122
AsnAsp: 3.792 ± 0.524
AsnGlu: 3.597 ± 0.439
AsnPhe: 2.625 ± 0.632
AsnGly: 4.472 ± 0.79
AsnHis: 0.875 ± 0.265
AsnIle: 6.416 ± 0.811
AsnLys: 2.819 ± 0.556
AsnLeu: 3.986 ± 0.602
AsnMet: 1.556 ± 0.442
AsnAsn: 3.889 ± 0.824
AsnPro: 2.236 ± 0.541
AsnGln: 2.139 ± 0.474
AsnArg: 1.653 ± 0.389
AsnSer: 3.014 ± 0.606
AsnThr: 3.597 ± 0.523
AsnVal: 2.917 ± 0.509
AsnTrp: 0.583 ± 0.186
AsnTyr: 2.625 ± 0.625
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.014 ± 0.695
ProCys: 0.194 ± 0.163
ProAsp: 2.139 ± 0.331
ProGlu: 3.014 ± 0.471
ProPhe: 0.972 ± 0.273
ProGly: 1.361 ± 0.522
ProHis: 0.097 ± 0.094
ProIle: 2.528 ± 0.498
ProLys: 1.556 ± 0.38
ProLeu: 3.208 ± 0.695
ProMet: 0.875 ± 0.29
ProAsn: 2.236 ± 0.61
ProPro: 1.556 ± 0.376
ProGln: 1.361 ± 0.456
ProArg: 1.361 ± 0.343
ProSer: 3.694 ± 0.589
ProThr: 2.139 ± 0.413
ProVal: 3.305 ± 0.667
ProTrp: 0.097 ± 0.102
ProTyr: 1.847 ± 0.443
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.722 ± 0.699
GlnCys: 0.194 ± 0.139
GlnAsp: 1.264 ± 0.25
GlnGlu: 2.139 ± 0.402
GlnPhe: 0.875 ± 0.29
GlnGly: 1.653 ± 0.368
GlnHis: 0.583 ± 0.295
GlnIle: 2.722 ± 0.594
GlnLys: 2.333 ± 0.449
GlnLeu: 3.305 ± 0.597
GlnMet: 0.389 ± 0.186
GlnAsn: 2.042 ± 0.522
GlnPro: 1.069 ± 0.319
GlnGln: 2.528 ± 0.558
GlnArg: 1.75 ± 0.423
GlnSer: 2.333 ± 0.386
GlnThr: 2.333 ± 0.552
GlnVal: 1.458 ± 0.345
GlnTrp: 0.097 ± 0.087
GlnTyr: 1.458 ± 0.374
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.819 ± 0.5
ArgCys: 0.194 ± 0.145
ArgAsp: 2.43 ± 0.562
ArgGlu: 2.528 ± 0.526
ArgPhe: 1.75 ± 0.507
ArgGly: 2.917 ± 0.521
ArgHis: 0.972 ± 0.341
ArgIle: 3.014 ± 0.681
ArgLys: 2.43 ± 0.529
ArgLeu: 3.792 ± 0.52
ArgMet: 1.264 ± 0.385
ArgAsn: 2.917 ± 0.514
ArgPro: 1.167 ± 0.459
ArgGln: 1.264 ± 0.344
ArgArg: 1.458 ± 0.532
ArgSer: 3.208 ± 0.729
ArgThr: 2.819 ± 0.518
ArgVal: 3.208 ± 0.675
ArgTrp: 0.194 ± 0.126
ArgTyr: 1.847 ± 0.479
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.514 ± 0.755
SerCys: 0.292 ± 0.164
SerAsp: 4.472 ± 0.714
SerGlu: 3.111 ± 0.508
SerPhe: 4.569 ± 0.71
SerGly: 6.805 ± 0.655
SerHis: 0.681 ± 0.21
SerIle: 5.25 ± 0.623
SerLys: 4.18 ± 0.573
SerLeu: 6.319 ± 0.792
SerMet: 1.75 ± 0.338
SerAsn: 4.861 ± 0.718
SerPro: 2.722 ± 0.605
SerGln: 2.236 ± 0.47
SerArg: 3.889 ± 0.669
SerSer: 7.583 ± 0.85
SerThr: 6.222 ± 0.9
SerVal: 5.055 ± 0.778
SerTrp: 0.486 ± 0.197
SerTyr: 3.403 ± 0.76
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.222 ± 0.889
ThrCys: 0.097 ± 0.079
ThrAsp: 3.889 ± 1.15
ThrGlu: 3.208 ± 0.437
ThrPhe: 2.236 ± 0.394
ThrGly: 6.319 ± 0.828
ThrHis: 0.0 ± 0.0
ThrIle: 5.444 ± 1.057
ThrLys: 4.375 ± 0.609
ThrLeu: 5.153 ± 0.621
ThrMet: 0.875 ± 0.29
ThrAsn: 4.18 ± 0.681
ThrPro: 3.014 ± 0.416
ThrGln: 2.139 ± 0.387
ThrArg: 2.139 ± 0.577
ThrSer: 5.542 ± 0.751
ThrThr: 5.347 ± 0.814
ThrVal: 4.472 ± 0.925
ThrTrp: 0.292 ± 0.159
ThrTyr: 2.528 ± 0.443
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.347 ± 0.646
ValCys: 0.875 ± 0.406
ValAsp: 5.25 ± 0.625
ValGlu: 3.208 ± 0.616
ValPhe: 3.014 ± 0.637
ValGly: 4.958 ± 0.701
ValHis: 0.389 ± 0.223
ValIle: 4.18 ± 0.494
ValLys: 3.597 ± 0.62
ValLeu: 4.472 ± 0.574
ValMet: 1.847 ± 0.362
ValAsn: 2.819 ± 0.586
ValPro: 2.819 ± 0.503
ValGln: 1.847 ± 0.463
ValArg: 2.528 ± 0.557
ValSer: 5.347 ± 0.693
ValThr: 3.792 ± 0.814
ValVal: 5.736 ± 0.915
ValTrp: 0.583 ± 0.256
ValTyr: 2.236 ± 0.482
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.292 ± 0.149
TrpCys: 0.0 ± 0.0
TrpAsp: 0.389 ± 0.143
TrpGlu: 0.292 ± 0.155
TrpPhe: 0.389 ± 0.167
TrpGly: 0.194 ± 0.111
TrpHis: 0.194 ± 0.135
TrpIle: 0.292 ± 0.177
TrpLys: 0.681 ± 0.2
TrpLeu: 0.486 ± 0.171
TrpMet: 0.097 ± 0.089
TrpAsn: 0.194 ± 0.14
TrpPro: 0.0 ± 0.0
TrpGln: 0.292 ± 0.144
TrpArg: 0.389 ± 0.248
TrpSer: 0.486 ± 0.202
TrpThr: 0.486 ± 0.243
TrpVal: 0.292 ± 0.136
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.583 ± 0.283
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.819 ± 0.575
TyrCys: 0.486 ± 0.236
TyrAsp: 1.75 ± 0.523
TyrGlu: 2.236 ± 0.58
TyrPhe: 1.944 ± 0.529
TyrGly: 2.722 ± 0.548
TyrHis: 0.972 ± 0.346
TyrIle: 2.819 ± 0.471
TyrLys: 3.305 ± 0.763
TyrLeu: 4.375 ± 0.587
TyrMet: 0.681 ± 0.222
TyrAsn: 1.847 ± 0.484
TyrPro: 2.236 ± 0.399
TyrGln: 0.778 ± 0.284
TyrArg: 1.944 ± 0.407
TyrSer: 4.083 ± 0.747
TyrThr: 2.819 ± 0.473
TyrVal: 1.75 ± 0.33
TyrTrp: 0.292 ± 0.208
TyrTyr: 2.333 ± 0.526
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 33 proteins (10287 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
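One plausible reading of "bootstrapping (x100) at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resampled proteomes, and the standard deviation of those estimates is reported as the error. The page does not give the exact procedure, so the function names, the seed, the one-letter sequences and the use of the sample standard deviation are all assumptions made for illustration.

```python
import random
import statistics


def frequency_per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide, counting overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


toy_proteome = ["MAAL", "MLLK", "MAALLA", "MKKA"]  # hypothetical sequences
print(frequency_per_mille(toy_proteome, "LL"))
print(bootstrap_error(toy_proteome, "LL"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much single proteins drive a given dipeptide count.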

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski