Amino acid dipepetide frequency for Actinomyces phage xhp1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.865AlaAla: 17.865 ± 1.848
1.013AlaCys: 1.013 ± 0.284
6.815AlaAsp: 6.815 ± 0.899
7.643AlaGlu: 7.643 ± 0.823
3.315AlaPhe: 3.315 ± 0.564
10.59AlaGly: 10.59 ± 0.99
2.394AlaHis: 2.394 ± 0.411
5.157AlaIle: 5.157 ± 0.854
3.499AlaLys: 3.499 ± 0.54
11.235AlaLeu: 11.235 ± 1.349
2.302AlaMet: 2.302 ± 0.387
2.118AlaAsn: 2.118 ± 0.569
6.723AlaPro: 6.723 ± 1.108
4.42AlaGln: 4.42 ± 0.645
9.761AlaArg: 9.761 ± 0.951
9.117AlaSer: 9.117 ± 0.85
8.288AlaThr: 8.288 ± 1.245
9.669AlaVal: 9.669 ± 0.825
2.671AlaTrp: 2.671 ± 0.615
2.855AlaTyr: 2.855 ± 0.472
0.0AlaXaa: 0.0 ± 0.0
Cys
1.658CysAla: 1.658 ± 0.411
0.092CysCys: 0.092 ± 0.086
0.645CysAsp: 0.645 ± 0.296
0.553CysGlu: 0.553 ± 0.247
0.276CysPhe: 0.276 ± 0.154
0.645CysGly: 0.645 ± 0.255
0.184CysHis: 0.184 ± 0.143
0.276CysIle: 0.276 ± 0.161
0.092CysLys: 0.092 ± 0.099
0.368CysLeu: 0.368 ± 0.225
0.184CysMet: 0.184 ± 0.123
0.092CysAsn: 0.092 ± 0.105
0.645CysPro: 0.645 ± 0.236
0.184CysGln: 0.184 ± 0.145
0.645CysArg: 0.645 ± 0.227
0.645CysSer: 0.645 ± 0.236
0.553CysThr: 0.553 ± 0.224
0.553CysVal: 0.553 ± 0.22
0.184CysTrp: 0.184 ± 0.123
0.276CysTyr: 0.276 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
6.538AspAla: 6.538 ± 0.83
0.645AspCys: 0.645 ± 0.246
2.947AspAsp: 2.947 ± 0.564
5.341AspGlu: 5.341 ± 0.909
1.75AspPhe: 1.75 ± 0.357
6.815AspGly: 6.815 ± 0.783
0.921AspHis: 0.921 ± 0.308
1.566AspIle: 1.566 ± 0.335
2.579AspLys: 2.579 ± 0.508
5.894AspLeu: 5.894 ± 0.765
1.013AspMet: 1.013 ± 0.256
0.829AspAsn: 0.829 ± 0.28
3.868AspPro: 3.868 ± 0.83
2.302AspGln: 2.302 ± 0.489
3.96AspArg: 3.96 ± 0.527
2.21AspSer: 2.21 ± 0.462
3.131AspThr: 3.131 ± 0.537
4.512AspVal: 4.512 ± 0.562
1.658AspTrp: 1.658 ± 0.366
0.921AspTyr: 0.921 ± 0.224
0.0AspXaa: 0.0 ± 0.0
Glu
8.841GluAla: 8.841 ± 1.202
0.368GluCys: 0.368 ± 0.164
3.591GluAsp: 3.591 ± 0.661
4.052GluGlu: 4.052 ± 0.666
1.75GluPhe: 1.75 ± 0.327
5.617GluGly: 5.617 ± 0.575
1.197GluHis: 1.197 ± 0.319
3.131GluIle: 3.131 ± 0.607
1.934GluLys: 1.934 ± 0.46
3.96GluLeu: 3.96 ± 0.787
1.473GluMet: 1.473 ± 0.311
1.842GluAsn: 1.842 ± 0.33
2.671GluPro: 2.671 ± 0.45
3.591GluGln: 3.591 ± 0.671
5.065GluArg: 5.065 ± 1.162
2.947GluSer: 2.947 ± 0.507
3.499GluThr: 3.499 ± 0.637
4.236GluVal: 4.236 ± 0.663
1.105GluTrp: 1.105 ± 0.28
1.105GluTyr: 1.105 ± 0.305
0.0GluXaa: 0.0 ± 0.0
Phe
2.486PheAla: 2.486 ± 0.413
0.368PheCys: 0.368 ± 0.189
2.579PheAsp: 2.579 ± 0.494
1.842PheGlu: 1.842 ± 0.342
0.645PhePhe: 0.645 ± 0.284
2.026PheGly: 2.026 ± 0.447
0.276PheHis: 0.276 ± 0.14
0.46PheIle: 0.46 ± 0.216
1.197PheLys: 1.197 ± 0.222
2.579PheLeu: 2.579 ± 0.543
0.46PheMet: 0.46 ± 0.18
0.368PheAsn: 0.368 ± 0.199
1.013PhePro: 1.013 ± 0.286
0.46PheGln: 0.46 ± 0.189
1.842PheArg: 1.842 ± 0.507
0.921PheSer: 0.921 ± 0.376
1.842PheThr: 1.842 ± 0.415
1.658PheVal: 1.658 ± 0.373
0.645PheTrp: 0.645 ± 0.288
0.553PheTyr: 0.553 ± 0.199
0.0PheXaa: 0.0 ± 0.0
Gly
9.117GlyAla: 9.117 ± 0.881
0.645GlyCys: 0.645 ± 0.24
4.236GlyAsp: 4.236 ± 0.695
5.065GlyGlu: 5.065 ± 0.619
3.039GlyPhe: 3.039 ± 0.613
7.275GlyGly: 7.275 ± 1.011
1.842GlyHis: 1.842 ± 0.412
5.249GlyIle: 5.249 ± 1.027
3.315GlyLys: 3.315 ± 0.496
8.196GlyLeu: 8.196 ± 1.019
1.013GlyMet: 1.013 ± 0.261
1.566GlyAsn: 1.566 ± 0.464
3.499GlyPro: 3.499 ± 0.533
2.855GlyGln: 2.855 ± 0.499
5.986GlyArg: 5.986 ± 0.751
5.157GlySer: 5.157 ± 0.764
6.354GlyThr: 6.354 ± 1.045
5.894GlyVal: 5.894 ± 0.659
2.486GlyTrp: 2.486 ± 0.515
2.118GlyTyr: 2.118 ± 0.439
0.0GlyXaa: 0.0 ± 0.0
His
1.658HisAla: 1.658 ± 0.348
0.092HisCys: 0.092 ± 0.098
1.197HisAsp: 1.197 ± 0.37
1.013HisGlu: 1.013 ± 0.346
0.46HisPhe: 0.46 ± 0.206
1.013HisGly: 1.013 ± 0.284
0.184HisHis: 0.184 ± 0.138
1.013HisIle: 1.013 ± 0.371
0.645HisLys: 0.645 ± 0.23
1.566HisLeu: 1.566 ± 0.38
0.276HisMet: 0.276 ± 0.136
0.276HisAsn: 0.276 ± 0.147
0.921HisPro: 0.921 ± 0.31
0.46HisGln: 0.46 ± 0.208
1.289HisArg: 1.289 ± 0.359
0.368HisSer: 0.368 ± 0.187
1.289HisThr: 1.289 ± 0.36
2.026HisVal: 2.026 ± 0.397
0.184HisTrp: 0.184 ± 0.136
0.368HisTyr: 0.368 ± 0.186
0.0HisXaa: 0.0 ± 0.0
Ile
4.512IleAla: 4.512 ± 0.624
0.276IleCys: 0.276 ± 0.153
3.868IleAsp: 3.868 ± 0.541
3.96IleGlu: 3.96 ± 0.56
0.829IlePhe: 0.829 ± 0.259
3.591IleGly: 3.591 ± 0.59
0.921IleHis: 0.921 ± 0.284
1.105IleIle: 1.105 ± 0.309
2.394IleLys: 2.394 ± 0.583
1.658IleLeu: 1.658 ± 0.365
0.184IleMet: 0.184 ± 0.114
1.473IleAsn: 1.473 ± 0.322
2.855IlePro: 2.855 ± 0.409
1.013IleGln: 1.013 ± 0.323
2.394IleArg: 2.394 ± 0.409
2.763IleSer: 2.763 ± 0.658
3.868IleThr: 3.868 ± 0.725
2.579IleVal: 2.579 ± 0.477
1.197IleTrp: 1.197 ± 0.325
1.289IleTyr: 1.289 ± 0.32
0.0IleXaa: 0.0 ± 0.0
Lys
5.525LysAla: 5.525 ± 0.86
0.092LysCys: 0.092 ± 0.087
1.473LysAsp: 1.473 ± 0.316
1.105LysGlu: 1.105 ± 0.303
1.658LysPhe: 1.658 ± 0.386
2.763LysGly: 2.763 ± 0.345
0.737LysHis: 0.737 ± 0.205
1.658LysIle: 1.658 ± 0.398
1.75LysLys: 1.75 ± 0.427
2.671LysLeu: 2.671 ± 0.524
1.013LysMet: 1.013 ± 0.354
0.553LysAsn: 0.553 ± 0.233
3.039LysPro: 3.039 ± 0.567
1.289LysGln: 1.289 ± 0.26
2.118LysArg: 2.118 ± 0.56
1.934LysSer: 1.934 ± 0.432
3.407LysThr: 3.407 ± 0.65
2.579LysVal: 2.579 ± 0.455
0.645LysTrp: 0.645 ± 0.232
0.921LysTyr: 0.921 ± 0.254
0.0LysXaa: 0.0 ± 0.0
Leu
11.695LeuAla: 11.695 ± 1.094
0.553LeuCys: 0.553 ± 0.232
5.249LeuAsp: 5.249 ± 0.581
3.684LeuGlu: 3.684 ± 0.687
1.473LeuPhe: 1.473 ± 0.373
5.617LeuGly: 5.617 ± 0.595
1.842LeuHis: 1.842 ± 0.44
3.131LeuIle: 3.131 ± 0.517
2.302LeuLys: 2.302 ± 0.489
7.275LeuLeu: 7.275 ± 0.881
2.394LeuMet: 2.394 ± 0.427
1.658LeuAsn: 1.658 ± 0.34
4.512LeuPro: 4.512 ± 0.587
2.394LeuGln: 2.394 ± 0.363
5.802LeuArg: 5.802 ± 0.836
5.157LeuSer: 5.157 ± 0.546
7.92LeuThr: 7.92 ± 1.017
6.17LeuVal: 6.17 ± 0.577
0.921LeuTrp: 0.921 ± 0.418
2.486LeuTyr: 2.486 ± 0.434
0.0LeuXaa: 0.0 ± 0.0
Met
2.026MetAla: 2.026 ± 0.427
0.276MetCys: 0.276 ± 0.149
1.105MetAsp: 1.105 ± 0.302
0.737MetGlu: 0.737 ± 0.272
0.276MetPhe: 0.276 ± 0.136
1.658MetGly: 1.658 ± 0.325
0.276MetHis: 0.276 ± 0.165
1.013MetIle: 1.013 ± 0.294
1.013MetLys: 1.013 ± 0.323
1.105MetLeu: 1.105 ± 0.298
0.092MetMet: 0.092 ± 0.079
0.46MetAsn: 0.46 ± 0.226
1.289MetPro: 1.289 ± 0.337
0.553MetGln: 0.553 ± 0.224
1.473MetArg: 1.473 ± 0.358
2.026MetSer: 2.026 ± 0.404
3.131MetThr: 3.131 ± 0.481
0.553MetVal: 0.553 ± 0.185
0.276MetTrp: 0.276 ± 0.166
0.184MetTyr: 0.184 ± 0.117
0.0MetXaa: 0.0 ± 0.0
Asn
3.591AsnAla: 3.591 ± 0.727
0.184AsnCys: 0.184 ± 0.145
0.921AsnAsp: 0.921 ± 0.292
1.197AsnGlu: 1.197 ± 0.297
1.013AsnPhe: 1.013 ± 0.284
1.842AsnGly: 1.842 ± 0.347
0.276AsnHis: 0.276 ± 0.154
0.368AsnIle: 0.368 ± 0.171
0.645AsnLys: 0.645 ± 0.251
1.934AsnLeu: 1.934 ± 0.455
0.092AsnMet: 0.092 ± 0.088
0.46AsnAsn: 0.46 ± 0.207
1.105AsnPro: 1.105 ± 0.392
0.921AsnGln: 0.921 ± 0.246
1.197AsnArg: 1.197 ± 0.357
0.553AsnSer: 0.553 ± 0.203
1.289AsnThr: 1.289 ± 0.429
2.118AsnVal: 2.118 ± 0.42
0.368AsnTrp: 0.368 ± 0.212
0.368AsnTyr: 0.368 ± 0.159
0.0AsnXaa: 0.0 ± 0.0
Pro
8.841ProAla: 8.841 ± 1.007
0.184ProCys: 0.184 ± 0.138
3.591ProAsp: 3.591 ± 0.384
2.947ProGlu: 2.947 ± 0.574
1.473ProPhe: 1.473 ± 0.379
5.71ProGly: 5.71 ± 0.803
0.829ProHis: 0.829 ± 0.29
1.75ProIle: 1.75 ± 0.393
2.671ProLys: 2.671 ± 0.602
4.052ProLeu: 4.052 ± 0.489
1.566ProMet: 1.566 ± 0.344
1.197ProAsn: 1.197 ± 0.294
2.486ProPro: 2.486 ± 0.407
2.118ProGln: 2.118 ± 0.471
3.039ProArg: 3.039 ± 0.585
3.407ProSer: 3.407 ± 0.566
4.789ProThr: 4.789 ± 0.892
5.341ProVal: 5.341 ± 0.81
1.197ProTrp: 1.197 ± 0.278
1.013ProTyr: 1.013 ± 0.258
0.0ProXaa: 0.0 ± 0.0
Gln
5.433GlnAla: 5.433 ± 0.801
0.553GlnCys: 0.553 ± 0.232
1.566GlnAsp: 1.566 ± 0.413
1.381GlnGlu: 1.381 ± 0.301
0.553GlnPhe: 0.553 ± 0.213
2.026GlnGly: 2.026 ± 0.513
0.553GlnHis: 0.553 ± 0.21
2.21GlnIle: 2.21 ± 0.419
1.105GlnLys: 1.105 ± 0.283
3.315GlnLeu: 3.315 ± 0.591
0.737GlnMet: 0.737 ± 0.247
0.737GlnAsn: 0.737 ± 0.288
2.763GlnPro: 2.763 ± 0.476
1.473GlnGln: 1.473 ± 0.377
3.131GlnArg: 3.131 ± 0.597
1.842GlnSer: 1.842 ± 0.459
2.579GlnThr: 2.579 ± 0.525
4.236GlnVal: 4.236 ± 0.598
0.46GlnTrp: 0.46 ± 0.202
0.368GlnTyr: 0.368 ± 0.22
0.0GlnXaa: 0.0 ± 0.0
Arg
8.564ArgAla: 8.564 ± 0.844
0.921ArgCys: 0.921 ± 0.346
4.328ArgAsp: 4.328 ± 0.712
4.052ArgGlu: 4.052 ± 0.735
1.381ArgPhe: 1.381 ± 0.353
4.881ArgGly: 4.881 ± 0.712
0.829ArgHis: 0.829 ± 0.311
3.223ArgIle: 3.223 ± 0.467
2.21ArgLys: 2.21 ± 0.572
7.643ArgLeu: 7.643 ± 0.841
1.473ArgMet: 1.473 ± 0.399
1.197ArgAsn: 1.197 ± 0.324
3.315ArgPro: 3.315 ± 0.431
2.947ArgGln: 2.947 ± 0.419
7.459ArgArg: 7.459 ± 1.171
3.591ArgSer: 3.591 ± 0.557
4.236ArgThr: 4.236 ± 0.824
4.697ArgVal: 4.697 ± 0.856
1.658ArgTrp: 1.658 ± 0.332
1.381ArgTyr: 1.381 ± 0.397
0.0ArgXaa: 0.0 ± 0.0
Ser
8.104SerAla: 8.104 ± 0.965
0.368SerCys: 0.368 ± 0.168
2.855SerAsp: 2.855 ± 0.443
3.039SerGlu: 3.039 ± 0.529
1.289SerPhe: 1.289 ± 0.317
6.446SerGly: 6.446 ± 1.042
0.737SerHis: 0.737 ± 0.24
1.842SerIle: 1.842 ± 0.297
2.21SerLys: 2.21 ± 0.514
4.328SerLeu: 4.328 ± 0.675
1.473SerMet: 1.473 ± 0.311
1.013SerAsn: 1.013 ± 0.3
4.881SerPro: 4.881 ± 0.633
2.302SerGln: 2.302 ± 0.364
3.591SerArg: 3.591 ± 0.46
4.144SerSer: 4.144 ± 0.656
3.499SerThr: 3.499 ± 0.589
3.776SerVal: 3.776 ± 0.511
0.645SerTrp: 0.645 ± 0.261
1.381SerTyr: 1.381 ± 0.426
0.0SerXaa: 0.0 ± 0.0
Thr
7.091ThrAla: 7.091 ± 0.77
0.553ThrCys: 0.553 ± 0.243
5.433ThrAsp: 5.433 ± 0.694
3.407ThrGlu: 3.407 ± 0.482
1.289ThrPhe: 1.289 ± 0.345
6.17ThrGly: 6.17 ± 0.825
0.737ThrHis: 0.737 ± 0.278
4.236ThrIle: 4.236 ± 0.745
3.131ThrLys: 3.131 ± 0.784
6.907ThrLeu: 6.907 ± 0.895
1.289ThrMet: 1.289 ± 0.292
1.934ThrAsn: 1.934 ± 0.433
6.262ThrPro: 6.262 ± 0.826
2.855ThrGln: 2.855 ± 0.531
4.052ThrArg: 4.052 ± 0.714
3.868ThrSer: 3.868 ± 0.612
4.052ThrThr: 4.052 ± 0.553
5.894ThrVal: 5.894 ± 0.847
2.026ThrTrp: 2.026 ± 0.642
1.473ThrTyr: 1.473 ± 0.262
0.0ThrXaa: 0.0 ± 0.0
Val
9.117ValAla: 9.117 ± 1.187
1.105ValCys: 1.105 ± 0.317
3.776ValAsp: 3.776 ± 0.522
8.012ValGlu: 8.012 ± 0.937
1.381ValPhe: 1.381 ± 0.33
5.433ValGly: 5.433 ± 0.726
1.197ValHis: 1.197 ± 0.439
3.868ValIle: 3.868 ± 0.614
2.486ValLys: 2.486 ± 0.516
4.881ValLeu: 4.881 ± 0.561
1.473ValMet: 1.473 ± 0.341
1.381ValAsn: 1.381 ± 0.43
4.42ValPro: 4.42 ± 0.651
3.131ValGln: 3.131 ± 0.444
4.789ValArg: 4.789 ± 0.739
4.236ValSer: 4.236 ± 0.649
4.789ValThr: 4.789 ± 0.621
5.525ValVal: 5.525 ± 0.703
2.21ValTrp: 2.21 ± 0.448
2.118ValTyr: 2.118 ± 0.46
0.0ValXaa: 0.0 ± 0.0
Trp
2.118TrpAla: 2.118 ± 0.384
0.368TrpCys: 0.368 ± 0.164
1.566TrpAsp: 1.566 ± 0.352
1.75TrpGlu: 1.75 ± 0.326
0.092TrpPhe: 0.092 ± 0.099
2.118TrpGly: 2.118 ± 0.507
0.276TrpHis: 0.276 ± 0.138
0.921TrpIle: 0.921 ± 0.306
0.829TrpLys: 0.829 ± 0.334
1.289TrpLeu: 1.289 ± 0.405
0.46TrpMet: 0.46 ± 0.177
0.645TrpAsn: 0.645 ± 0.235
0.737TrpPro: 0.737 ± 0.246
1.013TrpGln: 1.013 ± 0.454
1.289TrpArg: 1.289 ± 0.505
1.381TrpSer: 1.381 ± 0.345
2.21TrpThr: 2.21 ± 0.54
1.75TrpVal: 1.75 ± 0.416
0.368TrpTrp: 0.368 ± 0.178
0.46TrpTyr: 0.46 ± 0.172
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.671TyrAla: 2.671 ± 0.618
0.092TyrCys: 0.092 ± 0.105
1.658TyrAsp: 1.658 ± 0.426
1.197TyrGlu: 1.197 ± 0.352
0.276TyrPhe: 0.276 ± 0.157
2.671TyrGly: 2.671 ± 0.66
0.092TyrHis: 0.092 ± 0.068
0.553TyrIle: 0.553 ± 0.238
1.013TyrLys: 1.013 ± 0.273
1.381TyrLeu: 1.381 ± 0.322
0.46TyrMet: 0.46 ± 0.242
0.645TyrAsn: 0.645 ± 0.201
1.197TyrPro: 1.197 ± 0.327
0.737TyrGln: 0.737 ± 0.277
1.105TyrArg: 1.105 ± 0.42
1.658TyrSer: 1.658 ± 0.344
1.934TyrThr: 1.934 ± 0.396
1.566TyrVal: 1.566 ± 0.31
0.737TyrTrp: 0.737 ± 0.239
0.737TyrTyr: 0.737 ± 0.271
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (10860 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski