Amino acid dipepetide frequency for Cronobacter phage GW1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.811AlaAla: 9.811 ± 1.167
0.831AlaCys: 0.831 ± 0.276
5.654AlaAsp: 5.654 ± 0.729
5.987AlaGlu: 5.987 ± 0.723
3.326AlaPhe: 3.326 ± 0.443
9.229AlaGly: 9.229 ± 1.057
1.081AlaHis: 1.081 ± 0.325
6.07AlaIle: 6.07 ± 0.713
6.569AlaLys: 6.569 ± 0.566
7.483AlaLeu: 7.483 ± 0.95
2.494AlaMet: 2.494 ± 0.395
3.243AlaAsn: 3.243 ± 0.453
2.411AlaPro: 2.411 ± 0.571
3.409AlaGln: 3.409 ± 0.473
3.658AlaArg: 3.658 ± 0.524
5.072AlaSer: 5.072 ± 0.675
3.991AlaThr: 3.991 ± 0.648
6.07AlaVal: 6.07 ± 0.819
1.081AlaTrp: 1.081 ± 0.378
2.91AlaTyr: 2.91 ± 0.511
0.0AlaXaa: 0.0 ± 0.0
Cys
0.582CysAla: 0.582 ± 0.179
0.0CysCys: 0.0 ± 0.0
0.748CysAsp: 0.748 ± 0.336
0.665CysGlu: 0.665 ± 0.288
0.665CysPhe: 0.665 ± 0.24
0.665CysGly: 0.665 ± 0.29
0.333CysHis: 0.333 ± 0.166
0.166CysIle: 0.166 ± 0.12
0.582CysLys: 0.582 ± 0.255
0.915CysLeu: 0.915 ± 0.245
0.166CysMet: 0.166 ± 0.178
0.249CysAsn: 0.249 ± 0.145
0.416CysPro: 0.416 ± 0.169
0.166CysGln: 0.166 ± 0.121
0.582CysArg: 0.582 ± 0.266
0.416CysSer: 0.416 ± 0.244
0.582CysThr: 0.582 ± 0.21
0.582CysVal: 0.582 ± 0.269
0.0CysTrp: 0.0 ± 0.0
0.249CysTyr: 0.249 ± 0.149
0.0CysXaa: 0.0 ± 0.0
Asp
6.236AspAla: 6.236 ± 0.695
0.582AspCys: 0.582 ± 0.317
3.825AspAsp: 3.825 ± 0.693
3.991AspGlu: 3.991 ± 0.43
2.079AspPhe: 2.079 ± 0.461
6.402AspGly: 6.402 ± 0.651
1.164AspHis: 1.164 ± 0.359
4.157AspIle: 4.157 ± 0.448
3.243AspLys: 3.243 ± 0.628
5.321AspLeu: 5.321 ± 0.654
1.663AspMet: 1.663 ± 0.341
2.328AspAsn: 2.328 ± 0.546
2.744AspPro: 2.744 ± 0.511
2.245AspGln: 2.245 ± 0.384
2.079AspArg: 2.079 ± 0.364
2.91AspSer: 2.91 ± 0.423
3.326AspThr: 3.326 ± 0.488
4.24AspVal: 4.24 ± 0.561
1.164AspTrp: 1.164 ± 0.381
2.494AspTyr: 2.494 ± 0.387
0.0AspXaa: 0.0 ± 0.0
Glu
7.4GluAla: 7.4 ± 0.992
0.915GluCys: 0.915 ± 0.331
4.49GluAsp: 4.49 ± 0.783
4.49GluGlu: 4.49 ± 0.909
2.661GluPhe: 2.661 ± 0.506
5.072GluGly: 5.072 ± 0.786
1.081GluHis: 1.081 ± 0.292
2.411GluIle: 2.411 ± 0.372
2.827GluLys: 2.827 ± 0.525
6.07GluLeu: 6.07 ± 0.773
1.58GluMet: 1.58 ± 0.377
2.328GluAsn: 2.328 ± 0.457
2.162GluPro: 2.162 ± 0.455
3.658GluGln: 3.658 ± 0.507
4.407GluArg: 4.407 ± 0.519
3.326GluSer: 3.326 ± 0.456
4.157GluThr: 4.157 ± 0.534
4.407GluVal: 4.407 ± 0.729
1.247GluTrp: 1.247 ± 0.261
2.91GluTyr: 2.91 ± 0.546
0.0GluXaa: 0.0 ± 0.0
Phe
2.91PheAla: 2.91 ± 0.583
0.416PheCys: 0.416 ± 0.242
2.661PheAsp: 2.661 ± 0.406
1.912PheGlu: 1.912 ± 0.392
1.247PhePhe: 1.247 ± 0.33
2.328PheGly: 2.328 ± 0.468
0.748PheHis: 0.748 ± 0.21
1.746PheIle: 1.746 ± 0.38
2.993PheLys: 2.993 ± 0.508
3.492PheLeu: 3.492 ± 0.422
0.998PheMet: 0.998 ± 0.272
2.411PheAsn: 2.411 ± 0.41
1.247PhePro: 1.247 ± 0.318
0.915PheGln: 0.915 ± 0.274
1.746PheArg: 1.746 ± 0.326
2.494PheSer: 2.494 ± 0.3
2.578PheThr: 2.578 ± 0.342
2.578PheVal: 2.578 ± 0.57
0.166PheTrp: 0.166 ± 0.129
1.497PheTyr: 1.497 ± 0.298
0.0PheXaa: 0.0 ± 0.0
Gly
6.652GlyAla: 6.652 ± 0.896
0.582GlyCys: 0.582 ± 0.265
4.822GlyAsp: 4.822 ± 0.89
5.405GlyGlu: 5.405 ± 0.637
2.162GlyPhe: 2.162 ± 0.312
5.654GlyGly: 5.654 ± 0.799
0.998GlyHis: 0.998 ± 0.266
3.742GlyIle: 3.742 ± 0.611
6.984GlyLys: 6.984 ± 0.697
6.984GlyLeu: 6.984 ± 0.907
2.411GlyMet: 2.411 ± 0.541
2.494GlyAsn: 2.494 ± 0.438
1.413GlyPro: 1.413 ± 0.44
3.16GlyGln: 3.16 ± 0.455
5.405GlyArg: 5.405 ± 0.524
5.987GlySer: 5.987 ± 0.653
4.49GlyThr: 4.49 ± 0.698
4.822GlyVal: 4.822 ± 0.607
1.33GlyTrp: 1.33 ± 0.344
4.49GlyTyr: 4.49 ± 0.544
0.0GlyXaa: 0.0 ± 0.0
His
0.748HisAla: 0.748 ± 0.282
0.333HisCys: 0.333 ± 0.154
1.081HisAsp: 1.081 ± 0.428
0.915HisGlu: 0.915 ± 0.294
0.499HisPhe: 0.499 ± 0.226
1.33HisGly: 1.33 ± 0.399
0.416HisHis: 0.416 ± 0.196
0.998HisIle: 0.998 ± 0.252
0.831HisLys: 0.831 ± 0.246
1.996HisLeu: 1.996 ± 0.41
0.582HisMet: 0.582 ± 0.192
0.665HisAsn: 0.665 ± 0.187
0.665HisPro: 0.665 ± 0.228
0.665HisGln: 0.665 ± 0.214
0.748HisArg: 0.748 ± 0.214
0.915HisSer: 0.915 ± 0.247
1.58HisThr: 1.58 ± 0.367
0.915HisVal: 0.915 ± 0.289
0.499HisTrp: 0.499 ± 0.217
0.582HisTyr: 0.582 ± 0.252
0.0HisXaa: 0.0 ± 0.0
Ile
3.326IleAla: 3.326 ± 0.671
0.748IleCys: 0.748 ± 0.304
3.16IleAsp: 3.16 ± 0.372
2.578IleGlu: 2.578 ± 0.329
0.831IlePhe: 0.831 ± 0.245
4.24IleGly: 4.24 ± 0.573
1.33IleHis: 1.33 ± 0.353
1.829IleIle: 1.829 ± 0.356
3.658IleLys: 3.658 ± 0.653
3.16IleLeu: 3.16 ± 0.584
1.164IleMet: 1.164 ± 0.35
2.91IleAsn: 2.91 ± 0.569
1.912IlePro: 1.912 ± 0.442
1.912IleGln: 1.912 ± 0.503
2.91IleArg: 2.91 ± 0.472
2.494IleSer: 2.494 ± 0.341
3.076IleThr: 3.076 ± 0.407
3.658IleVal: 3.658 ± 0.56
0.665IleTrp: 0.665 ± 0.239
1.413IleTyr: 1.413 ± 0.307
0.0IleXaa: 0.0 ± 0.0
Lys
7.649LysAla: 7.649 ± 0.851
0.416LysCys: 0.416 ± 0.286
3.658LysAsp: 3.658 ± 0.46
3.825LysGlu: 3.825 ± 0.504
2.661LysPhe: 2.661 ± 0.47
4.074LysGly: 4.074 ± 0.585
0.998LysHis: 0.998 ± 0.312
2.245LysIle: 2.245 ± 0.405
3.742LysLys: 3.742 ± 0.999
5.321LysLeu: 5.321 ± 0.587
1.996LysMet: 1.996 ± 0.331
2.245LysAsn: 2.245 ± 0.407
2.827LysPro: 2.827 ± 0.65
2.411LysGln: 2.411 ± 0.45
3.991LysArg: 3.991 ± 0.592
4.573LysSer: 4.573 ± 0.602
3.409LysThr: 3.409 ± 0.503
5.238LysVal: 5.238 ± 0.631
0.998LysTrp: 0.998 ± 0.267
2.328LysTyr: 2.328 ± 0.421
0.0LysXaa: 0.0 ± 0.0
Leu
8.065LeuAla: 8.065 ± 1.064
0.166LeuCys: 0.166 ± 0.107
4.739LeuAsp: 4.739 ± 0.382
6.735LeuGlu: 6.735 ± 0.747
2.245LeuPhe: 2.245 ± 0.454
4.989LeuGly: 4.989 ± 0.603
0.998LeuHis: 0.998 ± 0.264
3.076LeuIle: 3.076 ± 0.402
6.818LeuLys: 6.818 ± 0.883
5.238LeuLeu: 5.238 ± 0.704
2.578LeuMet: 2.578 ± 0.471
4.407LeuAsn: 4.407 ± 0.606
3.742LeuPro: 3.742 ± 0.441
4.157LeuGln: 4.157 ± 0.541
4.573LeuArg: 4.573 ± 0.507
5.321LeuSer: 5.321 ± 0.635
5.155LeuThr: 5.155 ± 0.899
5.737LeuVal: 5.737 ± 0.648
0.665LeuTrp: 0.665 ± 0.266
2.162LeuTyr: 2.162 ± 0.471
0.0LeuXaa: 0.0 ± 0.0
Met
2.993MetAla: 2.993 ± 0.44
0.333MetCys: 0.333 ± 0.186
1.33MetAsp: 1.33 ± 0.362
1.829MetGlu: 1.829 ± 0.369
0.915MetPhe: 0.915 ± 0.314
2.661MetGly: 2.661 ± 0.452
0.249MetHis: 0.249 ± 0.138
1.33MetIle: 1.33 ± 0.297
0.831MetLys: 0.831 ± 0.263
2.079MetLeu: 2.079 ± 0.378
0.748MetMet: 0.748 ± 0.217
1.497MetAsn: 1.497 ± 0.349
0.915MetPro: 0.915 ± 0.263
0.915MetGln: 0.915 ± 0.289
1.247MetArg: 1.247 ± 0.251
2.079MetSer: 2.079 ± 0.361
1.912MetThr: 1.912 ± 0.405
2.494MetVal: 2.494 ± 0.438
0.333MetTrp: 0.333 ± 0.21
0.915MetTyr: 0.915 ± 0.218
0.0MetXaa: 0.0 ± 0.0
Asn
3.991AsnAla: 3.991 ± 0.719
0.416AsnCys: 0.416 ± 0.185
2.245AsnAsp: 2.245 ± 0.436
2.245AsnGlu: 2.245 ± 0.379
1.829AsnPhe: 1.829 ± 0.29
4.074AsnGly: 4.074 ± 0.487
0.582AsnHis: 0.582 ± 0.218
1.996AsnIle: 1.996 ± 0.37
1.996AsnLys: 1.996 ± 0.426
3.326AsnLeu: 3.326 ± 0.542
1.164AsnMet: 1.164 ± 0.286
1.746AsnAsn: 1.746 ± 0.343
2.744AsnPro: 2.744 ± 0.558
1.663AsnGln: 1.663 ± 0.24
2.328AsnArg: 2.328 ± 0.529
2.993AsnSer: 2.993 ± 0.654
1.829AsnThr: 1.829 ± 0.483
3.076AsnVal: 3.076 ± 0.474
0.333AsnTrp: 0.333 ± 0.167
1.247AsnTyr: 1.247 ± 0.272
0.0AsnXaa: 0.0 ± 0.0
Pro
3.076ProAla: 3.076 ± 0.471
0.416ProCys: 0.416 ± 0.247
2.079ProAsp: 2.079 ± 0.334
3.326ProGlu: 3.326 ± 0.542
1.497ProPhe: 1.497 ± 0.317
1.912ProGly: 1.912 ± 0.305
0.665ProHis: 0.665 ± 0.224
1.663ProIle: 1.663 ± 0.406
3.243ProLys: 3.243 ± 0.618
2.328ProLeu: 2.328 ± 0.458
1.081ProMet: 1.081 ± 0.323
2.411ProAsn: 2.411 ± 0.356
0.748ProPro: 0.748 ± 0.287
2.079ProGln: 2.079 ± 0.497
1.746ProArg: 1.746 ± 0.298
2.411ProSer: 2.411 ± 0.369
2.578ProThr: 2.578 ± 0.386
2.91ProVal: 2.91 ± 0.439
0.831ProTrp: 0.831 ± 0.285
0.998ProTyr: 0.998 ± 0.295
0.0ProXaa: 0.0 ± 0.0
Gln
4.074GlnAla: 4.074 ± 0.54
0.0GlnCys: 0.0 ± 0.0
3.825GlnAsp: 3.825 ± 0.691
2.578GlnGlu: 2.578 ± 0.518
1.912GlnPhe: 1.912 ± 0.311
2.661GlnGly: 2.661 ± 0.44
0.416GlnHis: 0.416 ± 0.202
1.33GlnIle: 1.33 ± 0.272
2.245GlnLys: 2.245 ± 0.471
4.656GlnLeu: 4.656 ± 0.683
1.413GlnMet: 1.413 ± 0.392
1.829GlnAsn: 1.829 ± 0.493
1.413GlnPro: 1.413 ± 0.389
1.912GlnGln: 1.912 ± 0.461
2.827GlnArg: 2.827 ± 0.566
2.661GlnSer: 2.661 ± 0.493
2.411GlnThr: 2.411 ± 0.47
2.411GlnVal: 2.411 ± 0.401
0.915GlnTrp: 0.915 ± 0.266
1.413GlnTyr: 1.413 ± 0.404
0.0GlnXaa: 0.0 ± 0.0
Arg
4.989ArgAla: 4.989 ± 0.743
0.665ArgCys: 0.665 ± 0.261
3.908ArgAsp: 3.908 ± 0.471
3.991ArgGlu: 3.991 ± 0.556
3.16ArgPhe: 3.16 ± 0.413
3.742ArgGly: 3.742 ± 0.48
0.748ArgHis: 0.748 ± 0.215
2.661ArgIle: 2.661 ± 0.514
3.076ArgLys: 3.076 ± 0.568
5.654ArgLeu: 5.654 ± 0.638
0.998ArgMet: 0.998 ± 0.328
1.912ArgAsn: 1.912 ± 0.396
1.746ArgPro: 1.746 ± 0.383
2.411ArgGln: 2.411 ± 0.379
1.829ArgArg: 1.829 ± 0.327
3.16ArgSer: 3.16 ± 0.511
2.578ArgThr: 2.578 ± 0.383
3.409ArgVal: 3.409 ± 0.563
1.081ArgTrp: 1.081 ± 0.308
1.497ArgTyr: 1.497 ± 0.288
0.0ArgXaa: 0.0 ± 0.0
Ser
4.739SerAla: 4.739 ± 0.646
0.665SerCys: 0.665 ± 0.22
4.822SerAsp: 4.822 ± 0.492
3.243SerGlu: 3.243 ± 0.48
2.91SerPhe: 2.91 ± 0.474
5.987SerGly: 5.987 ± 0.97
2.328SerHis: 2.328 ± 0.4
2.578SerIle: 2.578 ± 0.465
3.492SerLys: 3.492 ± 0.47
3.658SerLeu: 3.658 ± 0.56
1.413SerMet: 1.413 ± 0.498
1.58SerAsn: 1.58 ± 0.282
2.993SerPro: 2.993 ± 0.506
2.494SerGln: 2.494 ± 0.465
3.492SerArg: 3.492 ± 0.475
3.825SerSer: 3.825 ± 0.589
3.742SerThr: 3.742 ± 0.593
4.324SerVal: 4.324 ± 0.515
1.164SerTrp: 1.164 ± 0.303
2.494SerTyr: 2.494 ± 0.518
0.0SerXaa: 0.0 ± 0.0
Thr
3.658ThrAla: 3.658 ± 0.686
0.333ThrCys: 0.333 ± 0.165
3.16ThrAsp: 3.16 ± 0.576
4.822ThrGlu: 4.822 ± 0.531
1.829ThrPhe: 1.829 ± 0.365
5.571ThrGly: 5.571 ± 0.775
0.665ThrHis: 0.665 ± 0.217
3.409ThrIle: 3.409 ± 0.538
3.742ThrLys: 3.742 ± 0.557
4.739ThrLeu: 4.739 ± 0.625
1.164ThrMet: 1.164 ± 0.332
2.162ThrAsn: 2.162 ± 0.48
3.16ThrPro: 3.16 ± 0.345
2.411ThrGln: 2.411 ± 0.359
2.661ThrArg: 2.661 ± 0.428
2.827ThrSer: 2.827 ± 0.647
2.91ThrThr: 2.91 ± 0.519
5.405ThrVal: 5.405 ± 0.804
0.748ThrTrp: 0.748 ± 0.229
1.829ThrTyr: 1.829 ± 0.349
0.0ThrXaa: 0.0 ± 0.0
Val
5.238ValAla: 5.238 ± 0.725
0.416ValCys: 0.416 ± 0.176
3.409ValAsp: 3.409 ± 0.47
6.153ValGlu: 6.153 ± 0.758
2.744ValPhe: 2.744 ± 0.505
5.321ValGly: 5.321 ± 0.607
1.33ValHis: 1.33 ± 0.538
3.492ValIle: 3.492 ± 0.499
4.49ValLys: 4.49 ± 0.491
5.072ValLeu: 5.072 ± 0.621
2.411ValMet: 2.411 ± 0.483
2.91ValAsn: 2.91 ± 0.608
2.993ValPro: 2.993 ± 0.469
3.742ValGln: 3.742 ± 0.652
3.575ValArg: 3.575 ± 0.521
4.822ValSer: 4.822 ± 0.716
4.074ValThr: 4.074 ± 0.516
5.488ValVal: 5.488 ± 0.851
0.831ValTrp: 0.831 ± 0.267
2.411ValTyr: 2.411 ± 0.386
0.0ValXaa: 0.0 ± 0.0
Trp
0.499TrpAla: 0.499 ± 0.213
0.166TrpCys: 0.166 ± 0.141
0.582TrpAsp: 0.582 ± 0.183
0.998TrpGlu: 0.998 ± 0.304
0.748TrpPhe: 0.748 ± 0.251
1.247TrpGly: 1.247 ± 0.343
0.416TrpHis: 0.416 ± 0.2
0.499TrpIle: 0.499 ± 0.23
1.33TrpLys: 1.33 ± 0.358
1.996TrpLeu: 1.996 ± 0.401
0.166TrpMet: 0.166 ± 0.111
0.998TrpAsn: 0.998 ± 0.215
0.249TrpPro: 0.249 ± 0.136
0.748TrpGln: 0.748 ± 0.254
0.831TrpArg: 0.831 ± 0.251
1.164TrpSer: 1.164 ± 0.473
0.582TrpThr: 0.582 ± 0.265
1.164TrpVal: 1.164 ± 0.315
0.249TrpTrp: 0.249 ± 0.142
0.333TrpTyr: 0.333 ± 0.157
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.742TyrAla: 3.742 ± 0.591
0.333TyrCys: 0.333 ± 0.173
1.996TyrAsp: 1.996 ± 0.365
1.912TyrGlu: 1.912 ± 0.436
1.164TyrPhe: 1.164 ± 0.288
2.827TyrGly: 2.827 ± 0.489
0.416TyrHis: 0.416 ± 0.156
1.663TyrIle: 1.663 ± 0.396
1.912TyrLys: 1.912 ± 0.332
2.079TyrLeu: 2.079 ± 0.4
1.33TyrMet: 1.33 ± 0.303
1.497TyrAsn: 1.497 ± 0.342
1.497TyrPro: 1.497 ± 0.375
1.829TyrGln: 1.829 ± 0.492
2.578TyrArg: 2.578 ± 0.574
2.494TyrSer: 2.494 ± 0.435
2.245TyrThr: 2.245 ± 0.431
1.996TyrVal: 1.996 ± 0.402
0.665TyrTrp: 0.665 ± 0.257
1.164TyrTyr: 1.164 ± 0.25
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (12028 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski