Amino acid dipeptide frequency for Burkholderia phage phiE52237

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
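As an illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a list of protein sequences and reports them in per mille. It is a minimal example under stated assumptions, not the pipeline used by Proteome-pI: the function name dipeptide_permille, the toy sequences, and the choice to count pairs only within a protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the column order used by the table; "X" maps to Xaa.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the 20 standard ones is treated as X (Xaa).
        seq = "".join(c if c in ONE_TO_THREE else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    freqs = {}
    for a, b in product(ONE_TO_THREE, repeat=2):
        name = ONE_TO_THREE[a] + ONE_TO_THREE[b]
        freqs[name] = 1000.0 * counts[a + b] / total if total else 0.0
    return freqs


# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MAAK", "MAL"]
freqs = dipeptide_permille(toy)
print(len(freqs))                  # 441 ordered pairs, including Xaa
print(round(freqs["MetAla"], 1))   # 400.0 (2 of the 5 pairs are Met-Ala)
```

The result always contains all 441 keys, so dipeptides that never occur appear with a frequency of 0.0, as in the Xaa rows of the table.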

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
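The expected value and the red/blue marking can be sketched as follows. This is only an illustration: it assumes the baseline is the uniform expectation over the 400 standard dipeptides and that any value strictly above or below it is marked; the page does not state the exact rule used for the coloring. The function name mark is hypothetical, and the observed values are copied from the table.

```python
N_STANDARD = 20                              # standard amino acids
N_DIPEPTIDES = N_STANDARD ** 2               # 400 ordered pairs
EXPECTED_PERMILLE = 1000 / N_DIPEPTIDES      # 2.5 per mille, i.e. 0.25%


def mark(observed_permille, expected=EXPECTED_PERMILLE):
    """Assumed coloring rule: red if above the uniform expectation, blue if below."""
    if observed_permille > expected:
        return "red"
    if observed_permille < expected:
        return "blue"
    return "unmarked"


# A few values copied from the table (per mille).
observed = {"AlaAla": 22.198, "LeuLeu": 6.614, "CysCys": 0.0, "TrpTrp": 0.181}
marks = {name: mark(value) for name, value in observed.items()}
print(EXPECTED_PERMILLE)  # 2.5
print(marks)
```

Under this assumption AlaAla and LeuLeu come out as overrepresented, while CysCys and TrpTrp are underrepresented.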

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
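For readers who prefer fractions or percentages, the unit conversion is a single multiplication. The helper names below are hypothetical, and the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (1 per mille = 0.1%)."""
    return value_permille * 0.1


ala_ala = 22.198  # AlaAla, in per mille, copied from the table
print(permille_to_fraction(ala_ala))  # approximately 0.022198
print(permille_to_percent(ala_ala))   # approximately 2.2198
```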

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 22.198 ± 2.501
AlaCys: 0.634 ± 0.214
AlaAsp: 8.517 ± 0.92
AlaGlu: 5.708 ± 0.599
AlaPhe: 3.171 ± 0.554
AlaGly: 11.507 ± 0.76
AlaHis: 2.175 ± 0.369
AlaIle: 5.074 ± 0.656
AlaLys: 5.436 ± 1.036
AlaLeu: 14.044 ± 1.771
AlaMet: 3.624 ± 0.413
AlaAsn: 3.443 ± 0.482
AlaPro: 7.067 ± 0.731
AlaGln: 4.711 ± 0.809
AlaArg: 10.419 ± 0.949
AlaSer: 6.705 ± 0.65
AlaThr: 7.883 ± 0.966
AlaVal: 6.795 ± 0.754
AlaTrp: 2.99 ± 0.569
AlaTyr: 3.171 ± 0.614
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.997 ± 0.401
CysCys: 0.0 ± 0.0
CysAsp: 0.272 ± 0.17
CysGlu: 0.725 ± 0.275
CysPhe: 0.091 ± 0.081
CysGly: 0.634 ± 0.284
CysHis: 0.362 ± 0.186
CysIle: 0.362 ± 0.205
CysLys: 0.181 ± 0.145
CysLeu: 0.815 ± 0.276
CysMet: 0.272 ± 0.173
CysAsn: 0.272 ± 0.132
CysPro: 0.181 ± 0.121
CysGln: 0.362 ± 0.185
CysArg: 0.725 ± 0.275
CysSer: 0.634 ± 0.212
CysThr: 0.634 ± 0.28
CysVal: 0.362 ± 0.177
CysTrp: 0.272 ± 0.155
CysTyr: 0.272 ± 0.145
CysXaa: 0.0 ± 0.0
Asp
AspAla: 9.695 ± 0.99
AspCys: 0.453 ± 0.171
AspAsp: 3.896 ± 0.76
AspGlu: 3.896 ± 0.636
AspPhe: 2.265 ± 0.418
AspGly: 6.07 ± 1.086
AspHis: 1.45 ± 0.387
AspIle: 2.809 ± 0.713
AspLys: 2.084 ± 0.64
AspLeu: 4.983 ± 0.905
AspMet: 1.903 ± 0.394
AspAsn: 1.359 ± 0.316
AspPro: 2.809 ± 0.485
AspGln: 1.903 ± 0.349
AspArg: 4.349 ± 0.617
AspSer: 1.631 ± 0.396
AspThr: 3.443 ± 0.518
AspVal: 4.077 ± 0.698
AspTrp: 0.453 ± 0.208
AspTyr: 2.084 ± 0.421
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.802 ± 0.736
GluCys: 0.544 ± 0.212
GluAsp: 1.45 ± 0.436
GluGlu: 1.812 ± 0.486
GluPhe: 2.809 ± 0.444
GluGly: 2.899 ± 0.577
GluHis: 1.359 ± 0.387
GluIle: 2.537 ± 0.443
GluLys: 2.628 ± 0.569
GluLeu: 6.161 ± 0.683
GluMet: 0.997 ± 0.299
GluAsn: 1.812 ± 0.348
GluPro: 2.718 ± 0.569
GluGln: 1.812 ± 0.343
GluArg: 6.524 ± 0.497
GluSer: 2.899 ± 0.615
GluThr: 2.537 ± 0.521
GluVal: 2.899 ± 0.601
GluTrp: 1.178 ± 0.298
GluTyr: 1.631 ± 0.343
GluXaa: 0.0 ± 0.0
Phe
PheAla: 6.342 ± 0.721
PheCys: 0.272 ± 0.126
PheAsp: 2.356 ± 0.475
PheGlu: 2.175 ± 0.426
PhePhe: 1.178 ± 0.339
PheGly: 2.718 ± 0.59
PheHis: 0.634 ± 0.346
PheIle: 1.178 ± 0.372
PheLys: 1.631 ± 0.459
PheLeu: 1.721 ± 0.359
PheMet: 0.453 ± 0.177
PheAsn: 0.815 ± 0.295
PhePro: 1.45 ± 0.349
PheGln: 0.634 ± 0.248
PheArg: 2.356 ± 0.519
PheSer: 2.084 ± 0.353
PheThr: 1.631 ± 0.339
PheVal: 1.812 ± 0.386
PheTrp: 0.634 ± 0.25
PheTyr: 0.725 ± 0.272
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.242 ± 1.258
GlyCys: 0.815 ± 0.262
GlyAsp: 4.077 ± 0.658
GlyGlu: 4.258 ± 0.599
GlyPhe: 2.718 ± 0.53
GlyGly: 7.611 ± 1.142
GlyHis: 1.721 ± 0.438
GlyIle: 2.537 ± 0.59
GlyLys: 3.624 ± 0.569
GlyLeu: 6.705 ± 0.997
GlyMet: 2.628 ± 0.472
GlyAsn: 1.993 ± 0.462
GlyPro: 3.081 ± 0.618
GlyGln: 1.993 ± 0.39
GlyArg: 6.795 ± 0.916
GlySer: 3.171 ± 0.563
GlyThr: 5.255 ± 0.743
GlyVal: 6.161 ± 0.787
GlyTrp: 2.356 ± 0.349
GlyTyr: 2.265 ± 0.497
GlyXaa: 0.0 ± 0.0
His
HisAla: 4.53 ± 0.616
HisCys: 0.362 ± 0.188
HisAsp: 1.631 ± 0.554
HisGlu: 1.359 ± 0.239
HisPhe: 0.453 ± 0.203
HisGly: 1.631 ± 0.336
HisHis: 0.906 ± 0.216
HisIle: 0.815 ± 0.305
HisLys: 0.815 ± 0.329
HisLeu: 1.631 ± 0.389
HisMet: 0.362 ± 0.192
HisAsn: 0.544 ± 0.23
HisPro: 1.178 ± 0.299
HisGln: 0.815 ± 0.256
HisArg: 1.54 ± 0.579
HisSer: 1.087 ± 0.38
HisThr: 1.903 ± 0.406
HisVal: 2.356 ± 0.393
HisTrp: 0.272 ± 0.143
HisTyr: 0.544 ± 0.196
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.799 ± 0.604
IleCys: 0.362 ± 0.207
IleAsp: 5.164 ± 0.671
IleGlu: 2.899 ± 0.662
IlePhe: 0.997 ± 0.221
IleGly: 3.805 ± 0.684
IleHis: 1.45 ± 0.293
IleIle: 0.544 ± 0.234
IleLys: 1.359 ± 0.308
IleLeu: 2.356 ± 0.411
IleMet: 0.634 ± 0.216
IleAsn: 1.359 ± 0.324
IlePro: 1.631 ± 0.452
IleGln: 1.268 ± 0.303
IleArg: 2.356 ± 0.42
IleSer: 2.356 ± 0.448
IleThr: 1.45 ± 0.319
IleVal: 2.809 ± 0.515
IleTrp: 0.181 ± 0.121
IleTyr: 0.725 ± 0.296
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.617 ± 0.706
LysCys: 0.272 ± 0.155
LysAsp: 1.903 ± 0.437
LysGlu: 1.268 ± 0.307
LysPhe: 1.359 ± 0.38
LysGly: 2.99 ± 0.588
LysHis: 0.906 ± 0.311
LysIle: 0.906 ± 0.383
LysLys: 2.084 ± 0.445
LysLeu: 3.715 ± 0.762
LysMet: 0.997 ± 0.23
LysAsn: 1.359 ± 0.432
LysPro: 1.721 ± 0.389
LysGln: 2.446 ± 0.468
LysArg: 4.711 ± 0.733
LysSer: 2.084 ± 0.368
LysThr: 2.356 ± 0.421
LysVal: 2.265 ± 0.549
LysTrp: 0.362 ± 0.148
LysTyr: 1.178 ± 0.348
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.322 ± 1.29
LeuCys: 0.997 ± 0.342
LeuAsp: 5.889 ± 0.557
LeuGlu: 4.893 ± 0.623
LeuPhe: 2.899 ± 0.651
LeuGly: 7.701 ± 1.637
LeuHis: 2.084 ± 0.544
LeuIle: 3.352 ± 0.513
LeuLys: 3.081 ± 0.488
LeuLeu: 6.614 ± 0.762
LeuMet: 1.721 ± 0.449
LeuAsn: 3.171 ± 0.554
LeuPro: 3.896 ± 0.64
LeuGln: 2.99 ± 0.384
LeuArg: 7.52 ± 1.05
LeuSer: 6.342 ± 0.7
LeuThr: 5.074 ± 0.564
LeuVal: 6.342 ± 0.691
LeuTrp: 0.272 ± 0.151
LeuTyr: 2.265 ± 0.519
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.718 ± 0.507
MetCys: 0.181 ± 0.141
MetAsp: 1.359 ± 0.369
MetGlu: 0.906 ± 0.282
MetPhe: 0.544 ± 0.281
MetGly: 1.359 ± 0.465
MetHis: 0.544 ± 0.199
MetIle: 0.634 ± 0.308
MetLys: 0.815 ± 0.316
MetLeu: 1.721 ± 0.397
MetMet: 0.272 ± 0.135
MetAsn: 1.087 ± 0.289
MetPro: 1.721 ± 0.365
MetGln: 0.906 ± 0.39
MetArg: 2.537 ± 0.512
MetSer: 1.178 ± 0.303
MetThr: 2.628 ± 0.439
MetVal: 1.54 ± 0.328
MetTrp: 0.091 ± 0.092
MetTyr: 0.815 ± 0.275
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.718 ± 0.423
AsnCys: 0.0 ± 0.0
AsnAsp: 2.809 ± 0.551
AsnGlu: 2.356 ± 0.432
AsnPhe: 0.634 ± 0.218
AsnGly: 3.534 ± 0.69
AsnHis: 0.906 ± 0.329
AsnIle: 1.268 ± 0.27
AsnLys: 1.178 ± 0.303
AsnLeu: 2.265 ± 0.479
AsnMet: 0.815 ± 0.293
AsnAsn: 0.725 ± 0.293
AsnPro: 1.631 ± 0.331
AsnGln: 0.906 ± 0.27
AsnArg: 1.903 ± 0.374
AsnSer: 0.906 ± 0.267
AsnThr: 1.268 ± 0.372
AsnVal: 2.537 ± 0.426
AsnTrp: 0.181 ± 0.112
AsnTyr: 0.815 ± 0.246
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.795 ± 0.954
ProCys: 0.453 ± 0.249
ProAsp: 3.262 ± 0.598
ProGlu: 3.171 ± 0.639
ProPhe: 1.178 ± 0.262
ProGly: 2.175 ± 0.538
ProHis: 1.359 ± 0.304
ProIle: 2.446 ± 0.632
ProLys: 2.446 ± 0.501
ProLeu: 4.258 ± 0.566
ProMet: 0.362 ± 0.152
ProAsn: 1.087 ± 0.266
ProPro: 3.171 ± 0.753
ProGln: 1.268 ± 0.306
ProArg: 3.987 ± 0.609
ProSer: 2.265 ± 0.403
ProThr: 2.084 ± 0.407
ProVal: 3.715 ± 0.735
ProTrp: 1.268 ± 0.263
ProTyr: 0.906 ± 0.306
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.621 ± 0.611
GlnCys: 0.362 ± 0.176
GlnAsp: 1.087 ± 0.265
GlnGlu: 0.997 ± 0.35
GlnPhe: 1.45 ± 0.452
GlnGly: 1.721 ± 0.351
GlnHis: 0.906 ± 0.28
GlnIle: 1.54 ± 0.409
GlnLys: 1.631 ± 0.405
GlnLeu: 3.262 ± 0.525
GlnMet: 1.087 ± 0.383
GlnAsn: 1.087 ± 0.3
GlnPro: 1.178 ± 0.295
GlnGln: 2.265 ± 0.578
GlnArg: 3.443 ± 0.664
GlnSer: 2.084 ± 0.461
GlnThr: 2.356 ± 0.445
GlnVal: 1.993 ± 0.391
GlnTrp: 0.362 ± 0.18
GlnTyr: 0.906 ± 0.236
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.238 ± 1.395
ArgCys: 0.906 ± 0.262
ArgAsp: 3.987 ± 0.657
ArgGlu: 5.98 ± 0.873
ArgPhe: 1.993 ± 0.389
ArgGly: 6.161 ± 0.822
ArgHis: 2.537 ± 0.623
ArgIle: 4.077 ± 0.596
ArgLys: 4.077 ± 0.682
ArgLeu: 7.792 ± 1.044
ArgMet: 1.903 ± 0.359
ArgAsn: 2.537 ± 0.56
ArgPro: 3.443 ± 0.578
ArgGln: 3.534 ± 0.478
ArgArg: 7.973 ± 1.613
ArgSer: 3.171 ± 0.456
ArgThr: 4.349 ± 0.555
ArgVal: 6.614 ± 0.8
ArgTrp: 1.178 ± 0.36
ArgTyr: 2.175 ± 0.446
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.705 ± 0.738
SerCys: 0.362 ± 0.169
SerAsp: 3.262 ± 0.557
SerGlu: 1.903 ± 0.354
SerPhe: 1.721 ± 0.434
SerGly: 4.893 ± 0.53
SerHis: 1.268 ± 0.443
SerIle: 2.175 ± 0.396
SerLys: 1.631 ± 0.321
SerLeu: 4.621 ± 0.648
SerMet: 1.359 ± 0.403
SerAsn: 1.812 ± 0.531
SerPro: 3.081 ± 0.696
SerGln: 0.997 ± 0.273
SerArg: 3.805 ± 0.669
SerSer: 3.262 ± 0.579
SerThr: 3.534 ± 0.516
SerVal: 2.446 ± 0.441
SerTrp: 0.815 ± 0.297
SerTyr: 0.997 ± 0.23
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.161 ± 0.744
ThrCys: 0.544 ± 0.202
ThrAsp: 4.349 ± 0.713
ThrGlu: 2.265 ± 0.423
ThrPhe: 2.537 ± 0.666
ThrGly: 5.074 ± 0.625
ThrHis: 1.54 ± 0.473
ThrIle: 2.628 ± 0.472
ThrLys: 2.084 ± 0.444
ThrLeu: 5.527 ± 0.689
ThrMet: 1.45 ± 0.313
ThrAsn: 1.721 ± 0.336
ThrPro: 3.805 ± 0.421
ThrGln: 1.812 ± 0.364
ThrArg: 4.077 ± 0.54
ThrSer: 2.718 ± 0.614
ThrThr: 4.168 ± 0.613
ThrVal: 4.53 ± 0.529
ThrTrp: 0.906 ± 0.276
ThrTyr: 0.725 ± 0.304
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.698 ± 0.682
ValCys: 0.544 ± 0.206
ValAsp: 4.53 ± 0.773
ValGlu: 3.624 ± 0.54
ValPhe: 2.899 ± 0.548
ValGly: 4.168 ± 0.536
ValHis: 0.906 ± 0.298
ValIle: 2.899 ± 0.458
ValLys: 2.628 ± 0.428
ValLeu: 6.07 ± 0.699
ValMet: 1.993 ± 0.451
ValAsn: 2.265 ± 0.468
ValPro: 2.809 ± 0.464
ValGln: 1.721 ± 0.429
ValArg: 5.708 ± 0.623
ValSer: 4.077 ± 0.816
ValThr: 3.715 ± 0.633
ValVal: 5.527 ± 0.764
ValTrp: 0.997 ± 0.355
ValTyr: 1.993 ± 0.382
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.993 ± 0.371
TrpCys: 0.181 ± 0.127
TrpAsp: 0.453 ± 0.234
TrpGlu: 0.362 ± 0.161
TrpPhe: 0.634 ± 0.218
TrpGly: 0.906 ± 0.397
TrpHis: 0.634 ± 0.224
TrpIle: 0.815 ± 0.284
TrpLys: 0.362 ± 0.159
TrpLeu: 2.537 ± 0.462
TrpMet: 0.272 ± 0.137
TrpAsn: 0.362 ± 0.153
TrpPro: 0.544 ± 0.229
TrpGln: 0.725 ± 0.266
TrpArg: 1.359 ± 0.4
TrpSer: 1.178 ± 0.315
TrpThr: 0.997 ± 0.313
TrpVal: 0.725 ± 0.28
TrpTrp: 0.181 ± 0.11
TrpTyr: 0.181 ± 0.177
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.081 ± 0.501
TyrCys: 0.181 ± 0.122
TyrAsp: 1.359 ± 0.288
TyrGlu: 1.45 ± 0.418
TyrPhe: 1.268 ± 0.49
TyrGly: 1.45 ± 0.364
TyrHis: 1.178 ± 0.345
TyrIle: 0.906 ± 0.298
TyrLys: 0.725 ± 0.235
TyrLeu: 2.628 ± 0.591
TyrMet: 0.362 ± 0.18
TyrAsn: 0.544 ± 0.171
TyrPro: 0.544 ± 0.193
TyrGln: 1.268 ± 0.297
TyrArg: 2.537 ± 0.473
TyrSer: 0.815 ± 0.318
TyrThr: 1.359 ± 0.383
TyrVal: 2.265 ± 0.447
TyrTrp: 0.453 ± 0.152
TyrTyr: 0.362 ± 0.201
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 47 proteins (11038 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
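A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is taken as the error. This is a minimal sketch under assumptions, not the database's actual code: the page does not say which statistic is reported after the ± sign (the standard deviation is assumed here), and the function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs (one-letter codes) in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Mean and standard deviation (per mille) over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome, not the 47 phage proteins.
toy_proteome = ["MAAK", "MAL", "MKKAA"]
mean_aa, err_aa = bootstrap_error(toy_proteome, "AA")
print(f"AlaAla: {mean_aa:.3f} ± {err_aa:.3f}")
```

Resampling proteins rather than individual residues keeps each protein's composition intact, so the error reflects variation between proteins. A dipeptide that never occurs yields 0.0 ± 0.0, matching the Xaa rows of the table.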

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski