Amino acid dipepetide frequency for Microbacterium phage Lovelyunicorn

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.761AlaAla: 11.761 ± 1.183
0.611AlaCys: 0.611 ± 0.242
4.964AlaAsp: 4.964 ± 0.691
6.186AlaGlu: 6.186 ± 0.77
2.902AlaPhe: 2.902 ± 0.527
7.79AlaGly: 7.79 ± 1.328
1.909AlaHis: 1.909 ± 0.341
5.422AlaIle: 5.422 ± 0.992
5.193AlaLys: 5.193 ± 0.783
8.477AlaLeu: 8.477 ± 1.045
2.597AlaMet: 2.597 ± 0.452
3.055AlaAsn: 3.055 ± 0.429
4.048AlaPro: 4.048 ± 0.627
4.277AlaGln: 4.277 ± 0.623
5.881AlaArg: 5.881 ± 0.955
5.499AlaSer: 5.499 ± 0.692
6.033AlaThr: 6.033 ± 0.673
6.568AlaVal: 6.568 ± 0.678
1.986AlaTrp: 1.986 ± 0.403
2.749AlaTyr: 2.749 ± 0.527
0.0AlaXaa: 0.0 ± 0.0
Cys
0.382CysAla: 0.382 ± 0.166
0.0CysCys: 0.0 ± 0.0
0.305CysAsp: 0.305 ± 0.151
0.153CysGlu: 0.153 ± 0.106
0.153CysPhe: 0.153 ± 0.114
0.764CysGly: 0.764 ± 0.294
0.229CysHis: 0.229 ± 0.12
0.0CysIle: 0.0 ± 0.0
0.535CysLys: 0.535 ± 0.21
0.611CysLeu: 0.611 ± 0.275
0.076CysMet: 0.076 ± 0.075
0.229CysAsn: 0.229 ± 0.14
0.687CysPro: 0.687 ± 0.261
0.076CysGln: 0.076 ± 0.069
0.305CysArg: 0.305 ± 0.164
0.382CysSer: 0.382 ± 0.187
0.382CysThr: 0.382 ± 0.165
0.535CysVal: 0.535 ± 0.176
0.229CysTrp: 0.229 ± 0.103
0.458CysTyr: 0.458 ± 0.21
0.0CysXaa: 0.0 ± 0.0
Asp
5.117AspAla: 5.117 ± 0.628
0.764AspCys: 0.764 ± 0.25
5.422AspAsp: 5.422 ± 0.834
5.957AspGlu: 5.957 ± 1.407
2.215AspPhe: 2.215 ± 0.388
4.353AspGly: 4.353 ± 0.624
1.146AspHis: 1.146 ± 0.238
3.284AspIle: 3.284 ± 0.384
2.52AspLys: 2.52 ± 0.499
5.346AspLeu: 5.346 ± 0.731
1.375AspMet: 1.375 ± 0.326
1.527AspAsn: 1.527 ± 0.389
4.43AspPro: 4.43 ± 0.587
2.062AspGln: 2.062 ± 0.323
3.513AspArg: 3.513 ± 0.595
3.437AspSer: 3.437 ± 0.465
2.902AspThr: 2.902 ± 0.491
4.735AspVal: 4.735 ± 0.709
2.215AspTrp: 2.215 ± 0.374
2.749AspTyr: 2.749 ± 0.478
0.0AspXaa: 0.0 ± 0.0
Glu
7.943GluAla: 7.943 ± 0.761
0.305GluCys: 0.305 ± 0.186
4.964GluAsp: 4.964 ± 1.223
5.957GluGlu: 5.957 ± 1.381
1.909GluPhe: 1.909 ± 0.39
4.277GluGly: 4.277 ± 0.712
1.069GluHis: 1.069 ± 0.296
3.055GluIle: 3.055 ± 0.571
2.215GluLys: 2.215 ± 0.486
5.499GluLeu: 5.499 ± 0.646
2.367GluMet: 2.367 ± 0.489
1.909GluAsn: 1.909 ± 0.444
2.826GluPro: 2.826 ± 0.503
2.52GluGln: 2.52 ± 0.662
3.742GluArg: 3.742 ± 0.575
2.749GluSer: 2.749 ± 0.537
3.284GluThr: 3.284 ± 0.499
5.27GluVal: 5.27 ± 0.629
1.451GluTrp: 1.451 ± 0.306
1.909GluTyr: 1.909 ± 0.279
0.0GluXaa: 0.0 ± 0.0
Phe
2.673PheAla: 2.673 ± 0.445
0.153PheCys: 0.153 ± 0.105
2.52PheAsp: 2.52 ± 0.487
1.909PheGlu: 1.909 ± 0.381
0.611PhePhe: 0.611 ± 0.204
3.437PheGly: 3.437 ± 0.51
0.764PheHis: 0.764 ± 0.225
1.146PheIle: 1.146 ± 0.28
1.757PheLys: 1.757 ± 0.343
2.062PheLeu: 2.062 ± 0.393
0.535PheMet: 0.535 ± 0.298
0.993PheAsn: 0.993 ± 0.215
1.222PhePro: 1.222 ± 0.297
1.222PheGln: 1.222 ± 0.333
2.52PheArg: 2.52 ± 0.419
1.68PheSer: 1.68 ± 0.298
2.444PheThr: 2.444 ± 0.483
1.757PheVal: 1.757 ± 0.407
0.687PheTrp: 0.687 ± 0.263
0.84PheTyr: 0.84 ± 0.262
0.0PheXaa: 0.0 ± 0.0
Gly
6.568GlyAla: 6.568 ± 0.986
0.687GlyCys: 0.687 ± 0.228
3.971GlyAsp: 3.971 ± 0.413
3.895GlyGlu: 3.895 ± 0.472
2.978GlyPhe: 2.978 ± 0.414
5.117GlyGly: 5.117 ± 0.958
1.833GlyHis: 1.833 ± 0.369
5.193GlyIle: 5.193 ± 0.956
5.117GlyLys: 5.117 ± 0.674
6.11GlyLeu: 6.11 ± 0.727
2.367GlyMet: 2.367 ± 0.381
2.215GlyAsn: 2.215 ± 0.357
3.589GlyPro: 3.589 ± 0.596
3.284GlyGln: 3.284 ± 0.612
4.43GlyArg: 4.43 ± 0.718
5.117GlySer: 5.117 ± 0.609
6.11GlyThr: 6.11 ± 0.916
6.033GlyVal: 6.033 ± 0.758
1.451GlyTrp: 1.451 ± 0.273
2.138GlyTyr: 2.138 ± 0.64
0.0GlyXaa: 0.0 ± 0.0
His
1.069HisAla: 1.069 ± 0.253
0.153HisCys: 0.153 ± 0.102
0.993HisAsp: 0.993 ± 0.258
1.298HisGlu: 1.298 ± 0.337
0.916HisPhe: 0.916 ± 0.287
1.68HisGly: 1.68 ± 0.335
0.305HisHis: 0.305 ± 0.168
0.764HisIle: 0.764 ± 0.287
1.146HisLys: 1.146 ± 0.387
1.146HisLeu: 1.146 ± 0.246
0.382HisMet: 0.382 ± 0.171
0.764HisAsn: 0.764 ± 0.264
0.611HisPro: 0.611 ± 0.197
0.458HisGln: 0.458 ± 0.187
0.84HisArg: 0.84 ± 0.267
1.757HisSer: 1.757 ± 0.365
1.298HisThr: 1.298 ± 0.312
1.757HisVal: 1.757 ± 0.379
0.458HisTrp: 0.458 ± 0.148
0.84HisTyr: 0.84 ± 0.212
0.0HisXaa: 0.0 ± 0.0
Ile
5.575IleAla: 5.575 ± 0.612
0.305IleCys: 0.305 ± 0.156
4.735IleAsp: 4.735 ± 0.518
3.131IleGlu: 3.131 ± 0.608
0.611IlePhe: 0.611 ± 0.294
4.277IleGly: 4.277 ± 0.733
0.916IleHis: 0.916 ± 0.218
3.055IleIle: 3.055 ± 0.804
2.444IleLys: 2.444 ± 0.49
2.978IleLeu: 2.978 ± 0.467
1.375IleMet: 1.375 ± 0.347
1.68IleAsn: 1.68 ± 0.347
2.826IlePro: 2.826 ± 0.505
2.673IleGln: 2.673 ± 0.522
2.52IleArg: 2.52 ± 0.516
3.284IleSer: 3.284 ± 0.514
3.284IleThr: 3.284 ± 0.761
2.826IleVal: 2.826 ± 0.581
0.84IleTrp: 0.84 ± 0.258
1.604IleTyr: 1.604 ± 0.377
0.0IleXaa: 0.0 ± 0.0
Lys
4.811LysAla: 4.811 ± 0.791
0.305LysCys: 0.305 ± 0.148
2.444LysAsp: 2.444 ± 0.494
3.36LysGlu: 3.36 ± 0.592
1.069LysPhe: 1.069 ± 0.257
3.437LysGly: 3.437 ± 0.542
0.993LysHis: 0.993 ± 0.265
2.138LysIle: 2.138 ± 0.475
2.367LysLys: 2.367 ± 0.56
4.43LysLeu: 4.43 ± 0.519
1.146LysMet: 1.146 ± 0.311
1.527LysAsn: 1.527 ± 0.399
3.513LysPro: 3.513 ± 0.589
2.062LysGln: 2.062 ± 0.497
2.902LysArg: 2.902 ± 0.498
2.215LysSer: 2.215 ± 0.412
2.444LysThr: 2.444 ± 0.347
3.589LysVal: 3.589 ± 0.525
1.069LysTrp: 1.069 ± 0.266
1.222LysTyr: 1.222 ± 0.296
0.0LysXaa: 0.0 ± 0.0
Leu
8.706LeuAla: 8.706 ± 0.728
0.611LeuCys: 0.611 ± 0.19
6.415LeuAsp: 6.415 ± 0.726
5.881LeuGlu: 5.881 ± 0.822
2.062LeuPhe: 2.062 ± 0.343
6.11LeuGly: 6.11 ± 0.609
1.298LeuHis: 1.298 ± 0.392
4.888LeuIle: 4.888 ± 0.939
4.582LeuLys: 4.582 ± 0.727
7.943LeuLeu: 7.943 ± 0.705
2.138LeuMet: 2.138 ± 0.347
2.902LeuAsn: 2.902 ± 0.387
4.048LeuPro: 4.048 ± 0.585
2.749LeuGln: 2.749 ± 0.407
5.728LeuArg: 5.728 ± 0.628
4.277LeuSer: 4.277 ± 0.45
5.728LeuThr: 5.728 ± 0.554
6.186LeuVal: 6.186 ± 0.69
1.222LeuTrp: 1.222 ± 0.283
1.909LeuTyr: 1.909 ± 0.318
0.0LeuXaa: 0.0 ± 0.0
Met
2.902MetAla: 2.902 ± 0.382
0.229MetCys: 0.229 ± 0.125
1.68MetAsp: 1.68 ± 0.433
1.069MetGlu: 1.069 ± 0.233
0.764MetPhe: 0.764 ± 0.189
1.833MetGly: 1.833 ± 0.419
0.153MetHis: 0.153 ± 0.12
1.146MetIle: 1.146 ± 0.277
0.382MetLys: 0.382 ± 0.177
2.673MetLeu: 2.673 ± 0.586
0.611MetMet: 0.611 ± 0.181
0.993MetAsn: 0.993 ± 0.25
1.68MetPro: 1.68 ± 0.315
0.993MetGln: 0.993 ± 0.285
0.611MetArg: 0.611 ± 0.183
3.131MetSer: 3.131 ± 0.45
1.986MetThr: 1.986 ± 0.36
1.757MetVal: 1.757 ± 0.308
0.153MetTrp: 0.153 ± 0.119
0.382MetTyr: 0.382 ± 0.149
0.0MetXaa: 0.0 ± 0.0
Asn
2.749AsnAla: 2.749 ± 0.564
0.076AsnCys: 0.076 ± 0.08
1.909AsnAsp: 1.909 ± 0.338
1.833AsnGlu: 1.833 ± 0.413
0.764AsnPhe: 0.764 ± 0.237
3.131AsnGly: 3.131 ± 0.455
0.611AsnHis: 0.611 ± 0.187
1.833AsnIle: 1.833 ± 0.422
1.527AsnLys: 1.527 ± 0.348
3.131AsnLeu: 3.131 ± 0.449
0.458AsnMet: 0.458 ± 0.189
1.069AsnAsn: 1.069 ± 0.334
1.909AsnPro: 1.909 ± 0.402
1.451AsnGln: 1.451 ± 0.375
1.757AsnArg: 1.757 ± 0.434
2.215AsnSer: 2.215 ± 0.447
1.757AsnThr: 1.757 ± 0.385
1.68AsnVal: 1.68 ± 0.368
0.764AsnTrp: 0.764 ± 0.239
1.146AsnTyr: 1.146 ± 0.291
0.0AsnXaa: 0.0 ± 0.0
Pro
5.881ProAla: 5.881 ± 0.808
0.0ProCys: 0.0 ± 0.0
3.284ProAsp: 3.284 ± 0.618
3.819ProGlu: 3.819 ± 0.848
1.986ProPhe: 1.986 ± 0.453
3.895ProGly: 3.895 ± 0.632
0.611ProHis: 0.611 ± 0.255
2.215ProIle: 2.215 ± 0.315
1.68ProLys: 1.68 ± 0.311
3.819ProLeu: 3.819 ± 0.491
1.146ProMet: 1.146 ± 0.275
1.757ProAsn: 1.757 ± 0.392
1.298ProPro: 1.298 ± 0.397
2.597ProGln: 2.597 ± 0.48
1.909ProArg: 1.909 ± 0.495
2.826ProSer: 2.826 ± 0.389
3.819ProThr: 3.819 ± 0.654
4.582ProVal: 4.582 ± 0.481
0.687ProTrp: 0.687 ± 0.232
1.222ProTyr: 1.222 ± 0.287
0.0ProXaa: 0.0 ± 0.0
Gln
3.971GlnAla: 3.971 ± 0.699
0.229GlnCys: 0.229 ± 0.129
1.833GlnAsp: 1.833 ± 0.376
3.131GlnGlu: 3.131 ± 0.569
0.687GlnPhe: 0.687 ± 0.232
3.36GlnGly: 3.36 ± 0.555
1.375GlnHis: 1.375 ± 0.381
1.604GlnIle: 1.604 ± 0.367
1.069GlnLys: 1.069 ± 0.244
4.124GlnLeu: 4.124 ± 0.685
0.764GlnMet: 0.764 ± 0.252
1.68GlnAsn: 1.68 ± 0.353
1.833GlnPro: 1.833 ± 0.428
2.52GlnGln: 2.52 ± 0.589
2.749GlnArg: 2.749 ± 0.401
2.215GlnSer: 2.215 ± 0.37
2.673GlnThr: 2.673 ± 0.396
2.291GlnVal: 2.291 ± 0.358
1.298GlnTrp: 1.298 ± 0.334
0.993GlnTyr: 0.993 ± 0.258
0.0GlnXaa: 0.0 ± 0.0
Arg
4.811ArgAla: 4.811 ± 0.531
0.382ArgCys: 0.382 ± 0.16
3.819ArgAsp: 3.819 ± 0.379
3.284ArgGlu: 3.284 ± 0.526
2.291ArgPhe: 2.291 ± 0.404
3.437ArgGly: 3.437 ± 0.437
0.916ArgHis: 0.916 ± 0.302
2.749ArgIle: 2.749 ± 0.353
3.208ArgLys: 3.208 ± 0.647
5.346ArgLeu: 5.346 ± 0.668
1.986ArgMet: 1.986 ± 0.421
1.604ArgAsn: 1.604 ± 0.294
2.673ArgPro: 2.673 ± 0.491
2.215ArgGln: 2.215 ± 0.317
4.124ArgArg: 4.124 ± 0.706
4.2ArgSer: 4.2 ± 0.616
3.36ArgThr: 3.36 ± 0.622
4.353ArgVal: 4.353 ± 0.552
0.84ArgTrp: 0.84 ± 0.204
1.68ArgTyr: 1.68 ± 0.379
0.0ArgXaa: 0.0 ± 0.0
Ser
5.804SerAla: 5.804 ± 0.907
0.305SerCys: 0.305 ± 0.173
3.589SerAsp: 3.589 ± 0.544
3.055SerGlu: 3.055 ± 0.539
2.749SerPhe: 2.749 ± 0.453
5.04SerGly: 5.04 ± 0.858
0.993SerHis: 0.993 ± 0.296
2.902SerIle: 2.902 ± 0.616
2.902SerLys: 2.902 ± 0.491
5.193SerLeu: 5.193 ± 0.55
2.062SerMet: 2.062 ± 0.444
1.68SerAsn: 1.68 ± 0.438
2.597SerPro: 2.597 ± 0.384
2.673SerGln: 2.673 ± 0.532
3.055SerArg: 3.055 ± 0.472
3.819SerSer: 3.819 ± 0.545
3.589SerThr: 3.589 ± 0.52
4.659SerVal: 4.659 ± 0.554
1.375SerTrp: 1.375 ± 0.322
1.909SerTyr: 1.909 ± 0.377
0.0SerXaa: 0.0 ± 0.0
Thr
5.422ThrAla: 5.422 ± 0.908
0.305ThrCys: 0.305 ± 0.156
3.208ThrAsp: 3.208 ± 0.4
3.895ThrGlu: 3.895 ± 0.484
2.902ThrPhe: 2.902 ± 0.441
5.651ThrGly: 5.651 ± 0.555
1.146ThrHis: 1.146 ± 0.365
3.437ThrIle: 3.437 ± 0.471
2.52ThrLys: 2.52 ± 0.471
6.415ThrLeu: 6.415 ± 0.711
1.298ThrMet: 1.298 ± 0.326
1.222ThrAsn: 1.222 ± 0.312
3.36ThrPro: 3.36 ± 0.444
2.291ThrGln: 2.291 ± 0.348
3.437ThrArg: 3.437 ± 0.477
3.513ThrSer: 3.513 ± 0.558
5.117ThrThr: 5.117 ± 0.715
5.881ThrVal: 5.881 ± 0.768
1.298ThrTrp: 1.298 ± 0.28
2.138ThrTyr: 2.138 ± 0.428
0.0ThrXaa: 0.0 ± 0.0
Val
7.79ValAla: 7.79 ± 0.837
0.535ValCys: 0.535 ± 0.245
4.811ValAsp: 4.811 ± 0.591
4.353ValGlu: 4.353 ± 0.496
1.757ValPhe: 1.757 ± 0.366
5.575ValGly: 5.575 ± 0.702
1.451ValHis: 1.451 ± 0.351
3.589ValIle: 3.589 ± 0.652
3.666ValLys: 3.666 ± 0.515
5.804ValLeu: 5.804 ± 0.638
1.68ValMet: 1.68 ± 0.499
3.36ValAsn: 3.36 ± 0.464
3.36ValPro: 3.36 ± 0.463
2.902ValGln: 2.902 ± 0.477
4.124ValArg: 4.124 ± 0.627
4.353ValSer: 4.353 ± 0.648
5.193ValThr: 5.193 ± 0.643
5.27ValVal: 5.27 ± 0.719
1.68ValTrp: 1.68 ± 0.4
2.673ValTyr: 2.673 ± 0.402
0.0ValXaa: 0.0 ± 0.0
Trp
1.298TrpAla: 1.298 ± 0.336
0.229TrpCys: 0.229 ± 0.131
1.986TrpAsp: 1.986 ± 0.378
1.146TrpGlu: 1.146 ± 0.284
0.764TrpPhe: 0.764 ± 0.221
1.604TrpGly: 1.604 ± 0.335
0.535TrpHis: 0.535 ± 0.166
1.146TrpIle: 1.146 ± 0.27
0.764TrpLys: 0.764 ± 0.236
2.138TrpLeu: 2.138 ± 0.378
0.229TrpMet: 0.229 ± 0.135
0.611TrpAsn: 0.611 ± 0.195
1.069TrpPro: 1.069 ± 0.297
0.687TrpGln: 0.687 ± 0.27
1.069TrpArg: 1.069 ± 0.276
1.069TrpSer: 1.069 ± 0.302
1.604TrpThr: 1.604 ± 0.306
1.68TrpVal: 1.68 ± 0.383
0.687TrpTrp: 0.687 ± 0.259
1.069TrpTyr: 1.069 ± 0.28
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.673TyrAla: 2.673 ± 0.384
0.305TyrCys: 0.305 ± 0.159
2.215TyrAsp: 2.215 ± 0.344
1.909TyrGlu: 1.909 ± 0.408
0.84TyrPhe: 0.84 ± 0.188
3.437TyrGly: 3.437 ± 0.606
0.382TyrHis: 0.382 ± 0.191
1.375TyrIle: 1.375 ± 0.254
1.527TyrLys: 1.527 ± 0.378
2.138TyrLeu: 2.138 ± 0.334
0.458TyrMet: 0.458 ± 0.187
0.993TyrAsn: 0.993 ± 0.325
1.451TyrPro: 1.451 ± 0.362
0.687TyrGln: 0.687 ± 0.22
2.062TyrArg: 2.062 ± 0.361
2.215TyrSer: 2.215 ± 0.391
1.451TyrThr: 1.451 ± 0.259
2.52TyrVal: 2.52 ± 0.554
0.993TyrTrp: 0.993 ± 0.269
1.222TyrTyr: 1.222 ± 0.338
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (13095 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski