Amino acid dipepetide frequency for Lactobacillus phage ATCCB

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.905AlaAla: 3.905 ± 0.829
0.378AlaCys: 0.378 ± 0.136
3.233AlaAsp: 3.233 ± 0.439
2.604AlaGlu: 2.604 ± 0.406
2.226AlaPhe: 2.226 ± 0.32
3.191AlaGly: 3.191 ± 0.622
1.176AlaHis: 1.176 ± 0.27
4.199AlaIle: 4.199 ± 0.457
4.829AlaLys: 4.829 ± 0.673
5.081AlaLeu: 5.081 ± 0.474
1.344AlaMet: 1.344 ± 0.211
4.493AlaAsn: 4.493 ± 0.596
1.764AlaPro: 1.764 ± 0.279
2.436AlaGln: 2.436 ± 0.389
1.89AlaArg: 1.89 ± 0.255
4.031AlaSer: 4.031 ± 0.638
3.611AlaThr: 3.611 ± 0.552
2.562AlaVal: 2.562 ± 0.329
1.008AlaTrp: 1.008 ± 0.22
2.142AlaTyr: 2.142 ± 0.275
0.0AlaXaa: 0.0 ± 0.0
Cys
0.588CysAla: 0.588 ± 0.161
0.084CysCys: 0.084 ± 0.055
0.63CysAsp: 0.63 ± 0.179
0.378CysGlu: 0.378 ± 0.151
0.588CysPhe: 0.588 ± 0.167
0.84CysGly: 0.84 ± 0.181
0.21CysHis: 0.21 ± 0.105
0.252CysIle: 0.252 ± 0.091
0.588CysLys: 0.588 ± 0.179
0.672CysLeu: 0.672 ± 0.183
0.168CysMet: 0.168 ± 0.073
0.588CysAsn: 0.588 ± 0.168
0.378CysPro: 0.378 ± 0.131
0.336CysGln: 0.336 ± 0.135
0.42CysArg: 0.42 ± 0.153
0.504CysSer: 0.504 ± 0.15
0.336CysThr: 0.336 ± 0.131
0.462CysVal: 0.462 ± 0.141
0.126CysTrp: 0.126 ± 0.076
0.546CysTyr: 0.546 ± 0.177
0.0CysXaa: 0.0 ± 0.0
Asp
3.695AspAla: 3.695 ± 0.429
0.378AspCys: 0.378 ± 0.109
5.333AspAsp: 5.333 ± 0.566
5.459AspGlu: 5.459 ± 0.542
3.065AspPhe: 3.065 ± 0.411
4.829AspGly: 4.829 ± 0.457
0.714AspHis: 0.714 ± 0.151
6.089AspIle: 6.089 ± 0.66
5.627AspLys: 5.627 ± 0.563
5.333AspLeu: 5.333 ± 0.676
1.68AspMet: 1.68 ± 0.265
5.165AspAsn: 5.165 ± 0.568
1.386AspPro: 1.386 ± 0.304
1.134AspGln: 1.134 ± 0.173
1.386AspArg: 1.386 ± 0.237
4.619AspSer: 4.619 ± 0.386
3.443AspThr: 3.443 ± 0.543
4.241AspVal: 4.241 ± 0.446
0.588AspTrp: 0.588 ± 0.173
2.771AspTyr: 2.771 ± 0.404
0.0AspXaa: 0.0 ± 0.0
Glu
2.771GluAla: 2.771 ± 0.496
0.336GluCys: 0.336 ± 0.115
3.443GluAsp: 3.443 ± 0.396
3.317GluGlu: 3.317 ± 0.491
2.016GluPhe: 2.016 ± 0.267
2.226GluGly: 2.226 ± 0.281
0.882GluHis: 0.882 ± 0.188
4.619GluIle: 4.619 ± 0.598
6.845GluLys: 6.845 ± 0.595
6.005GluLeu: 6.005 ± 0.664
2.226GluMet: 2.226 ± 0.349
5.207GluAsn: 5.207 ± 0.633
1.008GluPro: 1.008 ± 0.234
2.604GluGln: 2.604 ± 0.392
1.68GluArg: 1.68 ± 0.319
3.275GluSer: 3.275 ± 0.388
2.52GluThr: 2.52 ± 0.398
2.646GluVal: 2.646 ± 0.439
0.42GluTrp: 0.42 ± 0.146
3.317GluTyr: 3.317 ± 0.389
0.0GluXaa: 0.0 ± 0.0
Phe
2.226PheAla: 2.226 ± 0.279
0.294PheCys: 0.294 ± 0.116
3.359PheAsp: 3.359 ± 0.352
2.687PheGlu: 2.687 ± 0.366
1.596PhePhe: 1.596 ± 0.29
3.317PheGly: 3.317 ± 0.351
0.42PheHis: 0.42 ± 0.111
3.149PheIle: 3.149 ± 0.528
3.317PheLys: 3.317 ± 0.461
2.687PheLeu: 2.687 ± 0.396
1.05PheMet: 1.05 ± 0.2
4.031PheAsn: 4.031 ± 0.541
0.756PhePro: 0.756 ± 0.182
1.176PheGln: 1.176 ± 0.215
1.176PheArg: 1.176 ± 0.204
3.485PheSer: 3.485 ± 0.418
2.478PheThr: 2.478 ± 0.304
2.436PheVal: 2.436 ± 0.334
0.252PheTrp: 0.252 ± 0.086
2.226PheTyr: 2.226 ± 0.313
0.0PheXaa: 0.0 ± 0.0
Gly
3.653GlyAla: 3.653 ± 0.609
0.504GlyCys: 0.504 ± 0.146
3.233GlyAsp: 3.233 ± 0.432
2.646GlyGlu: 2.646 ± 0.237
2.855GlyPhe: 2.855 ± 0.38
3.191GlyGly: 3.191 ± 0.765
1.05GlyHis: 1.05 ± 0.191
4.913GlyIle: 4.913 ± 0.511
6.257GlyLys: 6.257 ± 0.519
4.367GlyLeu: 4.367 ± 0.546
1.512GlyMet: 1.512 ± 0.242
4.829GlyAsn: 4.829 ± 0.54
0.882GlyPro: 0.882 ± 0.223
1.554GlyGln: 1.554 ± 0.254
1.89GlyArg: 1.89 ± 0.302
4.031GlySer: 4.031 ± 0.586
4.241GlyThr: 4.241 ± 0.48
4.409GlyVal: 4.409 ± 0.42
0.672GlyTrp: 0.672 ± 0.158
2.897GlyTyr: 2.897 ± 0.334
0.0GlyXaa: 0.0 ± 0.0
His
0.966HisAla: 0.966 ± 0.248
0.042HisCys: 0.042 ± 0.04
1.26HisAsp: 1.26 ± 0.215
1.008HisGlu: 1.008 ± 0.223
1.05HisPhe: 1.05 ± 0.22
1.26HisGly: 1.26 ± 0.239
0.63HisHis: 0.63 ± 0.148
1.638HisIle: 1.638 ± 0.252
1.47HisLys: 1.47 ± 0.287
0.924HisLeu: 0.924 ± 0.177
0.378HisMet: 0.378 ± 0.127
1.428HisAsn: 1.428 ± 0.248
0.672HisPro: 0.672 ± 0.174
0.672HisGln: 0.672 ± 0.17
0.672HisArg: 0.672 ± 0.178
1.386HisSer: 1.386 ± 0.228
0.756HisThr: 0.756 ± 0.168
1.344HisVal: 1.344 ± 0.263
0.252HisTrp: 0.252 ± 0.102
0.84HisTyr: 0.84 ± 0.182
0.0HisXaa: 0.0 ± 0.0
Ile
4.493IleAla: 4.493 ± 0.436
1.302IleCys: 1.302 ± 0.256
6.005IleAsp: 6.005 ± 0.575
4.955IleGlu: 4.955 ± 0.541
2.687IlePhe: 2.687 ± 0.363
3.401IleGly: 3.401 ± 0.523
1.554IleHis: 1.554 ± 0.251
5.753IleIle: 5.753 ± 0.617
7.139IleLys: 7.139 ± 0.645
5.249IleLeu: 5.249 ± 0.526
1.47IleMet: 1.47 ± 0.28
5.165IleAsn: 5.165 ± 0.533
2.604IlePro: 2.604 ± 0.372
2.562IleGln: 2.562 ± 0.408
2.478IleArg: 2.478 ± 0.412
7.559IleSer: 7.559 ± 0.61
5.165IleThr: 5.165 ± 0.723
5.039IleVal: 5.039 ± 0.458
0.63IleTrp: 0.63 ± 0.188
2.981IleTyr: 2.981 ± 0.36
0.0IleXaa: 0.0 ± 0.0
Lys
4.367LysAla: 4.367 ± 0.62
0.588LysCys: 0.588 ± 0.171
5.039LysAsp: 5.039 ± 0.373
5.627LysGlu: 5.627 ± 0.655
4.325LysPhe: 4.325 ± 0.487
4.157LysGly: 4.157 ± 0.337
1.554LysHis: 1.554 ± 0.272
6.971LysIle: 6.971 ± 0.587
8.86LysLys: 8.86 ± 0.871
7.391LysLeu: 7.391 ± 0.535
3.107LysMet: 3.107 ± 0.359
8.482LysAsn: 8.482 ± 0.708
2.016LysPro: 2.016 ± 0.245
4.283LysGln: 4.283 ± 0.529
2.687LysArg: 2.687 ± 0.359
5.753LysSer: 5.753 ± 0.578
5.039LysThr: 5.039 ± 0.409
4.619LysVal: 4.619 ± 0.437
0.882LysTrp: 0.882 ± 0.197
5.207LysTyr: 5.207 ± 0.483
0.0LysXaa: 0.0 ± 0.0
Leu
4.115LeuAla: 4.115 ± 0.427
0.798LeuCys: 0.798 ± 0.213
5.501LeuAsp: 5.501 ± 0.501
4.535LeuGlu: 4.535 ± 0.516
2.939LeuPhe: 2.939 ± 0.446
4.073LeuGly: 4.073 ± 0.381
1.47LeuHis: 1.47 ± 0.277
5.837LeuIle: 5.837 ± 0.554
7.769LeuLys: 7.769 ± 0.634
5.291LeuLeu: 5.291 ± 0.549
1.932LeuMet: 1.932 ± 0.278
6.887LeuAsn: 6.887 ± 0.599
2.226LeuPro: 2.226 ± 0.267
3.275LeuGln: 3.275 ± 0.421
2.897LeuArg: 2.897 ± 0.39
6.719LeuSer: 6.719 ± 0.508
4.745LeuThr: 4.745 ± 0.454
4.367LeuVal: 4.367 ± 0.404
0.84LeuTrp: 0.84 ± 0.163
2.729LeuTyr: 2.729 ± 0.317
0.0LeuXaa: 0.0 ± 0.0
Met
1.47MetAla: 1.47 ± 0.24
0.21MetCys: 0.21 ± 0.094
1.176MetAsp: 1.176 ± 0.229
1.092MetGlu: 1.092 ± 0.229
1.386MetPhe: 1.386 ± 0.236
1.218MetGly: 1.218 ± 0.208
0.714MetHis: 0.714 ± 0.174
1.932MetIle: 1.932 ± 0.233
2.31MetLys: 2.31 ± 0.341
2.562MetLeu: 2.562 ± 0.346
0.924MetMet: 0.924 ± 0.191
1.764MetAsn: 1.764 ± 0.245
0.63MetPro: 0.63 ± 0.18
0.966MetGln: 0.966 ± 0.171
0.924MetArg: 0.924 ± 0.182
1.68MetSer: 1.68 ± 0.279
1.932MetThr: 1.932 ± 0.212
1.26MetVal: 1.26 ± 0.238
0.252MetTrp: 0.252 ± 0.089
1.26MetTyr: 1.26 ± 0.211
0.0MetXaa: 0.0 ± 0.0
Asn
3.653AsnAla: 3.653 ± 0.42
0.63AsnCys: 0.63 ± 0.167
4.997AsnAsp: 4.997 ± 0.54
4.745AsnGlu: 4.745 ± 0.423
3.485AsnPhe: 3.485 ± 0.449
5.753AsnGly: 5.753 ± 0.558
2.016AsnHis: 2.016 ± 0.277
6.929AsnIle: 6.929 ± 0.584
7.097AsnLys: 7.097 ± 0.673
6.299AsnLeu: 6.299 ± 0.797
2.352AsnMet: 2.352 ± 0.304
5.879AsnAsn: 5.879 ± 0.669
3.023AsnPro: 3.023 ± 0.373
2.52AsnGln: 2.52 ± 0.353
2.058AsnArg: 2.058 ± 0.298
6.173AsnSer: 6.173 ± 0.591
3.947AsnThr: 3.947 ± 0.368
3.989AsnVal: 3.989 ± 0.475
0.504AsnTrp: 0.504 ± 0.148
2.855AsnTyr: 2.855 ± 0.343
0.0AsnXaa: 0.0 ± 0.0
Pro
2.1ProAla: 2.1 ± 0.362
0.21ProCys: 0.21 ± 0.098
1.68ProAsp: 1.68 ± 0.278
2.1ProGlu: 2.1 ± 0.283
1.134ProPhe: 1.134 ± 0.206
1.26ProGly: 1.26 ± 0.225
0.252ProHis: 0.252 ± 0.108
2.813ProIle: 2.813 ± 0.309
1.722ProLys: 1.722 ± 0.238
1.932ProLeu: 1.932 ± 0.245
0.924ProMet: 0.924 ± 0.164
2.142ProAsn: 2.142 ± 0.373
0.168ProPro: 0.168 ± 0.091
1.176ProGln: 1.176 ± 0.197
0.63ProArg: 0.63 ± 0.163
2.226ProSer: 2.226 ± 0.357
2.184ProThr: 2.184 ± 0.355
1.554ProVal: 1.554 ± 0.244
0.126ProTrp: 0.126 ± 0.065
1.008ProTyr: 1.008 ± 0.256
0.0ProXaa: 0.0 ± 0.0
Gln
2.394GlnAla: 2.394 ± 0.318
0.21GlnCys: 0.21 ± 0.116
1.806GlnAsp: 1.806 ± 0.289
1.554GlnGlu: 1.554 ± 0.258
1.218GlnPhe: 1.218 ± 0.271
1.974GlnGly: 1.974 ± 0.277
0.588GlnHis: 0.588 ± 0.15
3.149GlnIle: 3.149 ± 0.378
4.115GlnLys: 4.115 ± 0.427
3.695GlnLeu: 3.695 ± 0.367
0.84GlnMet: 0.84 ± 0.15
2.897GlnAsn: 2.897 ± 0.367
1.386GlnPro: 1.386 ± 0.309
1.89GlnGln: 1.89 ± 0.366
1.386GlnArg: 1.386 ± 0.255
2.478GlnSer: 2.478 ± 0.276
2.1GlnThr: 2.1 ± 0.301
1.68GlnVal: 1.68 ± 0.27
0.42GlnTrp: 0.42 ± 0.13
2.352GlnTyr: 2.352 ± 0.415
0.0GlnXaa: 0.0 ± 0.0
Arg
1.302ArgAla: 1.302 ± 0.19
0.42ArgCys: 0.42 ± 0.164
2.646ArgAsp: 2.646 ± 0.396
1.68ArgGlu: 1.68 ± 0.246
1.386ArgPhe: 1.386 ± 0.26
1.806ArgGly: 1.806 ± 0.279
0.588ArgHis: 0.588 ± 0.129
2.184ArgIle: 2.184 ± 0.332
3.149ArgLys: 3.149 ± 0.429
2.31ArgLeu: 2.31 ± 0.336
0.672ArgMet: 0.672 ± 0.174
3.023ArgAsn: 3.023 ± 0.349
0.462ArgPro: 0.462 ± 0.128
1.218ArgGln: 1.218 ± 0.227
1.344ArgArg: 1.344 ± 0.264
1.638ArgSer: 1.638 ± 0.312
1.344ArgThr: 1.344 ± 0.249
1.89ArgVal: 1.89 ± 0.317
0.378ArgTrp: 0.378 ± 0.105
1.26ArgTyr: 1.26 ± 0.242
0.0ArgXaa: 0.0 ± 0.0
Ser
4.913SerAla: 4.913 ± 0.845
0.63SerCys: 0.63 ± 0.185
5.375SerAsp: 5.375 ± 0.49
4.367SerGlu: 4.367 ± 0.521
2.939SerPhe: 2.939 ± 0.383
5.459SerGly: 5.459 ± 0.704
1.092SerHis: 1.092 ± 0.213
5.291SerIle: 5.291 ± 0.646
5.879SerLys: 5.879 ± 0.548
5.375SerLeu: 5.375 ± 0.522
1.134SerMet: 1.134 ± 0.244
5.207SerAsn: 5.207 ± 0.408
2.016SerPro: 2.016 ± 0.338
3.275SerGln: 3.275 ± 0.427
2.058SerArg: 2.058 ± 0.271
4.913SerSer: 4.913 ± 0.766
4.409SerThr: 4.409 ± 0.413
4.115SerVal: 4.115 ± 0.377
0.756SerTrp: 0.756 ± 0.179
3.191SerTyr: 3.191 ± 0.403
0.0SerXaa: 0.0 ± 0.0
Thr
3.695ThrAla: 3.695 ± 0.528
0.252ThrCys: 0.252 ± 0.115
3.737ThrAsp: 3.737 ± 0.36
2.52ThrGlu: 2.52 ± 0.318
2.268ThrPhe: 2.268 ± 0.316
4.073ThrGly: 4.073 ± 0.552
1.218ThrHis: 1.218 ± 0.25
5.291ThrIle: 5.291 ± 0.452
5.837ThrLys: 5.837 ± 0.589
4.535ThrLeu: 4.535 ± 0.43
1.092ThrMet: 1.092 ± 0.221
4.409ThrAsn: 4.409 ± 0.392
2.52ThrPro: 2.52 ± 0.458
2.52ThrGln: 2.52 ± 0.328
1.68ThrArg: 1.68 ± 0.234
3.527ThrSer: 3.527 ± 0.387
4.157ThrThr: 4.157 ± 0.486
3.737ThrVal: 3.737 ± 0.449
0.546ThrTrp: 0.546 ± 0.193
2.016ThrTyr: 2.016 ± 0.349
0.0ThrXaa: 0.0 ± 0.0
Val
3.191ValAla: 3.191 ± 0.384
0.504ValCys: 0.504 ± 0.156
4.073ValAsp: 4.073 ± 0.524
3.611ValGlu: 3.611 ± 0.449
2.31ValPhe: 2.31 ± 0.292
3.947ValGly: 3.947 ± 0.61
1.176ValHis: 1.176 ± 0.24
3.863ValIle: 3.863 ± 0.41
4.493ValLys: 4.493 ± 0.455
4.577ValLeu: 4.577 ± 0.471
1.638ValMet: 1.638 ± 0.234
3.737ValAsn: 3.737 ± 0.393
1.764ValPro: 1.764 ± 0.333
1.974ValGln: 1.974 ± 0.348
1.386ValArg: 1.386 ± 0.244
4.535ValSer: 4.535 ± 0.475
3.611ValThr: 3.611 ± 0.439
2.729ValVal: 2.729 ± 0.331
0.63ValTrp: 0.63 ± 0.132
2.058ValTyr: 2.058 ± 0.317
0.0ValXaa: 0.0 ± 0.0
Trp
0.546TrpAla: 0.546 ± 0.165
0.168TrpCys: 0.168 ± 0.082
0.798TrpAsp: 0.798 ± 0.198
0.504TrpGlu: 0.504 ± 0.13
0.546TrpPhe: 0.546 ± 0.147
0.336TrpGly: 0.336 ± 0.134
0.294TrpHis: 0.294 ± 0.105
0.672TrpIle: 0.672 ± 0.166
0.924TrpLys: 0.924 ± 0.196
0.672TrpLeu: 0.672 ± 0.151
0.378TrpMet: 0.378 ± 0.126
0.462TrpAsn: 0.462 ± 0.125
0.168TrpPro: 0.168 ± 0.077
0.504TrpGln: 0.504 ± 0.173
0.504TrpArg: 0.504 ± 0.13
0.882TrpSer: 0.882 ± 0.186
0.546TrpThr: 0.546 ± 0.153
0.588TrpVal: 0.588 ± 0.122
0.168TrpTrp: 0.168 ± 0.065
0.462TrpTyr: 0.462 ± 0.115
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.058TyrAla: 2.058 ± 0.275
0.756TyrCys: 0.756 ± 0.179
3.611TyrAsp: 3.611 ± 0.423
1.974TyrGlu: 1.974 ± 0.302
2.1TyrPhe: 2.1 ± 0.357
3.275TyrGly: 3.275 ± 0.331
1.05TyrHis: 1.05 ± 0.215
2.646TyrIle: 2.646 ± 0.379
2.771TyrLys: 2.771 ± 0.306
3.989TyrLeu: 3.989 ± 0.559
0.798TyrMet: 0.798 ± 0.184
3.107TyrAsn: 3.107 ± 0.311
1.512TyrPro: 1.512 ± 0.181
2.058TyrGln: 2.058 ± 0.286
1.554TyrArg: 1.554 ± 0.238
2.981TyrSer: 2.981 ± 0.463
3.065TyrThr: 3.065 ± 0.443
2.142TyrVal: 2.142 ± 0.293
0.672TyrTrp: 0.672 ± 0.134
1.68TyrTyr: 1.68 ± 0.268
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 96 proteins (23815 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski