Amino acid dipepetide frequency for Escherichia phage T1 (Bacteriophage T1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.23AlaAla: 8.23 ± 0.987
1.195AlaCys: 1.195 ± 0.288
3.849AlaAsp: 3.849 ± 0.421
5.177AlaGlu: 5.177 ± 0.62
2.057AlaPhe: 2.057 ± 0.434
5.641AlaGly: 5.641 ± 0.619
0.996AlaHis: 0.996 ± 0.256
6.372AlaIle: 6.372 ± 0.65
6.305AlaLys: 6.305 ± 0.831
6.372AlaLeu: 6.372 ± 0.656
2.655AlaMet: 2.655 ± 0.319
2.987AlaAsn: 2.987 ± 0.386
1.659AlaPro: 1.659 ± 0.297
3.916AlaGln: 3.916 ± 0.533
4.38AlaArg: 4.38 ± 0.549
6.04AlaSer: 6.04 ± 1.027
4.513AlaThr: 4.513 ± 0.605
4.978AlaVal: 4.978 ± 0.53
1.128AlaTrp: 1.128 ± 0.241
2.522AlaTyr: 2.522 ± 0.448
0.0AlaXaa: 0.0 ± 0.0
Cys
0.531CysAla: 0.531 ± 0.189
0.199CysCys: 0.199 ± 0.11
0.996CysAsp: 0.996 ± 0.236
0.796CysGlu: 0.796 ± 0.229
0.332CysPhe: 0.332 ± 0.145
0.996CysGly: 0.996 ± 0.313
0.398CysHis: 0.398 ± 0.196
0.796CysIle: 0.796 ± 0.198
1.128CysLys: 1.128 ± 0.332
0.796CysLeu: 0.796 ± 0.203
0.332CysMet: 0.332 ± 0.128
0.73CysAsn: 0.73 ± 0.195
0.597CysPro: 0.597 ± 0.195
0.332CysGln: 0.332 ± 0.135
0.664CysArg: 0.664 ± 0.199
0.73CysSer: 0.73 ± 0.224
0.73CysThr: 0.73 ± 0.23
0.996CysVal: 0.996 ± 0.281
0.398CysTrp: 0.398 ± 0.154
0.465CysTyr: 0.465 ± 0.17
0.0CysXaa: 0.0 ± 0.0
Asp
5.708AspAla: 5.708 ± 0.556
0.398AspCys: 0.398 ± 0.142
3.451AspAsp: 3.451 ± 0.597
4.646AspGlu: 4.646 ± 0.552
2.788AspPhe: 2.788 ± 0.445
6.305AspGly: 6.305 ± 0.792
1.062AspHis: 1.062 ± 0.243
3.65AspIle: 3.65 ± 0.421
3.916AspLys: 3.916 ± 0.452
3.916AspLeu: 3.916 ± 0.518
1.792AspMet: 1.792 ± 0.328
3.849AspAsn: 3.849 ± 0.515
1.858AspPro: 1.858 ± 0.354
1.46AspGln: 1.46 ± 0.364
2.323AspArg: 2.323 ± 0.493
4.513AspSer: 4.513 ± 0.52
2.721AspThr: 2.721 ± 0.38
4.513AspVal: 4.513 ± 0.529
0.73AspTrp: 0.73 ± 0.209
2.19AspTyr: 2.19 ± 0.328
0.0AspXaa: 0.0 ± 0.0
Glu
5.907GluAla: 5.907 ± 0.549
0.929GluCys: 0.929 ± 0.261
3.849GluAsp: 3.849 ± 0.481
4.447GluGlu: 4.447 ± 0.626
2.854GluPhe: 2.854 ± 0.45
3.119GluGly: 3.119 ± 0.386
0.996GluHis: 0.996 ± 0.248
5.907GluIle: 5.907 ± 0.537
4.447GluLys: 4.447 ± 0.701
4.58GluLeu: 4.58 ± 0.534
2.522GluMet: 2.522 ± 0.354
3.119GluAsn: 3.119 ± 0.402
1.659GluPro: 1.659 ± 0.415
2.655GluGln: 2.655 ± 0.449
3.385GluArg: 3.385 ± 0.601
3.982GluSer: 3.982 ± 0.459
3.451GluThr: 3.451 ± 0.505
4.978GluVal: 4.978 ± 0.583
0.664GluTrp: 0.664 ± 0.15
2.854GluTyr: 2.854 ± 0.335
0.0GluXaa: 0.0 ± 0.0
Phe
2.655PheAla: 2.655 ± 0.356
0.863PheCys: 0.863 ± 0.243
2.854PheAsp: 2.854 ± 0.456
2.854PheGlu: 2.854 ± 0.409
1.394PhePhe: 1.394 ± 0.375
3.385PheGly: 3.385 ± 0.48
0.73PheHis: 0.73 ± 0.265
2.588PheIle: 2.588 ± 0.393
2.721PheLys: 2.721 ± 0.438
1.659PheLeu: 1.659 ± 0.287
1.46PheMet: 1.46 ± 0.306
2.588PheAsn: 2.588 ± 0.431
1.327PhePro: 1.327 ± 0.292
0.996PheGln: 0.996 ± 0.303
1.527PheArg: 1.527 ± 0.31
2.19PheSer: 2.19 ± 0.433
2.788PheThr: 2.788 ± 0.382
2.124PheVal: 2.124 ± 0.367
0.796PheTrp: 0.796 ± 0.235
1.261PheTyr: 1.261 ± 0.287
0.0PheXaa: 0.0 ± 0.0
Gly
4.911GlyAla: 4.911 ± 0.773
1.394GlyCys: 1.394 ± 0.28
4.911GlyAsp: 4.911 ± 0.539
4.58GlyGlu: 4.58 ± 0.398
2.522GlyPhe: 2.522 ± 0.374
6.106GlyGly: 6.106 ± 0.899
1.195GlyHis: 1.195 ± 0.36
3.584GlyIle: 3.584 ± 0.395
5.973GlyLys: 5.973 ± 0.536
3.584GlyLeu: 3.584 ± 0.339
2.854GlyMet: 2.854 ± 0.486
3.518GlyAsn: 3.518 ± 0.532
0.664GlyPro: 0.664 ± 0.243
2.655GlyGln: 2.655 ± 0.424
3.385GlyArg: 3.385 ± 0.465
5.376GlySer: 5.376 ± 0.558
3.783GlyThr: 3.783 ± 0.561
6.172GlyVal: 6.172 ± 0.62
1.394GlyTrp: 1.394 ± 0.227
3.186GlyTyr: 3.186 ± 0.471
0.0GlyXaa: 0.0 ± 0.0
His
0.796HisAla: 0.796 ± 0.33
0.133HisCys: 0.133 ± 0.079
1.327HisAsp: 1.327 ± 0.278
1.593HisGlu: 1.593 ± 0.311
0.597HisPhe: 0.597 ± 0.217
1.327HisGly: 1.327 ± 0.297
0.465HisHis: 0.465 ± 0.169
1.195HisIle: 1.195 ± 0.314
1.261HisLys: 1.261 ± 0.298
1.327HisLeu: 1.327 ± 0.318
0.398HisMet: 0.398 ± 0.155
0.996HisAsn: 0.996 ± 0.299
0.664HisPro: 0.664 ± 0.215
0.597HisGln: 0.597 ± 0.207
1.195HisArg: 1.195 ± 0.28
0.929HisSer: 0.929 ± 0.261
0.863HisThr: 0.863 ± 0.248
1.128HisVal: 1.128 ± 0.271
0.133HisTrp: 0.133 ± 0.099
1.128HisTyr: 1.128 ± 0.294
0.0HisXaa: 0.0 ± 0.0
Ile
5.973IleAla: 5.973 ± 0.573
0.863IleCys: 0.863 ± 0.213
5.177IleAsp: 5.177 ± 0.551
5.177IleGlu: 5.177 ± 0.499
2.323IlePhe: 2.323 ± 0.343
4.58IleGly: 4.58 ± 0.532
0.863IleHis: 0.863 ± 0.275
3.717IleIle: 3.717 ± 0.385
5.111IleLys: 5.111 ± 0.534
3.518IleLeu: 3.518 ± 0.524
1.925IleMet: 1.925 ± 0.357
3.518IleAsn: 3.518 ± 0.621
2.522IlePro: 2.522 ± 0.437
2.854IleGln: 2.854 ± 0.436
2.987IleArg: 2.987 ± 0.4
4.447IleSer: 4.447 ± 0.522
5.376IleThr: 5.376 ± 0.584
3.982IleVal: 3.982 ± 0.447
0.73IleTrp: 0.73 ± 0.213
2.124IleTyr: 2.124 ± 0.372
0.0IleXaa: 0.0 ± 0.0
Lys
6.903LysAla: 6.903 ± 0.857
0.929LysCys: 0.929 ± 0.263
4.115LysAsp: 4.115 ± 0.596
5.509LysGlu: 5.509 ± 0.625
2.456LysPhe: 2.456 ± 0.558
3.916LysGly: 3.916 ± 0.393
1.792LysHis: 1.792 ± 0.359
4.712LysIle: 4.712 ± 0.598
4.779LysLys: 4.779 ± 0.59
4.58LysLeu: 4.58 ± 0.511
2.588LysMet: 2.588 ± 0.432
3.385LysAsn: 3.385 ± 0.474
2.721LysPro: 2.721 ± 0.522
2.92LysGln: 2.92 ± 0.504
3.982LysArg: 3.982 ± 0.537
3.982LysSer: 3.982 ± 0.454
3.783LysThr: 3.783 ± 0.498
4.447LysVal: 4.447 ± 0.5
0.73LysTrp: 0.73 ± 0.199
2.124LysTyr: 2.124 ± 0.331
0.0LysXaa: 0.0 ± 0.0
Leu
4.845LeuAla: 4.845 ± 0.721
0.664LeuCys: 0.664 ± 0.22
4.049LeuAsp: 4.049 ± 0.458
3.319LeuGlu: 3.319 ± 0.517
2.257LeuPhe: 2.257 ± 0.352
3.849LeuGly: 3.849 ± 0.519
1.327LeuHis: 1.327 ± 0.362
4.779LeuIle: 4.779 ± 0.478
4.845LeuLys: 4.845 ± 0.579
4.447LeuLeu: 4.447 ± 0.534
1.726LeuMet: 1.726 ± 0.264
2.788LeuAsn: 2.788 ± 0.519
3.053LeuPro: 3.053 ± 0.401
1.527LeuGln: 1.527 ± 0.342
3.518LeuArg: 3.518 ± 0.469
5.575LeuSer: 5.575 ± 0.52
4.248LeuThr: 4.248 ± 0.474
4.712LeuVal: 4.712 ± 0.642
0.996LeuTrp: 0.996 ± 0.228
2.124LeuTyr: 2.124 ± 0.359
0.0LeuXaa: 0.0 ± 0.0
Met
2.257MetAla: 2.257 ± 0.419
0.332MetCys: 0.332 ± 0.136
1.46MetAsp: 1.46 ± 0.304
2.389MetGlu: 2.389 ± 0.446
1.527MetPhe: 1.527 ± 0.336
0.996MetGly: 0.996 ± 0.31
0.597MetHis: 0.597 ± 0.213
2.389MetIle: 2.389 ± 0.449
2.655MetLys: 2.655 ± 0.307
2.124MetLeu: 2.124 ± 0.389
1.792MetMet: 1.792 ± 0.37
1.46MetAsn: 1.46 ± 0.257
0.863MetPro: 0.863 ± 0.205
1.195MetGln: 1.195 ± 0.285
1.726MetArg: 1.726 ± 0.312
2.456MetSer: 2.456 ± 0.341
1.593MetThr: 1.593 ± 0.312
2.19MetVal: 2.19 ± 0.344
0.199MetTrp: 0.199 ± 0.098
0.531MetTyr: 0.531 ± 0.187
0.0MetXaa: 0.0 ± 0.0
Asn
4.115AsnAla: 4.115 ± 0.615
0.398AsnCys: 0.398 ± 0.191
2.788AsnAsp: 2.788 ± 0.471
2.92AsnGlu: 2.92 ± 0.359
2.323AsnPhe: 2.323 ± 0.437
4.911AsnGly: 4.911 ± 0.784
1.261AsnHis: 1.261 ± 0.268
3.319AsnIle: 3.319 ± 0.357
3.385AsnLys: 3.385 ± 0.546
3.319AsnLeu: 3.319 ± 0.456
0.929AsnMet: 0.929 ± 0.254
2.588AsnAsn: 2.588 ± 0.415
1.858AsnPro: 1.858 ± 0.307
2.257AsnGln: 2.257 ± 0.382
1.858AsnArg: 1.858 ± 0.339
3.916AsnSer: 3.916 ± 0.549
2.323AsnThr: 2.323 ± 0.357
3.65AsnVal: 3.65 ± 0.466
0.796AsnTrp: 0.796 ± 0.2
1.527AsnTyr: 1.527 ± 0.28
0.0AsnXaa: 0.0 ± 0.0
Pro
2.057ProAla: 2.057 ± 0.313
0.465ProCys: 0.465 ± 0.205
2.456ProAsp: 2.456 ± 0.435
2.721ProGlu: 2.721 ± 0.507
1.261ProPhe: 1.261 ± 0.247
2.987ProGly: 2.987 ± 0.49
0.664ProHis: 0.664 ± 0.185
2.19ProIle: 2.19 ± 0.408
1.593ProLys: 1.593 ± 0.357
1.46ProLeu: 1.46 ± 0.267
0.796ProMet: 0.796 ± 0.209
2.124ProAsn: 2.124 ± 0.359
1.195ProPro: 1.195 ± 0.429
0.996ProGln: 0.996 ± 0.254
1.327ProArg: 1.327 ± 0.285
1.991ProSer: 1.991 ± 0.345
1.659ProThr: 1.659 ± 0.317
3.252ProVal: 3.252 ± 0.381
0.332ProTrp: 0.332 ± 0.138
1.327ProTyr: 1.327 ± 0.269
0.0ProXaa: 0.0 ± 0.0
Gln
3.584GlnAla: 3.584 ± 0.463
0.265GlnCys: 0.265 ± 0.141
1.659GlnAsp: 1.659 ± 0.352
1.925GlnGlu: 1.925 ± 0.352
1.261GlnPhe: 1.261 ± 0.343
1.991GlnGly: 1.991 ± 0.441
0.465GlnHis: 0.465 ± 0.155
2.588GlnIle: 2.588 ± 0.397
2.389GlnLys: 2.389 ± 0.404
3.319GlnLeu: 3.319 ± 0.521
0.929GlnMet: 0.929 ± 0.212
1.991GlnAsn: 1.991 ± 0.31
1.195GlnPro: 1.195 ± 0.384
2.389GlnGln: 2.389 ± 0.687
1.991GlnArg: 1.991 ± 0.424
2.721GlnSer: 2.721 ± 0.507
1.792GlnThr: 1.792 ± 0.418
3.053GlnVal: 3.053 ± 0.378
0.796GlnTrp: 0.796 ± 0.205
1.46GlnTyr: 1.46 ± 0.26
0.0GlnXaa: 0.0 ± 0.0
Arg
3.717ArgAla: 3.717 ± 0.445
0.929ArgCys: 0.929 ± 0.315
2.522ArgAsp: 2.522 ± 0.341
3.916ArgGlu: 3.916 ± 0.564
2.19ArgPhe: 2.19 ± 0.378
2.987ArgGly: 2.987 ± 0.441
0.796ArgHis: 0.796 ± 0.195
3.252ArgIle: 3.252 ± 0.478
3.717ArgLys: 3.717 ± 0.339
4.049ArgLeu: 4.049 ± 0.522
1.261ArgMet: 1.261 ± 0.299
2.655ArgAsn: 2.655 ± 0.452
1.792ArgPro: 1.792 ± 0.443
1.593ArgGln: 1.593 ± 0.373
3.053ArgArg: 3.053 ± 0.515
2.788ArgSer: 2.788 ± 0.397
1.726ArgThr: 1.726 ± 0.455
4.248ArgVal: 4.248 ± 0.528
0.73ArgTrp: 0.73 ± 0.215
2.124ArgTyr: 2.124 ± 0.407
0.0ArgXaa: 0.0 ± 0.0
Ser
5.376SerAla: 5.376 ± 0.817
0.929SerCys: 0.929 ± 0.255
4.447SerAsp: 4.447 ± 0.508
4.049SerGlu: 4.049 ± 0.468
3.053SerPhe: 3.053 ± 0.388
6.77SerGly: 6.77 ± 0.676
0.796SerHis: 0.796 ± 0.215
4.248SerIle: 4.248 ± 0.414
3.65SerLys: 3.65 ± 0.525
5.509SerLeu: 5.509 ± 0.53
1.394SerMet: 1.394 ± 0.286
2.655SerAsn: 2.655 ± 0.54
2.124SerPro: 2.124 ± 0.347
2.854SerGln: 2.854 ± 0.396
3.584SerArg: 3.584 ± 0.372
4.181SerSer: 4.181 ± 0.611
3.717SerThr: 3.717 ± 0.447
5.509SerVal: 5.509 ± 0.462
1.062SerTrp: 1.062 ± 0.244
1.792SerTyr: 1.792 ± 0.349
0.0SerXaa: 0.0 ± 0.0
Thr
4.58ThrAla: 4.58 ± 0.57
0.929ThrCys: 0.929 ± 0.214
3.252ThrAsp: 3.252 ± 0.556
3.252ThrGlu: 3.252 ± 0.464
2.987ThrPhe: 2.987 ± 0.469
4.911ThrGly: 4.911 ± 0.538
1.128ThrHis: 1.128 ± 0.293
4.181ThrIle: 4.181 ± 0.514
3.053ThrLys: 3.053 ± 0.424
3.319ThrLeu: 3.319 ± 0.468
1.593ThrMet: 1.593 ± 0.344
2.92ThrAsn: 2.92 ± 0.506
2.92ThrPro: 2.92 ± 0.553
1.925ThrGln: 1.925 ± 0.333
2.19ThrArg: 2.19 ± 0.354
3.385ThrSer: 3.385 ± 0.47
3.119ThrThr: 3.119 ± 0.535
4.181ThrVal: 4.181 ± 0.536
0.73ThrTrp: 0.73 ± 0.214
1.925ThrTyr: 1.925 ± 0.316
0.0ThrXaa: 0.0 ± 0.0
Val
4.978ValAla: 4.978 ± 0.558
0.73ValCys: 0.73 ± 0.201
4.845ValAsp: 4.845 ± 0.539
4.314ValGlu: 4.314 ± 0.585
2.987ValPhe: 2.987 ± 0.392
3.584ValGly: 3.584 ± 0.486
1.394ValHis: 1.394 ± 0.28
4.646ValIle: 4.646 ± 0.562
6.172ValLys: 6.172 ± 0.671
3.783ValLeu: 3.783 ± 0.404
1.991ValMet: 1.991 ± 0.343
4.115ValAsn: 4.115 ± 0.696
2.588ValPro: 2.588 ± 0.447
2.588ValGln: 2.588 ± 0.445
3.916ValArg: 3.916 ± 0.422
5.641ValSer: 5.641 ± 0.562
5.177ValThr: 5.177 ± 0.622
4.712ValVal: 4.712 ± 0.727
1.261ValTrp: 1.261 ± 0.209
2.124ValTyr: 2.124 ± 0.461
0.0ValXaa: 0.0 ± 0.0
Trp
0.996TrpAla: 0.996 ± 0.263
0.199TrpCys: 0.199 ± 0.133
0.796TrpAsp: 0.796 ± 0.197
0.73TrpGlu: 0.73 ± 0.211
0.332TrpPhe: 0.332 ± 0.216
0.863TrpGly: 0.863 ± 0.201
0.332TrpHis: 0.332 ± 0.131
0.996TrpIle: 0.996 ± 0.252
1.062TrpLys: 1.062 ± 0.26
1.195TrpLeu: 1.195 ± 0.275
0.465TrpMet: 0.465 ± 0.144
0.796TrpAsn: 0.796 ± 0.258
0.265TrpPro: 0.265 ± 0.137
0.531TrpGln: 0.531 ± 0.153
0.996TrpArg: 0.996 ± 0.272
0.863TrpSer: 0.863 ± 0.269
1.062TrpThr: 1.062 ± 0.28
0.929TrpVal: 0.929 ± 0.234
0.199TrpTrp: 0.199 ± 0.113
0.597TrpTyr: 0.597 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.92TyrAla: 2.92 ± 0.416
0.199TyrCys: 0.199 ± 0.09
2.987TyrAsp: 2.987 ± 0.424
1.991TyrGlu: 1.991 ± 0.311
1.261TyrPhe: 1.261 ± 0.31
2.522TyrGly: 2.522 ± 0.428
0.796TyrHis: 0.796 ± 0.219
2.655TyrIle: 2.655 ± 0.429
2.19TyrLys: 2.19 ± 0.346
1.792TyrLeu: 1.792 ± 0.375
1.261TyrMet: 1.261 ± 0.291
1.659TyrAsn: 1.659 ± 0.296
1.327TyrPro: 1.327 ± 0.27
1.527TyrGln: 1.527 ± 0.317
2.057TyrArg: 2.057 ± 0.313
1.991TyrSer: 1.991 ± 0.332
2.057TyrThr: 2.057 ± 0.378
1.792TyrVal: 1.792 ± 0.393
0.398TyrTrp: 0.398 ± 0.178
1.527TyrTyr: 1.527 ± 0.269
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 80 proteins (15068 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski