Amino acid dipeptide frequency for Escherichia virus mEp460_4F5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
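The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function name `dipeptide_per_mille`, the toy sequences, and the choice to normalize by the total number of overlapping dipeptides are all assumptions made for the example (the database may normalize differently, for instance by the number of amino acids).

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs are counted inside each protein only, never across protein boundaries.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    # Assumption: normalize by the total number of dipeptides observed.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences in one-letter code, not taken from this proteome.
toy = ["MAAG", "AAK"]
freqs = dipeptide_per_mille(toy)

# Expected value if all 400 standard dipeptides were equally likely: 2.5 per mille.
uniform = 1000.0 / 400

for pair, value in sorted(freqs.items()):
    label = "over" if value > uniform else "under"
    print(pair, round(value, 1), label)
```

With real data, each value would be compared against the uniform expectation of 2.5‰ in the same way to decide whether a dipeptide is over- or underrepresented.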

For more information, see the sequence space article on Wikipedia.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 12.431 ± 2.317 | 0.763 ± 0.245 | 6.025 ± 0.702 | 6.178 ± 0.861 | 3.661 ± 0.573 | 7.169 ± 0.792 | 1.297 ± 0.371 | 4.347 ± 0.547 | 4.042 ± 0.597 | 7.627 ± 0.949 | 2.898 ± 0.617 | 2.441 ± 0.45 | 2.746 ± 0.483 | 3.508 ± 0.696 | 5.034 ± 0.772 | 6.864 ± 0.881 | 5.72 ± 1.058 | 6.788 ± 0.827 | 2.059 ± 0.322 | 2.517 ± 0.486 | 0.0 ± 0.0 |
| Cys | 0.991 ± 0.324 | 0.153 ± 0.115 | 0.839 ± 0.215 | 0.381 ± 0.148 | 0.153 ± 0.095 | 1.22 ± 0.296 | 0.076 ± 0.08 | 0.686 ± 0.226 | 0.534 ± 0.214 | 0.991 ± 0.261 | 0.229 ± 0.12 | 0.61 ± 0.178 | 0.61 ± 0.236 | 0.763 ± 0.232 | 1.068 ± 0.344 | 1.068 ± 0.329 | 0.61 ± 0.189 | 0.915 ± 0.247 | 0.229 ± 0.191 | 0.305 ± 0.146 | 0.0 ± 0.0 |
| Asp | 6.101 ± 0.6 | 0.305 ± 0.156 | 4.347 ± 0.628 | 3.508 ± 0.591 | 2.135 ± 0.381 | 6.559 ± 0.798 | 0.458 ± 0.194 | 3.356 ± 0.47 | 2.822 ± 0.53 | 5.262 ± 0.768 | 1.907 ± 0.339 | 3.432 ± 0.439 | 2.212 ± 0.43 | 1.907 ± 0.427 | 2.517 ± 0.507 | 2.746 ± 0.524 | 2.517 ± 0.418 | 4.195 ± 0.788 | 1.525 ± 0.429 | 1.449 ± 0.326 | 0.0 ± 0.0 |
| Glu | 5.491 ± 0.694 | 0.839 ± 0.263 | 2.746 ± 0.455 | 3.508 ± 0.563 | 1.449 ± 0.285 | 3.508 ± 0.532 | 1.297 ± 0.297 | 3.661 ± 0.551 | 4.042 ± 0.443 | 6.559 ± 0.872 | 1.907 ± 0.429 | 2.441 ± 0.374 | 2.517 ± 0.501 | 3.89 ± 0.919 | 4.881 ± 0.664 | 3.737 ± 0.718 | 4.5 ± 0.595 | 3.661 ± 0.466 | 1.22 ± 0.431 | 1.983 ± 0.384 | 0.0 ± 0.0 |
| Phe | 2.135 ± 0.397 | 0.763 ± 0.257 | 2.135 ± 0.413 | 2.059 ± 0.385 | 1.22 ± 0.291 | 1.983 ± 0.459 | 0.458 ± 0.177 | 2.059 ± 0.398 | 2.059 ± 0.395 | 2.212 ± 0.436 | 1.22 ± 0.291 | 1.754 ± 0.404 | 1.373 ± 0.358 | 0.991 ± 0.241 | 2.898 ± 0.498 | 2.898 ± 0.466 | 1.983 ± 0.335 | 2.593 ± 0.373 | 0.991 ± 0.223 | 1.22 ± 0.286 | 0.0 ± 0.0 |
| Gly | 5.949 ± 0.813 | 0.839 ± 0.255 | 4.881 ± 0.655 | 4.652 ± 0.53 | 2.517 ± 0.517 | 4.957 ± 0.642 | 1.525 ± 0.325 | 4.042 ± 0.633 | 5.644 ± 0.65 | 4.118 ± 0.465 | 2.059 ± 0.348 | 3.279 ± 0.527 | 1.907 ± 0.402 | 3.279 ± 0.609 | 4.652 ± 0.532 | 4.195 ± 0.437 | 3.203 ± 0.623 | 5.72 ± 0.551 | 1.525 ± 0.327 | 2.059 ± 0.412 | 0.0 ± 0.0 |
| His | 1.678 ± 0.346 | 0.305 ± 0.152 | 1.22 ± 0.302 | 0.991 ± 0.287 | 1.068 ± 0.245 | 1.449 ± 0.292 | 0.534 ± 0.239 | 0.915 ± 0.31 | 1.144 ± 0.286 | 1.297 ± 0.344 | 0.534 ± 0.19 | 0.61 ± 0.206 | 0.915 ± 0.27 | 0.534 ± 0.167 | 0.991 ± 0.291 | 0.991 ± 0.313 | 0.686 ± 0.231 | 0.839 ± 0.253 | 0.381 ± 0.157 | 0.381 ± 0.164 | 0.0 ± 0.0 |
| Ile | 5.567 ± 0.772 | 0.763 ± 0.253 | 4.728 ± 0.559 | 3.432 ± 0.548 | 1.068 ± 0.265 | 4.5 ± 0.6 | 0.61 ± 0.228 | 2.135 ± 0.407 | 2.212 ± 0.501 | 2.593 ± 0.369 | 1.068 ± 0.285 | 3.508 ± 0.531 | 2.288 ± 0.48 | 2.135 ± 0.371 | 4.195 ± 0.604 | 3.585 ± 0.599 | 3.585 ± 0.629 | 2.746 ± 0.479 | 0.686 ± 0.205 | 0.991 ± 0.292 | 0.0 ± 0.0 |
| Lys | 5.262 ± 0.788 | 1.068 ± 0.333 | 2.974 ± 0.418 | 3.585 ± 0.625 | 2.212 ± 0.466 | 3.432 ± 0.558 | 0.991 ± 0.236 | 2.517 ± 0.478 | 2.974 ± 0.515 | 4.576 ± 0.58 | 1.449 ± 0.331 | 1.907 ± 0.366 | 2.288 ± 0.461 | 2.441 ± 0.39 | 2.974 ± 0.568 | 4.042 ± 0.623 | 3.966 ± 0.596 | 3.203 ± 0.453 | 0.991 ± 0.28 | 1.983 ± 0.303 | 0.0 ± 0.0 |
| Leu | 7.016 ± 0.82 | 0.915 ± 0.357 | 3.966 ± 0.601 | 4.118 ± 0.633 | 2.517 ± 0.524 | 4.576 ± 0.658 | 1.525 ± 0.367 | 3.89 ± 0.589 | 5.72 ± 0.566 | 4.576 ± 0.554 | 1.983 ± 0.393 | 4.195 ± 0.543 | 3.813 ± 0.529 | 3.432 ± 0.522 | 5.262 ± 0.68 | 6.864 ± 0.816 | 5.796 ± 0.707 | 5.262 ± 0.528 | 1.297 ± 0.315 | 1.754 ± 0.331 | 0.0 ± 0.0 |
| Met | 2.746 ± 0.418 | 0.076 ± 0.084 | 1.297 ± 0.316 | 1.678 ± 0.376 | 0.686 ± 0.23 | 1.22 ± 0.268 | 0.381 ± 0.196 | 0.839 ± 0.224 | 1.144 ± 0.391 | 2.288 ± 0.357 | 0.458 ± 0.185 | 1.602 ± 0.356 | 1.754 ± 0.38 | 1.373 ± 0.294 | 1.983 ± 0.367 | 1.907 ± 0.403 | 2.364 ± 0.46 | 1.754 ± 0.373 | 0.305 ± 0.149 | 0.61 ± 0.251 | 0.0 ± 0.0 |
| Asn | 4.042 ± 0.712 | 0.763 ± 0.284 | 2.364 ± 0.353 | 2.822 ± 0.491 | 1.525 ± 0.303 | 4.576 ± 0.687 | 0.61 ± 0.224 | 2.517 ± 0.418 | 2.517 ± 0.559 | 2.822 ± 0.641 | 1.144 ± 0.309 | 1.983 ± 0.465 | 2.135 ± 0.433 | 1.754 ± 0.329 | 2.517 ± 0.553 | 2.746 ± 0.463 | 2.746 ± 0.42 | 1.983 ± 0.428 | 0.61 ± 0.201 | 1.373 ± 0.255 | 0.0 ± 0.0 |
| Pro | 4.271 ± 0.664 | 0.534 ± 0.214 | 2.898 ± 0.582 | 4.5 ± 0.89 | 1.525 ± 0.358 | 2.822 ± 0.42 | 0.991 ± 0.282 | 0.839 ± 0.243 | 1.754 ± 0.428 | 2.288 ± 0.511 | 1.068 ± 0.303 | 1.602 ± 0.341 | 1.754 ± 0.447 | 1.297 ± 0.29 | 2.669 ± 0.452 | 2.135 ± 0.47 | 1.754 ± 0.442 | 4.195 ± 0.651 | 0.686 ± 0.255 | 0.991 ± 0.324 | 0.0 ± 0.0 |
| Gln | 4.347 ± 1.064 | 0.763 ± 0.203 | 1.373 ± 0.386 | 2.898 ± 0.5 | 1.373 ± 0.347 | 2.059 ± 0.368 | 0.915 ± 0.27 | 2.822 ± 0.676 | 3.051 ± 0.626 | 4.423 ± 0.784 | 1.449 ± 0.322 | 1.602 ± 0.384 | 1.602 ± 0.36 | 3.203 ± 1.102 | 3.051 ± 0.707 | 2.517 ± 0.436 | 2.059 ± 0.494 | 3.661 ± 0.501 | 0.991 ± 0.353 | 1.373 ± 0.272 | 0.0 ± 0.0 |
| Arg | 4.805 ± 0.653 | 0.915 ± 0.312 | 3.432 ± 0.666 | 4.728 ± 0.512 | 2.288 ± 0.417 | 3.432 ± 0.526 | 1.678 ± 0.34 | 3.661 ± 0.468 | 3.356 ± 0.55 | 5.186 ± 0.628 | 2.441 ± 0.485 | 3.279 ± 0.549 | 2.364 ± 0.491 | 3.966 ± 1.01 | 5.186 ± 0.971 | 2.364 ± 0.469 | 3.585 ± 0.596 | 3.661 ± 0.62 | 1.22 ± 0.377 | 2.746 ± 0.487 | 0.0 ± 0.0 |
| Ser | 5.872 ± 0.784 | 0.534 ± 0.188 | 4.423 ± 0.445 | 4.347 ± 0.519 | 2.059 ± 0.39 | 6.635 ± 0.717 | 0.839 ± 0.224 | 3.356 ± 0.583 | 3.051 ± 0.56 | 5.72 ± 0.58 | 0.991 ± 0.244 | 2.364 ± 0.415 | 2.441 ± 0.565 | 3.356 ± 0.421 | 3.585 ± 0.521 | 4.271 ± 0.693 | 3.203 ± 0.456 | 5.567 ± 0.709 | 0.686 ± 0.251 | 1.602 ± 0.274 | 0.0 ± 0.0 |
| Thr | 6.33 ± 0.837 | 0.534 ± 0.233 | 3.127 ± 0.514 | 4.042 ± 0.656 | 2.974 ± 0.43 | 3.661 ± 0.623 | 1.602 ± 0.35 | 3.279 ± 0.509 | 2.135 ± 0.476 | 4.347 ± 0.589 | 0.61 ± 0.252 | 1.983 ± 0.516 | 3.203 ± 0.791 | 2.441 ± 0.545 | 3.813 ± 0.645 | 3.737 ± 0.456 | 4.118 ± 0.811 | 4.576 ± 0.858 | 1.068 ± 0.307 | 2.059 ± 0.354 | 0.0 ± 0.0 |
| Val | 5.72 ± 0.754 | 0.915 ± 0.29 | 3.279 ± 0.495 | 3.585 ± 0.614 | 2.669 ± 0.542 | 3.737 ± 0.564 | 0.686 ± 0.207 | 4.805 ± 0.701 | 4.347 ± 0.641 | 6.33 ± 0.577 | 1.754 ± 0.305 | 3.203 ± 0.579 | 3.279 ± 0.562 | 3.051 ± 0.574 | 3.737 ± 0.604 | 5.186 ± 0.632 | 4.576 ± 0.552 | 4.805 ± 0.72 | 1.22 ± 0.274 | 1.907 ± 0.342 | 0.0 ± 0.0 |
| Trp | 1.602 ± 0.376 | 0.229 ± 0.133 | 1.297 ± 0.314 | 0.763 ± 0.244 | 0.534 ± 0.174 | 1.297 ± 0.238 | 0.381 ± 0.221 | 0.991 ± 0.283 | 1.068 ± 0.346 | 2.669 ± 0.561 | 0.686 ± 0.185 | 0.458 ± 0.173 | 0.763 ± 0.371 | 0.763 ± 0.227 | 1.22 ± 0.381 | 1.068 ± 0.251 | 1.068 ± 0.286 | 1.068 ± 0.347 | 0.686 ± 0.266 | 0.534 ± 0.159 | 0.0 ± 0.0 |
| Tyr | 1.754 ± 0.427 | 0.458 ± 0.182 | 1.907 ± 0.347 | 2.059 ± 0.463 | 1.449 ± 0.404 | 2.212 ± 0.366 | 0.61 ± 0.205 | 1.678 ± 0.399 | 1.22 ± 0.329 | 2.593 ± 0.37 | 0.534 ± 0.23 | 1.297 ± 0.254 | 0.61 ± 0.168 | 1.449 ± 0.322 | 2.059 ± 0.357 | 1.983 ± 0.39 | 1.602 ± 0.431 | 1.602 ± 0.378 | 0.763 ± 0.21 | 0.458 ± 0.161 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 61 proteins (13113 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
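A minimal sketch of what a protein-level bootstrap could look like is shown below. It is an illustration under stated assumptions, not the database's implementation: the helper names `pair_frequency` and `bootstrap_error`, the toy sequences, the fixed seed, and the use of the standard deviation of the resampled estimates as the reported error are all hypothetical choices.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread of the estimate."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Protein-level resampling: draw as many proteins as in the original set.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    # Assumption: the error is the standard deviation of the bootstrap estimates.
    return statistics.stdev(estimates)

# Hypothetical toy sequences in one-letter code, not taken from this proteome.
toy = ["MAAG", "AAK", "MKAAL", "GAVLA"]
print(round(pair_frequency(toy, "AA"), 3), "±", round(bootstrap_error(toy, "AA"), 3))
```

Resampling whole proteins rather than individual dipeptides keeps the residues of each protein together, so the error reflects variation between proteins in the proteome.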

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski