Amino acid dipepetide frequency for Escherichia virus Jk06

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.093AlaAla: 8.093 ± 0.975
0.823AlaCys: 0.823 ± 0.265
3.567AlaAsp: 3.567 ± 0.685
4.87AlaGlu: 4.87 ± 0.686
2.743AlaPhe: 2.743 ± 0.381
5.83AlaGly: 5.83 ± 0.596
1.235AlaHis: 1.235 ± 0.345
6.722AlaIle: 6.722 ± 0.781
5.007AlaLys: 5.007 ± 1.113
7.27AlaLeu: 7.27 ± 0.76
2.263AlaMet: 2.263 ± 0.431
3.361AlaAsn: 3.361 ± 0.488
1.783AlaPro: 1.783 ± 0.336
3.224AlaGln: 3.224 ± 0.439
5.007AlaArg: 5.007 ± 0.652
6.379AlaSer: 6.379 ± 0.742
4.595AlaThr: 4.595 ± 0.545
4.938AlaVal: 4.938 ± 0.603
1.166AlaTrp: 1.166 ± 0.281
1.92AlaTyr: 1.92 ± 0.344
0.0AlaXaa: 0.0 ± 0.0
Cys
0.823CysAla: 0.823 ± 0.27
0.274CysCys: 0.274 ± 0.159
0.686CysAsp: 0.686 ± 0.183
0.754CysGlu: 0.754 ± 0.206
0.686CysPhe: 0.686 ± 0.238
1.303CysGly: 1.303 ± 0.322
0.549CysHis: 0.549 ± 0.186
0.412CysIle: 0.412 ± 0.212
0.823CysLys: 0.823 ± 0.215
1.578CysLeu: 1.578 ± 0.319
0.48CysMet: 0.48 ± 0.171
0.549CysAsn: 0.549 ± 0.159
0.412CysPro: 0.412 ± 0.177
0.617CysGln: 0.617 ± 0.212
1.235CysArg: 1.235 ± 0.294
2.263CysSer: 2.263 ± 0.483
0.549CysThr: 0.549 ± 0.175
0.823CysVal: 0.823 ± 0.248
0.137CysTrp: 0.137 ± 0.081
0.412CysTyr: 0.412 ± 0.177
0.0CysXaa: 0.0 ± 0.0
Asp
4.252AspAla: 4.252 ± 0.738
0.549AspCys: 0.549 ± 0.171
3.772AspAsp: 3.772 ± 0.507
4.047AspGlu: 4.047 ± 0.541
2.743AspPhe: 2.743 ± 0.34
6.31AspGly: 6.31 ± 0.725
0.686AspHis: 0.686 ± 0.189
3.704AspIle: 3.704 ± 0.524
4.252AspLys: 4.252 ± 0.497
3.018AspLeu: 3.018 ± 0.405
0.823AspMet: 0.823 ± 0.221
2.469AspAsn: 2.469 ± 0.498
1.509AspPro: 1.509 ± 0.291
1.646AspGln: 1.646 ± 0.283
2.058AspArg: 2.058 ± 0.381
4.252AspSer: 4.252 ± 0.626
3.155AspThr: 3.155 ± 0.439
3.909AspVal: 3.909 ± 0.575
0.48AspTrp: 0.48 ± 0.219
2.332AspTyr: 2.332 ± 0.366
0.0AspXaa: 0.0 ± 0.0
Glu
4.595GluAla: 4.595 ± 0.679
0.823GluCys: 0.823 ± 0.247
2.675GluAsp: 2.675 ± 0.435
2.949GluGlu: 2.949 ± 0.528
2.881GluPhe: 2.881 ± 0.372
3.567GluGly: 3.567 ± 0.488
0.892GluHis: 0.892 ± 0.253
5.007GluIle: 5.007 ± 0.482
3.361GluLys: 3.361 ± 0.532
5.144GluLeu: 5.144 ± 0.52
1.715GluMet: 1.715 ± 0.336
2.743GluAsn: 2.743 ± 0.485
1.303GluPro: 1.303 ± 0.226
2.675GluGln: 2.675 ± 0.365
3.498GluArg: 3.498 ± 0.535
3.567GluSer: 3.567 ± 0.563
3.429GluThr: 3.429 ± 0.478
4.321GluVal: 4.321 ± 0.638
0.343GluTrp: 0.343 ± 0.157
1.92GluTyr: 1.92 ± 0.399
0.0GluXaa: 0.0 ± 0.0
Phe
2.812PheAla: 2.812 ± 0.415
0.96PheCys: 0.96 ± 0.266
3.567PheAsp: 3.567 ± 0.449
1.989PheGlu: 1.989 ± 0.404
0.96PhePhe: 0.96 ± 0.229
3.155PheGly: 3.155 ± 0.491
0.343PheHis: 0.343 ± 0.136
2.058PheIle: 2.058 ± 0.287
1.852PheLys: 1.852 ± 0.404
3.429PheLeu: 3.429 ± 0.446
1.235PheMet: 1.235 ± 0.297
2.606PheAsn: 2.606 ± 0.49
1.235PhePro: 1.235 ± 0.281
1.509PheGln: 1.509 ± 0.31
2.263PheArg: 2.263 ± 0.396
3.224PheSer: 3.224 ± 0.578
2.263PheThr: 2.263 ± 0.389
2.469PheVal: 2.469 ± 0.319
0.549PheTrp: 0.549 ± 0.197
1.372PheTyr: 1.372 ± 0.296
0.0PheXaa: 0.0 ± 0.0
Gly
4.184GlyAla: 4.184 ± 0.579
1.372GlyCys: 1.372 ± 0.32
4.458GlyAsp: 4.458 ± 0.714
5.007GlyGlu: 5.007 ± 0.473
2.812GlyPhe: 2.812 ± 0.612
5.213GlyGly: 5.213 ± 1.004
0.48GlyHis: 0.48 ± 0.166
5.007GlyIle: 5.007 ± 0.534
3.772GlyLys: 3.772 ± 0.449
6.379GlyLeu: 6.379 ± 0.848
1.92GlyMet: 1.92 ± 0.413
3.498GlyAsn: 3.498 ± 0.496
0.96GlyPro: 0.96 ± 0.401
1.303GlyGln: 1.303 ± 0.265
2.675GlyArg: 2.675 ± 0.447
5.487GlySer: 5.487 ± 0.627
4.115GlyThr: 4.115 ± 0.484
6.379GlyVal: 6.379 ± 0.643
0.96GlyTrp: 0.96 ± 0.25
3.155GlyTyr: 3.155 ± 0.41
0.0GlyXaa: 0.0 ± 0.0
His
0.686HisAla: 0.686 ± 0.237
0.617HisCys: 0.617 ± 0.274
0.892HisAsp: 0.892 ± 0.23
0.686HisGlu: 0.686 ± 0.236
0.892HisPhe: 0.892 ± 0.249
1.235HisGly: 1.235 ± 0.269
0.549HisHis: 0.549 ± 0.186
0.892HisIle: 0.892 ± 0.29
1.372HisLys: 1.372 ± 0.383
1.783HisLeu: 1.783 ± 0.432
0.343HisMet: 0.343 ± 0.144
0.48HisAsn: 0.48 ± 0.173
0.412HisPro: 0.412 ± 0.161
0.274HisGln: 0.274 ± 0.124
0.96HisArg: 0.96 ± 0.256
0.892HisSer: 0.892 ± 0.235
0.754HisThr: 0.754 ± 0.191
1.44HisVal: 1.44 ± 0.261
0.069HisTrp: 0.069 ± 0.067
0.754HisTyr: 0.754 ± 0.268
0.0HisXaa: 0.0 ± 0.0
Ile
4.733IleAla: 4.733 ± 0.635
1.097IleCys: 1.097 ± 0.261
4.664IleAsp: 4.664 ± 0.609
4.733IleGlu: 4.733 ± 0.619
2.058IlePhe: 2.058 ± 0.31
3.772IleGly: 3.772 ± 0.682
1.166IleHis: 1.166 ± 0.28
4.047IleIle: 4.047 ± 0.493
3.704IleLys: 3.704 ± 0.419
3.704IleLeu: 3.704 ± 0.595
1.235IleMet: 1.235 ± 0.39
4.252IleAsn: 4.252 ± 0.445
2.743IlePro: 2.743 ± 0.478
2.263IleGln: 2.263 ± 0.504
3.567IleArg: 3.567 ± 0.561
5.281IleSer: 5.281 ± 0.69
4.252IleThr: 4.252 ± 0.608
3.635IleVal: 3.635 ± 0.377
0.549IleTrp: 0.549 ± 0.197
2.469IleTyr: 2.469 ± 0.455
0.0IleXaa: 0.0 ± 0.0
Lys
5.624LysAla: 5.624 ± 0.939
0.686LysCys: 0.686 ± 0.264
3.292LysAsp: 3.292 ± 0.381
3.841LysGlu: 3.841 ± 0.67
2.469LysPhe: 2.469 ± 0.448
3.018LysGly: 3.018 ± 0.457
1.166LysHis: 1.166 ± 0.287
3.155LysIle: 3.155 ± 0.454
2.881LysLys: 2.881 ± 0.468
6.173LysLeu: 6.173 ± 0.789
2.058LysMet: 2.058 ± 0.477
1.92LysAsn: 1.92 ± 0.316
1.715LysPro: 1.715 ± 0.344
2.538LysGln: 2.538 ± 0.418
2.538LysArg: 2.538 ± 0.48
3.978LysSer: 3.978 ± 0.492
3.635LysThr: 3.635 ± 0.429
3.909LysVal: 3.909 ± 0.465
0.549LysTrp: 0.549 ± 0.229
1.92LysTyr: 1.92 ± 0.359
0.0LysXaa: 0.0 ± 0.0
Leu
7.613LeuAla: 7.613 ± 0.718
1.44LeuCys: 1.44 ± 0.325
4.458LeuAsp: 4.458 ± 0.584
3.909LeuGlu: 3.909 ± 0.504
2.743LeuPhe: 2.743 ± 0.583
4.252LeuGly: 4.252 ± 0.64
1.646LeuHis: 1.646 ± 0.297
4.938LeuIle: 4.938 ± 0.575
4.87LeuLys: 4.87 ± 0.575
7.545LeuLeu: 7.545 ± 0.926
2.058LeuMet: 2.058 ± 0.369
3.841LeuAsn: 3.841 ± 0.398
4.184LeuPro: 4.184 ± 0.569
2.743LeuGln: 2.743 ± 0.437
5.556LeuArg: 5.556 ± 0.562
7.682LeuSer: 7.682 ± 0.679
6.722LeuThr: 6.722 ± 0.608
5.761LeuVal: 5.761 ± 0.665
0.549LeuTrp: 0.549 ± 0.18
2.469LeuTyr: 2.469 ± 0.417
0.0LeuXaa: 0.0 ± 0.0
Met
3.909MetAla: 3.909 ± 0.48
0.069MetCys: 0.069 ± 0.074
0.686MetAsp: 0.686 ± 0.227
1.235MetGlu: 1.235 ± 0.304
0.892MetPhe: 0.892 ± 0.254
0.754MetGly: 0.754 ± 0.21
0.549MetHis: 0.549 ± 0.194
1.578MetIle: 1.578 ± 0.36
1.92MetLys: 1.92 ± 0.349
1.989MetLeu: 1.989 ± 0.462
0.96MetMet: 0.96 ± 0.252
1.303MetAsn: 1.303 ± 0.304
0.754MetPro: 0.754 ± 0.217
0.892MetGln: 0.892 ± 0.232
2.263MetArg: 2.263 ± 0.408
1.646MetSer: 1.646 ± 0.332
1.852MetThr: 1.852 ± 0.428
1.715MetVal: 1.715 ± 0.349
0.206MetTrp: 0.206 ± 0.12
0.754MetTyr: 0.754 ± 0.186
0.0MetXaa: 0.0 ± 0.0
Asn
2.881AsnAla: 2.881 ± 0.382
0.823AsnCys: 0.823 ± 0.291
2.332AsnAsp: 2.332 ± 0.42
3.361AsnGlu: 3.361 ± 0.517
1.509AsnPhe: 1.509 ± 0.285
5.761AsnGly: 5.761 ± 0.752
1.029AsnHis: 1.029 ± 0.254
2.675AsnIle: 2.675 ± 0.44
2.538AsnLys: 2.538 ± 0.319
4.733AsnLeu: 4.733 ± 0.541
0.823AsnMet: 0.823 ± 0.223
3.086AsnAsn: 3.086 ± 0.596
1.509AsnPro: 1.509 ± 0.323
1.578AsnGln: 1.578 ± 0.421
2.195AsnArg: 2.195 ± 0.416
4.321AsnSer: 4.321 ± 0.671
2.606AsnThr: 2.606 ± 0.453
3.841AsnVal: 3.841 ± 0.461
0.686AsnTrp: 0.686 ± 0.189
1.852AsnTyr: 1.852 ± 0.385
0.0AsnXaa: 0.0 ± 0.0
Pro
3.772ProAla: 3.772 ± 0.576
0.412ProCys: 0.412 ± 0.177
1.715ProAsp: 1.715 ± 0.437
2.126ProGlu: 2.126 ± 0.444
1.646ProPhe: 1.646 ± 0.345
1.44ProGly: 1.44 ± 0.363
0.617ProHis: 0.617 ± 0.249
1.44ProIle: 1.44 ± 0.299
1.852ProLys: 1.852 ± 0.371
2.332ProLeu: 2.332 ± 0.397
1.029ProMet: 1.029 ± 0.239
1.783ProAsn: 1.783 ± 0.314
1.029ProPro: 1.029 ± 0.282
1.715ProGln: 1.715 ± 0.307
1.715ProArg: 1.715 ± 0.346
1.852ProSer: 1.852 ± 0.396
1.989ProThr: 1.989 ± 0.383
3.224ProVal: 3.224 ± 0.435
0.48ProTrp: 0.48 ± 0.148
1.029ProTyr: 1.029 ± 0.26
0.0ProXaa: 0.0 ± 0.0
Gln
3.841GlnAla: 3.841 ± 0.54
0.48GlnCys: 0.48 ± 0.189
1.852GlnAsp: 1.852 ± 0.34
1.989GlnGlu: 1.989 ± 0.313
1.509GlnPhe: 1.509 ± 0.277
1.989GlnGly: 1.989 ± 0.433
0.549GlnHis: 0.549 ± 0.182
2.949GlnIle: 2.949 ± 0.599
1.715GlnLys: 1.715 ± 0.337
3.772GlnLeu: 3.772 ± 0.497
1.097GlnMet: 1.097 ± 0.288
1.783GlnAsn: 1.783 ± 0.375
1.852GlnPro: 1.852 ± 0.351
2.606GlnGln: 2.606 ± 0.77
1.92GlnArg: 1.92 ± 0.399
3.018GlnSer: 3.018 ± 0.387
1.646GlnThr: 1.646 ± 0.268
1.92GlnVal: 1.92 ± 0.364
0.343GlnTrp: 0.343 ± 0.158
1.715GlnTyr: 1.715 ± 0.388
0.0GlnXaa: 0.0 ± 0.0
Arg
3.704ArgAla: 3.704 ± 0.523
1.235ArgCys: 1.235 ± 0.37
1.92ArgAsp: 1.92 ± 0.33
2.949ArgGlu: 2.949 ± 0.402
2.263ArgPhe: 2.263 ± 0.348
2.949ArgGly: 2.949 ± 0.439
0.754ArgHis: 0.754 ± 0.219
3.909ArgIle: 3.909 ± 0.581
4.115ArgLys: 4.115 ± 0.628
4.938ArgLeu: 4.938 ± 0.6
1.372ArgMet: 1.372 ± 0.361
2.949ArgAsn: 2.949 ± 0.585
2.401ArgPro: 2.401 ± 0.374
2.538ArgGln: 2.538 ± 0.394
3.635ArgArg: 3.635 ± 0.449
3.567ArgSer: 3.567 ± 0.528
2.401ArgThr: 2.401 ± 0.402
3.635ArgVal: 3.635 ± 0.475
0.686ArgTrp: 0.686 ± 0.202
2.812ArgTyr: 2.812 ± 0.404
0.0ArgXaa: 0.0 ± 0.0
Ser
6.722SerAla: 6.722 ± 0.814
0.892SerCys: 0.892 ± 0.238
5.281SerAsp: 5.281 ± 0.495
4.184SerGlu: 4.184 ± 0.53
3.978SerPhe: 3.978 ± 0.571
6.379SerGly: 6.379 ± 0.659
1.44SerHis: 1.44 ± 0.314
4.252SerIle: 4.252 ± 0.51
3.772SerLys: 3.772 ± 0.465
8.025SerLeu: 8.025 ± 0.881
1.852SerMet: 1.852 ± 0.38
3.498SerAsn: 3.498 ± 0.477
2.263SerPro: 2.263 ± 0.413
3.567SerGln: 3.567 ± 0.428
3.224SerArg: 3.224 ± 0.528
6.379SerSer: 6.379 ± 1.156
4.252SerThr: 4.252 ± 0.644
5.693SerVal: 5.693 ± 0.551
1.166SerTrp: 1.166 ± 0.293
2.949SerTyr: 2.949 ± 0.397
0.0SerXaa: 0.0 ± 0.0
Thr
5.075ThrAla: 5.075 ± 0.685
0.96ThrCys: 0.96 ± 0.254
2.743ThrAsp: 2.743 ± 0.379
2.881ThrGlu: 2.881 ± 0.44
1.989ThrPhe: 1.989 ± 0.416
4.938ThrGly: 4.938 ± 0.514
0.754ThrHis: 0.754 ± 0.197
3.772ThrIle: 3.772 ± 0.445
2.606ThrLys: 2.606 ± 0.474
5.075ThrLeu: 5.075 ± 0.511
1.715ThrMet: 1.715 ± 0.312
3.429ThrAsn: 3.429 ± 0.411
3.155ThrPro: 3.155 ± 0.349
2.195ThrGln: 2.195 ± 0.351
2.606ThrArg: 2.606 ± 0.39
4.664ThrSer: 4.664 ± 0.623
3.772ThrThr: 3.772 ± 0.602
4.39ThrVal: 4.39 ± 0.489
0.823ThrTrp: 0.823 ± 0.225
2.195ThrTyr: 2.195 ± 0.407
0.0ThrXaa: 0.0 ± 0.0
Val
5.213ValAla: 5.213 ± 0.763
0.96ValCys: 0.96 ± 0.237
4.184ValAsp: 4.184 ± 0.548
3.361ValGlu: 3.361 ± 0.601
3.224ValPhe: 3.224 ± 0.468
3.841ValGly: 3.841 ± 0.58
0.96ValHis: 0.96 ± 0.293
4.184ValIle: 4.184 ± 0.461
4.252ValLys: 4.252 ± 0.509
4.733ValLeu: 4.733 ± 0.561
1.783ValMet: 1.783 ± 0.353
3.841ValAsn: 3.841 ± 0.527
2.538ValPro: 2.538 ± 0.429
2.538ValGln: 2.538 ± 0.45
4.39ValArg: 4.39 ± 0.666
7.682ValSer: 7.682 ± 0.706
4.321ValThr: 4.321 ± 0.493
5.83ValVal: 5.83 ± 0.637
0.754ValTrp: 0.754 ± 0.19
2.332ValTyr: 2.332 ± 0.392
0.0ValXaa: 0.0 ± 0.0
Trp
0.412TrpAla: 0.412 ± 0.16
0.343TrpCys: 0.343 ± 0.148
0.412TrpAsp: 0.412 ± 0.159
0.48TrpGlu: 0.48 ± 0.164
0.343TrpPhe: 0.343 ± 0.137
0.754TrpGly: 0.754 ± 0.254
0.137TrpHis: 0.137 ± 0.082
0.96TrpIle: 0.96 ± 0.218
0.549TrpLys: 0.549 ± 0.154
1.44TrpLeu: 1.44 ± 0.309
0.274TrpMet: 0.274 ± 0.13
0.48TrpAsn: 0.48 ± 0.169
0.343TrpPro: 0.343 ± 0.136
0.274TrpGln: 0.274 ± 0.147
0.823TrpArg: 0.823 ± 0.199
1.372TrpSer: 1.372 ± 0.325
0.686TrpThr: 0.686 ± 0.222
0.754TrpVal: 0.754 ± 0.194
0.206TrpTrp: 0.206 ± 0.096
0.206TrpTyr: 0.206 ± 0.088
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.852TyrAla: 1.852 ± 0.396
0.549TyrCys: 0.549 ± 0.199
2.812TyrAsp: 2.812 ± 0.439
2.058TyrGlu: 2.058 ± 0.348
1.509TyrPhe: 1.509 ± 0.297
2.675TyrGly: 2.675 ± 0.53
0.48TyrHis: 0.48 ± 0.184
2.263TyrIle: 2.263 ± 0.388
1.852TyrLys: 1.852 ± 0.487
1.92TyrLeu: 1.92 ± 0.345
0.823TyrMet: 0.823 ± 0.24
2.263TyrAsn: 2.263 ± 0.324
1.097TyrPro: 1.097 ± 0.264
1.852TyrGln: 1.852 ± 0.318
2.606TyrArg: 2.606 ± 0.432
2.263TyrSer: 2.263 ± 0.41
2.743TyrThr: 2.743 ± 0.384
2.332TyrVal: 2.332 ± 0.372
0.549TyrTrp: 0.549 ± 0.234
0.823TyrTyr: 0.823 ± 0.266
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (14581 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski