Amino acid dipeptide frequency for Escherichia virus P2_4B2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
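As an illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name dipeptide_per_mille, the one-letter input format, and the normalization by the total number of counted dipeptides are assumptions made for this example; the database's exact counting and normalization procedure is not described here and may differ.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not taken from the source page): sequences use one-letter
    codes, pairs overlap (ACD gives AC and CD), pairs never span two
    proteins, and frequencies are normalized by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each protein separately.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real proteome data.
toy_frequencies = dipeptide_per_mille(["AAC", "AC"])
```

In the toy input, the pairs are AA, AC and AC, so AC accounts for two thirds of all dipeptides and AA for one third.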

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally frequent, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
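The expected value and the red/blue marking can be sketched as follows. This is only an illustration: the helper names are hypothetical, and the simple "above or below the uniform expectation" rule is an assumption, since the page does not state whether the threshold uses 400 or 441 dipeptides or takes the error estimate into account.

```python
def expected_uniform_per_mille(n_residue_types=20):
    """Per mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def colour_for(observed_per_mille, expected_per_mille):
    """Return 'red' for overrepresented, 'blue' for underrepresented.

    Assumption: a plain comparison against the uniform expectation,
    ignoring the bootstrap error.
    """
    if observed_per_mille > expected_per_mille:
        return "red"
    if observed_per_mille < expected_per_mille:
        return "blue"
    return None


expected = expected_uniform_per_mille(20)  # 2.5 per mille, i.e. 0.25%

# Two values taken from the table on this page.
ala_ala_colour = colour_for(9.428, expected)  # above 2.5, so red
cys_cys_colour = colour_for(0.1, expected)    # below 2.5, so blue
```

Passing n_residue_types=21 gives the expectation for the 441-dipeptide case that includes Xaa.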

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
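The unit conversion is simple arithmetic; a short sketch with hypothetical helper names, using the AlaAla value from the table:

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value_per_mille / 10.0


ala_ala = 9.428                            # AlaAla, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.009428
percent = per_mille_to_percent(ala_ala)    # about 0.9428 percent
```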

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.428AlaAla: 9.428 ± 1.936
0.702AlaCys: 0.702 ± 0.341
5.817AlaAsp: 5.817 ± 0.942
5.216AlaGlu: 5.216 ± 0.793
2.708AlaPhe: 2.708 ± 0.38
8.124AlaGly: 8.124 ± 1.061
1.605AlaHis: 1.605 ± 0.447
3.912AlaIle: 3.912 ± 0.681
4.814AlaLys: 4.814 ± 0.765
9.729AlaLeu: 9.729 ± 1.158
1.805AlaMet: 1.805 ± 0.37
2.909AlaAsn: 2.909 ± 0.386
4.313AlaPro: 4.313 ± 0.706
3.711AlaGln: 3.711 ± 0.753
5.316AlaArg: 5.316 ± 0.949
7.924AlaSer: 7.924 ± 0.976
6.921AlaThr: 6.921 ± 0.897
7.121AlaVal: 7.121 ± 0.982
1.505AlaTrp: 1.505 ± 0.423
2.508AlaTyr: 2.508 ± 0.416
0.0AlaXaa: 0.0 ± 0.0
Cys
1.003CysAla: 1.003 ± 0.283
0.1CysCys: 0.1 ± 0.093
0.903CysAsp: 0.903 ± 0.315
0.301CysGlu: 0.301 ± 0.173
0.1CysPhe: 0.1 ± 0.094
0.602CysGly: 0.602 ± 0.24
0.1CysHis: 0.1 ± 0.087
0.502CysIle: 0.502 ± 0.226
0.301CysLys: 0.301 ± 0.189
0.502CysLeu: 0.502 ± 0.198
0.201CysMet: 0.201 ± 0.16
0.201CysAsn: 0.201 ± 0.139
0.401CysPro: 0.401 ± 0.182
1.003CysGln: 1.003 ± 0.301
1.204CysArg: 1.204 ± 0.269
0.401CysSer: 0.401 ± 0.215
0.702CysThr: 0.702 ± 0.325
0.602CysVal: 0.602 ± 0.247
0.1CysTrp: 0.1 ± 0.109
0.301CysTyr: 0.301 ± 0.167
0.0CysXaa: 0.0 ± 0.0
Asp
6.62AspAla: 6.62 ± 0.777
0.401AspCys: 0.401 ± 0.183
3.31AspAsp: 3.31 ± 0.549
4.514AspGlu: 4.514 ± 1.036
3.109AspPhe: 3.109 ± 0.656
5.216AspGly: 5.216 ± 0.84
0.301AspHis: 0.301 ± 0.179
4.714AspIle: 4.714 ± 0.883
2.407AspLys: 2.407 ± 0.47
4.313AspLeu: 4.313 ± 0.515
0.802AspMet: 0.802 ± 0.303
2.006AspAsn: 2.006 ± 0.595
1.805AspPro: 1.805 ± 0.448
1.404AspGln: 1.404 ± 0.432
2.207AspArg: 2.207 ± 0.538
2.407AspSer: 2.407 ± 0.423
3.912AspThr: 3.912 ± 0.703
3.41AspVal: 3.41 ± 0.532
0.903AspTrp: 0.903 ± 0.308
2.407AspTyr: 2.407 ± 0.535
0.0AspXaa: 0.0 ± 0.0
Glu
5.216GluAla: 5.216 ± 0.738
0.602GluCys: 0.602 ± 0.236
2.608GluAsp: 2.608 ± 0.575
4.012GluGlu: 4.012 ± 0.807
2.307GluPhe: 2.307 ± 0.601
3.009GluGly: 3.009 ± 0.515
1.906GluHis: 1.906 ± 0.412
3.41GluIle: 3.41 ± 0.706
4.213GluLys: 4.213 ± 0.553
7.623GluLeu: 7.623 ± 0.808
2.708GluMet: 2.708 ± 0.464
3.811GluAsn: 3.811 ± 0.812
2.708GluPro: 2.708 ± 0.599
2.708GluGln: 2.708 ± 0.592
4.413GluArg: 4.413 ± 0.869
4.012GluSer: 4.012 ± 0.613
2.708GluThr: 2.708 ± 0.587
3.711GluVal: 3.711 ± 0.802
1.103GluTrp: 1.103 ± 0.337
2.207GluTyr: 2.207 ± 0.644
0.0GluXaa: 0.0 ± 0.0
Phe
2.708PheAla: 2.708 ± 0.493
0.702PheCys: 0.702 ± 0.256
2.106PheAsp: 2.106 ± 0.439
2.106PheGlu: 2.106 ± 0.51
1.003PhePhe: 1.003 ± 0.34
1.204PheGly: 1.204 ± 0.32
1.204PheHis: 1.204 ± 0.468
2.006PheIle: 2.006 ± 0.743
2.407PheLys: 2.407 ± 0.401
3.31PheLeu: 3.31 ± 0.539
0.903PheMet: 0.903 ± 0.313
1.404PheAsn: 1.404 ± 0.38
1.204PhePro: 1.204 ± 0.37
1.505PheGln: 1.505 ± 0.4
1.805PheArg: 1.805 ± 0.391
2.909PheSer: 2.909 ± 0.657
2.608PheThr: 2.608 ± 0.444
1.404PheVal: 1.404 ± 0.377
0.702PheTrp: 0.702 ± 0.269
1.304PheTyr: 1.304 ± 0.381
0.0PheXaa: 0.0 ± 0.0
Gly
6.118GlyAla: 6.118 ± 1.079
0.903GlyCys: 0.903 ± 0.354
4.614GlyAsp: 4.614 ± 0.628
4.112GlyGlu: 4.112 ± 0.639
2.608GlyPhe: 2.608 ± 0.541
5.617GlyGly: 5.617 ± 0.941
0.903GlyHis: 0.903 ± 0.309
3.711GlyIle: 3.711 ± 0.739
5.617GlyLys: 5.617 ± 0.726
4.915GlyLeu: 4.915 ± 0.57
2.106GlyMet: 2.106 ± 0.558
3.009GlyAsn: 3.009 ± 0.784
1.003GlyPro: 1.003 ± 0.335
2.608GlyGln: 2.608 ± 0.515
4.413GlyArg: 4.413 ± 0.693
3.811GlySer: 3.811 ± 0.687
4.112GlyThr: 4.112 ± 0.928
5.316GlyVal: 5.316 ± 0.792
0.903GlyTrp: 0.903 ± 0.211
1.906GlyTyr: 1.906 ± 0.365
0.0GlyXaa: 0.0 ± 0.0
His
1.906HisAla: 1.906 ± 0.561
0.502HisCys: 0.502 ± 0.209
0.903HisAsp: 0.903 ± 0.341
1.304HisGlu: 1.304 ± 0.34
0.802HisPhe: 0.802 ± 0.287
1.505HisGly: 1.505 ± 0.474
0.602HisHis: 0.602 ± 0.258
1.304HisIle: 1.304 ± 0.34
1.003HisLys: 1.003 ± 0.286
2.006HisLeu: 2.006 ± 0.447
0.401HisMet: 0.401 ± 0.177
0.802HisAsn: 0.802 ± 0.37
1.204HisPro: 1.204 ± 0.366
0.903HisGln: 0.903 ± 0.262
1.103HisArg: 1.103 ± 0.35
0.802HisSer: 0.802 ± 0.288
1.003HisThr: 1.003 ± 0.463
1.003HisVal: 1.003 ± 0.278
0.301HisTrp: 0.301 ± 0.137
0.502HisTyr: 0.502 ± 0.203
0.0HisXaa: 0.0 ± 0.0
Ile
4.413IleAla: 4.413 ± 0.742
0.301IleCys: 0.301 ± 0.208
3.811IleAsp: 3.811 ± 0.67
3.711IleGlu: 3.711 ± 0.75
1.906IlePhe: 1.906 ± 0.52
4.112IleGly: 4.112 ± 0.664
0.401IleHis: 0.401 ± 0.193
2.808IleIle: 2.808 ± 0.392
2.207IleLys: 2.207 ± 0.581
3.109IleLeu: 3.109 ± 0.668
1.003IleMet: 1.003 ± 0.291
3.31IleAsn: 3.31 ± 0.467
2.407IlePro: 2.407 ± 0.574
2.407IleGln: 2.407 ± 0.547
5.015IleArg: 5.015 ± 0.774
4.313IleSer: 4.313 ± 0.661
4.614IleThr: 4.614 ± 0.6
3.511IleVal: 3.511 ± 0.565
0.802IleTrp: 0.802 ± 0.285
1.304IleTyr: 1.304 ± 0.306
0.0IleXaa: 0.0 ± 0.0
Lys
5.617LysAla: 5.617 ± 0.626
0.1LysCys: 0.1 ± 0.098
1.906LysAsp: 1.906 ± 0.549
3.41LysGlu: 3.41 ± 0.69
1.705LysPhe: 1.705 ± 0.444
2.808LysGly: 2.808 ± 0.417
1.404LysHis: 1.404 ± 0.38
2.708LysIle: 2.708 ± 0.552
4.112LysLys: 4.112 ± 0.776
6.219LysLeu: 6.219 ± 0.789
0.802LysMet: 0.802 ± 0.302
4.112LysAsn: 4.112 ± 0.667
3.31LysPro: 3.31 ± 0.561
1.705LysGln: 1.705 ± 0.378
4.012LysArg: 4.012 ± 0.823
3.21LysSer: 3.21 ± 0.584
3.711LysThr: 3.711 ± 0.539
3.511LysVal: 3.511 ± 0.67
1.003LysTrp: 1.003 ± 0.339
2.307LysTyr: 2.307 ± 0.588
0.0LysXaa: 0.0 ± 0.0
Leu
9.629LeuAla: 9.629 ± 1.137
0.702LeuCys: 0.702 ± 0.285
5.517LeuAsp: 5.517 ± 0.799
6.62LeuGlu: 6.62 ± 0.655
3.912LeuPhe: 3.912 ± 0.704
4.915LeuGly: 4.915 ± 0.818
2.106LeuHis: 2.106 ± 0.541
4.614LeuIle: 4.614 ± 0.608
5.817LeuLys: 5.817 ± 0.971
6.52LeuLeu: 6.52 ± 1.157
3.21LeuMet: 3.21 ± 0.614
4.915LeuAsn: 4.915 ± 0.537
4.012LeuPro: 4.012 ± 0.646
2.808LeuGln: 2.808 ± 0.605
4.614LeuArg: 4.614 ± 0.614
6.921LeuSer: 6.921 ± 0.993
7.121LeuThr: 7.121 ± 0.904
3.811LeuVal: 3.811 ± 0.435
1.003LeuTrp: 1.003 ± 0.31
2.307LeuTyr: 2.307 ± 0.572
0.0LeuXaa: 0.0 ± 0.0
Met
2.909MetAla: 2.909 ± 0.401
0.301MetCys: 0.301 ± 0.199
0.802MetAsp: 0.802 ± 0.279
1.705MetGlu: 1.705 ± 0.459
0.903MetPhe: 0.903 ± 0.335
0.802MetGly: 0.802 ± 0.28
0.802MetHis: 0.802 ± 0.257
1.003MetIle: 1.003 ± 0.376
1.404MetLys: 1.404 ± 0.336
2.508MetLeu: 2.508 ± 0.532
0.802MetMet: 0.802 ± 0.291
1.605MetAsn: 1.605 ± 0.397
1.103MetPro: 1.103 ± 0.353
1.003MetGln: 1.003 ± 0.302
2.006MetArg: 2.006 ± 0.487
1.505MetSer: 1.505 ± 0.349
2.808MetThr: 2.808 ± 0.522
1.204MetVal: 1.204 ± 0.355
0.201MetTrp: 0.201 ± 0.148
0.602MetTyr: 0.602 ± 0.207
0.0MetXaa: 0.0 ± 0.0
Asn
3.912AsnAla: 3.912 ± 0.58
0.502AsnCys: 0.502 ± 0.251
2.708AsnAsp: 2.708 ± 0.677
2.508AsnGlu: 2.508 ± 0.66
1.204AsnPhe: 1.204 ± 0.398
4.213AsnGly: 4.213 ± 0.697
0.802AsnHis: 0.802 ± 0.363
3.31AsnIle: 3.31 ± 0.499
2.608AsnLys: 2.608 ± 0.578
3.511AsnLeu: 3.511 ± 0.604
1.204AsnMet: 1.204 ± 0.317
1.906AsnAsn: 1.906 ± 0.26
2.006AsnPro: 2.006 ± 0.476
1.404AsnGln: 1.404 ± 0.303
2.608AsnArg: 2.608 ± 0.583
2.207AsnSer: 2.207 ± 0.563
1.805AsnThr: 1.805 ± 0.354
2.808AsnVal: 2.808 ± 0.426
0.301AsnTrp: 0.301 ± 0.172
1.003AsnTyr: 1.003 ± 0.344
0.0AsnXaa: 0.0 ± 0.0
Pro
3.912ProAla: 3.912 ± 0.806
0.201ProCys: 0.201 ± 0.123
3.21ProAsp: 3.21 ± 0.574
3.611ProGlu: 3.611 ± 0.545
1.204ProPhe: 1.204 ± 0.486
2.307ProGly: 2.307 ± 0.443
1.103ProHis: 1.103 ± 0.421
2.106ProIle: 2.106 ± 0.514
2.608ProLys: 2.608 ± 0.576
4.313ProLeu: 4.313 ± 0.555
0.602ProMet: 0.602 ± 0.237
1.003ProAsn: 1.003 ± 0.39
1.505ProPro: 1.505 ± 0.375
1.605ProGln: 1.605 ± 0.37
2.508ProArg: 2.508 ± 0.67
2.307ProSer: 2.307 ± 0.463
1.404ProThr: 1.404 ± 0.33
4.915ProVal: 4.915 ± 0.816
0.702ProTrp: 0.702 ± 0.246
0.702ProTyr: 0.702 ± 0.247
0.0ProXaa: 0.0 ± 0.0
Gln
3.811GlnAla: 3.811 ± 1.148
0.201GlnCys: 0.201 ± 0.145
2.006GlnAsp: 2.006 ± 0.524
2.106GlnGlu: 2.106 ± 0.601
1.003GlnPhe: 1.003 ± 0.311
1.906GlnGly: 1.906 ± 0.309
0.702GlnHis: 0.702 ± 0.326
2.207GlnIle: 2.207 ± 0.583
2.608GlnLys: 2.608 ± 0.428
3.611GlnLeu: 3.611 ± 0.73
0.802GlnMet: 0.802 ± 0.239
1.103GlnAsn: 1.103 ± 0.354
1.605GlnPro: 1.605 ± 0.395
2.207GlnGln: 2.207 ± 0.627
4.413GlnArg: 4.413 ± 0.73
3.009GlnSer: 3.009 ± 0.596
2.307GlnThr: 2.307 ± 0.424
1.605GlnVal: 1.605 ± 0.408
0.602GlnTrp: 0.602 ± 0.253
0.502GlnTyr: 0.502 ± 0.223
0.0GlnXaa: 0.0 ± 0.0
Arg
5.617ArgAla: 5.617 ± 0.793
1.003ArgCys: 1.003 ± 0.305
3.31ArgAsp: 3.31 ± 0.461
4.714ArgGlu: 4.714 ± 0.739
2.006ArgPhe: 2.006 ± 0.503
3.109ArgGly: 3.109 ± 0.854
1.304ArgHis: 1.304 ± 0.319
4.012ArgIle: 4.012 ± 0.655
3.811ArgLys: 3.811 ± 0.844
6.018ArgLeu: 6.018 ± 0.905
1.404ArgMet: 1.404 ± 0.392
2.608ArgAsn: 2.608 ± 0.575
2.106ArgPro: 2.106 ± 0.49
3.511ArgGln: 3.511 ± 0.627
4.915ArgArg: 4.915 ± 0.71
2.909ArgSer: 2.909 ± 0.488
2.909ArgThr: 2.909 ± 0.451
5.216ArgVal: 5.216 ± 0.847
0.903ArgTrp: 0.903 ± 0.307
3.41ArgTyr: 3.41 ± 0.602
0.0ArgXaa: 0.0 ± 0.0
Ser
6.419SerAla: 6.419 ± 1.034
0.502SerCys: 0.502 ± 0.189
3.511SerAsp: 3.511 ± 0.579
4.313SerGlu: 4.313 ± 0.492
2.006SerPhe: 2.006 ± 0.445
4.814SerGly: 4.814 ± 0.932
1.404SerHis: 1.404 ± 0.571
2.808SerIle: 2.808 ± 0.498
3.31SerLys: 3.31 ± 0.51
6.62SerLeu: 6.62 ± 1.046
1.805SerMet: 1.805 ± 0.409
2.608SerAsn: 2.608 ± 0.511
2.307SerPro: 2.307 ± 0.651
2.307SerGln: 2.307 ± 0.382
3.811SerArg: 3.811 ± 0.623
2.207SerSer: 2.207 ± 0.575
3.912SerThr: 3.912 ± 0.651
5.115SerVal: 5.115 ± 0.943
0.301SerTrp: 0.301 ± 0.16
1.103SerTyr: 1.103 ± 0.392
0.0SerXaa: 0.0 ± 0.0
Thr
6.52ThrAla: 6.52 ± 1.046
0.702ThrCys: 0.702 ± 0.347
3.21ThrAsp: 3.21 ± 0.533
3.31ThrGlu: 3.31 ± 0.591
2.508ThrPhe: 2.508 ± 0.51
6.82ThrGly: 6.82 ± 0.854
1.204ThrHis: 1.204 ± 0.368
3.31ThrIle: 3.31 ± 0.512
2.909ThrLys: 2.909 ± 0.665
6.921ThrLeu: 6.921 ± 0.844
2.207ThrMet: 2.207 ± 0.344
1.304ThrAsn: 1.304 ± 0.295
3.009ThrPro: 3.009 ± 0.406
1.805ThrGln: 1.805 ± 0.56
4.112ThrArg: 4.112 ± 0.612
3.912ThrSer: 3.912 ± 0.612
3.611ThrThr: 3.611 ± 0.842
4.514ThrVal: 4.514 ± 0.689
0.802ThrTrp: 0.802 ± 0.367
0.903ThrTyr: 0.903 ± 0.322
0.0ThrXaa: 0.0 ± 0.0
Val
6.52ValAla: 6.52 ± 0.888
1.103ValCys: 1.103 ± 0.439
4.213ValAsp: 4.213 ± 0.626
3.912ValGlu: 3.912 ± 0.709
2.207ValPhe: 2.207 ± 0.457
4.714ValGly: 4.714 ± 0.89
0.903ValHis: 0.903 ± 0.28
4.313ValIle: 4.313 ± 0.659
4.413ValLys: 4.413 ± 0.684
5.517ValLeu: 5.517 ± 0.936
1.805ValMet: 1.805 ± 0.362
2.106ValAsn: 2.106 ± 0.583
3.109ValPro: 3.109 ± 0.555
1.705ValGln: 1.705 ± 0.4
2.909ValArg: 2.909 ± 0.569
4.213ValSer: 4.213 ± 0.575
5.015ValThr: 5.015 ± 1.05
4.413ValVal: 4.413 ± 0.713
0.802ValTrp: 0.802 ± 0.357
1.605ValTyr: 1.605 ± 0.53
0.0ValXaa: 0.0 ± 0.0
Trp
1.103TrpAla: 1.103 ± 0.296
0.0TrpCys: 0.0 ± 0.0
0.702TrpAsp: 0.702 ± 0.29
0.903TrpGlu: 0.903 ± 0.229
0.201TrpPhe: 0.201 ± 0.131
0.502TrpGly: 0.502 ± 0.283
0.401TrpHis: 0.401 ± 0.233
0.502TrpIle: 0.502 ± 0.267
0.602TrpLys: 0.602 ± 0.254
1.605TrpLeu: 1.605 ± 0.454
0.602TrpMet: 0.602 ± 0.224
0.802TrpAsn: 0.802 ± 0.398
1.204TrpPro: 1.204 ± 0.306
0.502TrpGln: 0.502 ± 0.217
1.605TrpArg: 1.605 ± 0.48
0.802TrpSer: 0.802 ± 0.354
0.401TrpThr: 0.401 ± 0.193
0.502TrpVal: 0.502 ± 0.206
0.502TrpTrp: 0.502 ± 0.208
0.702TrpTyr: 0.702 ± 0.245
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.708TyrAla: 2.708 ± 0.618
0.1TyrCys: 0.1 ± 0.104
1.003TyrAsp: 1.003 ± 0.312
2.608TyrGlu: 2.608 ± 0.481
1.103TyrPhe: 1.103 ± 0.356
2.006TyrGly: 2.006 ± 0.435
0.702TyrHis: 0.702 ± 0.274
2.207TyrIle: 2.207 ± 0.475
0.502TyrLys: 0.502 ± 0.293
2.207TyrLeu: 2.207 ± 0.539
0.702TyrMet: 0.702 ± 0.291
1.003TyrAsn: 1.003 ± 0.295
1.505TyrPro: 1.505 ± 0.399
1.505TyrGln: 1.505 ± 0.446
1.805TyrArg: 1.805 ± 0.515
1.404TyrSer: 1.404 ± 0.291
2.006TyrThr: 2.006 ± 0.526
1.805TyrVal: 1.805 ± 0.432
0.702TyrTrp: 0.702 ± 0.224
0.903TyrTyr: 0.903 ± 0.258
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (9971 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
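A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the estimates is reported. This is a sketch under stated assumptions, not the database's actual procedure: it assumes that "x100" means 100 replicates, that the reported error is the sample standard deviation across replicates, and that frequencies are computed as in the hypothetical dipeptide_per_mille helper defined here.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Per mille frequency of each overlapping dipeptide (assumed scheme)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation of the
    frequency over the bootstrap replicates.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not real proteome data.
toy_proteins = ["AAAA", "ACAC", "CCCC", "AACC"]
toy_error = bootstrap_error(toy_proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.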

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski