Amino acid dipeptide frequency for Estero Real virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
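The counting described above can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_permille`, the one-letter residue codes, the mapping of every non-standard letter to `X` (Xaa), and the normalization by the total number of dipeptides are all assumptions, since the page does not state how the table was normalized.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)


def dipeptide_permille(proteins):
    """Return the frequency of all 441 dipeptides in per mille.

    Assumption: overlapping pairs are counted within each protein only
    (never across protein boundaries) and normalized by the total number
    of pairs found.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    keys = [a + b for a, b in product(ALPHABET, repeat=2)]
    if total == 0:
        return {k: 0.0 for k in keys}
    return {k: 1000.0 * counts[k] / total for k in keys}


# Two toy sequences (hypothetical, not from the Estero Real virus proteome).
freqs = dipeptide_permille(["MKLLSA", "MKLV"])
print(len(freqs))    # number of dipeptide types, including Xaa pairs
print(freqs["MK"])   # MK occurs in 2 of the 8 pairs
```

With the 20 standard residues only, the dictionary would hold 400 entries; including Xaa gives the 441 entries of the table.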

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.
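As a small worked example of the unit conversion and of the comparison with the uniform expectation (0.25%, i.e. 2.5‰ for 400 dipeptides), the sketch below uses hypothetical helper names. Whether the page colors cells against 1/400 or 1/441 is not stated, so the number of residue types is left as a parameter.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation per dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3


def classify(observed_permille, n_residue_types=20):
    """Label a dipeptide as over- or underrepresented versus uniform."""
    expected = expected_permille(n_residue_types)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


print(expected_permille(20))          # 2.5 per mille = 0.25%
print(classify(10.29))                # LeuSer value from the table
print(classify(0.343))                # AlaHis value from the table
print(permille_to_fraction(10.29))    # LeuSer as a fraction
```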

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.886 ± 0.694
AlaCys: 1.543 ± 0.07
AlaAsp: 1.886 ± 0.181
AlaGlu: 2.915 ± 0.225
AlaPhe: 2.058 ± 0.27
AlaGly: 3.43 ± 1.133
AlaHis: 0.343 ± 0.128
AlaIle: 3.601 ± 0.277
AlaLys: 3.258 ± 0.461
AlaLeu: 6.688 ± 1.119
AlaMet: 0.857 ± 0.758
AlaAsn: 1.715 ± 0.397
AlaPro: 1.372 ± 0.687
AlaGln: 1.2 ± 0.483
AlaArg: 1.886 ± 0.411
AlaSer: 4.116 ± 1.234
AlaThr: 2.915 ± 0.189
AlaVal: 3.43 ± 0.419
AlaTrp: 0.686 ± 0.126
AlaTyr: 1.715 ± 0.756
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.715 ± 0.38
CysCys: 1.372 ± 0.277
CysAsp: 1.372 ± 0.687
CysGlu: 1.372 ± 0.514
CysPhe: 1.715 ± 0.397
CysGly: 1.2 ± 0.78
CysHis: 0.857 ± 0.19
CysIle: 1.543 ± 0.595
CysLys: 1.543 ± 0.259
CysLeu: 2.229 ± 0.473
CysMet: 0.171 ± 0.094
CysAsn: 1.543 ± 0.997
CysPro: 1.543 ± 0.714
CysGln: 0.857 ± 0.469
CysArg: 2.744 ± 0.44
CysSer: 2.401 ± 0.443
CysThr: 2.058 ± 1.045
CysVal: 1.372 ± 0.277
CysTrp: 0.514 ± 0.086
CysTyr: 0.857 ± 0.746
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.915 ± 1.107
AspCys: 2.401 ± 0.39
AspAsp: 3.087 ± 0.542
AspGlu: 3.944 ± 1.295
AspPhe: 1.715 ± 0.1
AspGly: 2.915 ± 0.77
AspHis: 0.857 ± 0.198
AspIle: 3.43 ± 0.777
AspLys: 3.087 ± 0.896
AspLeu: 6.688 ± 0.158
AspMet: 1.029 ± 0.873
AspAsn: 2.915 ± 0.578
AspPro: 1.543 ± 0.304
AspGln: 2.058 ± 0.27
AspArg: 2.401 ± 0.899
AspSer: 4.459 ± 0.631
AspThr: 3.087 ± 0.463
AspVal: 3.258 ± 0.737
AspTrp: 1.2 ± 0.819
AspTyr: 2.572 ± 0.57
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.773 ± 0.378
GluCys: 1.543 ± 0.328
GluAsp: 4.116 ± 0.057
GluGlu: 4.459 ± 0.712
GluPhe: 3.43 ± 0.686
GluGly: 3.43 ± 0.729
GluHis: 2.058 ± 0.617
GluIle: 4.63 ± 0.913
GluLys: 3.258 ± 0.635
GluLeu: 7.546 ± 0.63
GluMet: 1.886 ± 0.421
GluAsn: 3.087 ± 0.542
GluPro: 2.915 ± 0.266
GluGln: 3.258 ± 0.633
GluArg: 2.915 ± 0.578
GluSer: 3.944 ± 0.693
GluThr: 4.287 ± 0.338
GluVal: 5.831 ± 0.378
GluTrp: 0.686 ± 0.126
GluTyr: 1.372 ± 0.253
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.372 ± 0.132
PheCys: 1.372 ± 0.132
PheAsp: 2.572 ± 0.672
PheGlu: 2.058 ± 0.27
PhePhe: 2.229 ± 0.075
PheGly: 2.572 ± 0.36
PheHis: 0.857 ± 0.459
PheIle: 2.915 ± 0.79
PheLys: 3.601 ± 0.277
PheLeu: 3.944 ± 0.735
PheMet: 1.029 ± 0.385
PheAsn: 1.886 ± 0.649
PhePro: 1.2 ± 0.452
PheGln: 1.2 ± 0.218
PheArg: 1.543 ± 0.823
PheSer: 4.116 ± 1.389
PheThr: 3.258 ± 0.35
PheVal: 2.572 ± 0.36
PheTrp: 0.171 ± 0.208
PheTyr: 2.229 ± 0.235
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.058 ± 0.886
GlyCys: 2.058 ± 2.095
GlyAsp: 3.087 ± 1.526
GlyGlu: 3.258 ± 0.633
GlyPhe: 2.401 ± 0.63
GlyGly: 2.915 ± 1.285
GlyHis: 0.857 ± 0.205
GlyIle: 4.287 ± 0.695
GlyLys: 3.43 ± 0.478
GlyLeu: 5.488 ± 1.864
GlyMet: 0.686 ± 0.257
GlyAsn: 1.372 ± 0.253
GlyPro: 1.715 ± 0.798
GlyGln: 2.401 ± 0.436
GlyArg: 3.258 ± 0.145
GlySer: 3.944 ± 0.325
GlyThr: 3.258 ± 0.524
GlyVal: 3.43 ± 0.403
GlyTrp: 0.857 ± 0.568
GlyTyr: 1.2 ± 0.218
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.886 ± 0.576
HisCys: 1.029 ± 0.385
HisAsp: 0.514 ± 0.332
HisGlu: 1.372 ± 1.261
HisPhe: 1.2 ± 0.384
HisGly: 1.543 ± 0.997
HisHis: 0.514 ± 0.086
HisIle: 0.857 ± 0.198
HisLys: 1.2 ± 0.384
HisLeu: 1.886 ± 0.317
HisMet: 1.029 ± 0.504
HisAsn: 1.029 ± 0.563
HisPro: 0.857 ± 0.19
HisGln: 0.686 ± 0.257
HisArg: 1.372 ± 0.253
HisSer: 1.372 ± 0.253
HisThr: 1.2 ± 0.322
HisVal: 1.2 ± 0.587
HisTrp: 0.171 ± 0.094
HisTyr: 0.686 ± 0.126
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.372 ± 0.228
IleCys: 2.058 ± 0.029
IleAsp: 2.915 ± 0.53
IleGlu: 3.944 ± 0.141
IlePhe: 2.572 ± 0.442
IleGly: 2.744 ± 0.505
IleHis: 1.715 ± 0.1
IleIle: 4.116 ± 1.105
IleLys: 5.831 ± 1.489
IleLeu: 6.86 ± 1.872
IleMet: 1.543 ± 0.259
IleAsn: 3.601 ± 0.277
IlePro: 2.572 ± 0.442
IleGln: 1.886 ± 0.771
IleArg: 2.572 ± 1.127
IleSer: 6.86 ± 1.055
IleThr: 3.258 ± 0.145
IleVal: 4.116 ± 0.617
IleTrp: 0.686 ± 0.126
IleTyr: 1.2 ± 0.195
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.488 ± 1.289
LysCys: 1.372 ± 1.078
LysAsp: 6.174 ± 1.218
LysGlu: 7.546 ± 1.313
LysPhe: 3.087 ± 0.609
LysGly: 3.43 ± 1.565
LysHis: 1.2 ± 0.195
LysIle: 4.116 ± 0.828
LysLys: 6.002 ± 0.693
LysLeu: 7.889 ± 1.21
LysMet: 1.2 ± 0.25
LysAsn: 3.087 ± 0.883
LysPro: 3.601 ± 1.415
LysGln: 3.773 ± 0.361
LysArg: 2.572 ± 0.887
LysSer: 4.459 ± 0.315
LysThr: 4.287 ± 0.921
LysVal: 3.944 ± 0.745
LysTrp: 1.029 ± 0.584
LysTyr: 0.686 ± 0.251
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.488 ± 0.81
LeuCys: 2.744 ± 0.645
LeuAsp: 4.973 ± 1.584
LeuGlu: 8.403 ± 0.552
LeuPhe: 4.459 ± 0.466
LeuGly: 3.944 ± 0.618
LeuHis: 2.229 ± 0.529
LeuIle: 5.145 ± 1.386
LeuLys: 9.432 ± 1.443
LeuLeu: 8.918 ± 0.506
LeuMet: 2.058 ± 0.886
LeuAsn: 5.488 ± 1.855
LeuPro: 3.258 ± 1.098
LeuGln: 3.944 ± 0.735
LeuArg: 4.63 ± 0.816
LeuSer: 10.29 ± 2.836
LeuThr: 7.203 ± 0.85
LeuVal: 5.488 ± 0.328
LeuTrp: 0.686 ± 0.257
LeuTyr: 3.43 ± 0.632
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.686 ± 0.257
MetCys: 0.686 ± 0.257
MetAsp: 1.715 ± 0.41
MetGlu: 1.029 ± 0.673
MetPhe: 1.2 ± 0.195
MetGly: 0.857 ± 0.469
MetHis: 0.514 ± 0.586
MetIle: 2.401 ± 0.644
MetLys: 1.543 ± 1.449
MetLeu: 2.401 ± 0.646
MetMet: 1.029 ± 0.172
MetAsn: 0.857 ± 0.294
MetPro: 0.857 ± 0.205
MetGln: 0.857 ± 0.205
MetArg: 0.857 ± 0.205
MetSer: 2.915 ± 0.7
MetThr: 1.543 ± 0.64
MetVal: 1.543 ± 0.259
MetTrp: 0.0 ± 0.0
MetTyr: 0.514 ± 0.332
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.857 ± 0.568
AsnCys: 1.543 ± 0.714
AsnAsp: 2.744 ± 0.64
AsnGlu: 1.543 ± 0.442
AsnPhe: 2.572 ± 0.672
AsnGly: 2.572 ± 0.723
AsnHis: 1.886 ± 0.317
AsnIle: 3.258 ± 0.145
AsnLys: 3.087 ± 0.812
AsnLeu: 4.973 ± 0.428
AsnMet: 2.229 ± 0.312
AsnAsn: 1.886 ± 0.117
AsnPro: 2.572 ± 0.261
AsnGln: 0.857 ± 0.19
AsnArg: 2.572 ± 0.723
AsnSer: 3.773 ± 1.508
AsnThr: 2.058 ± 0.27
AsnVal: 2.229 ± 0.785
AsnTrp: 1.029 ± 0.385
AsnTyr: 2.058 ± 0.029
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.372 ± 0.132
ProCys: 0.343 ± 0.264
ProAsp: 2.744 ± 0.127
ProGlu: 2.744 ± 0.951
ProPhe: 1.372 ± 0.791
ProGly: 2.229 ± 1.324
ProHis: 0.686 ± 0.493
ProIle: 2.058 ± 0.566
ProLys: 3.944 ± 0.871
ProLeu: 2.058 ± 0.519
ProMet: 0.514 ± 0.24
ProAsn: 1.029 ± 0.293
ProPro: 1.2 ± 0.452
ProGln: 0.343 ± 0.128
ProArg: 2.401 ± 0.157
ProSer: 3.601 ± 1.397
ProThr: 3.087 ± 0.693
ProVal: 1.029 ± 0.909
ProTrp: 0.686 ± 0.257
ProTyr: 1.372 ± 0.513
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.715 ± 0.553
GlnCys: 1.2 ± 0.195
GlnAsp: 2.229 ± 0.952
GlnGlu: 2.401 ± 0.454
GlnPhe: 1.372 ± 0.475
GlnGly: 1.886 ± 0.98
GlnHis: 0.686 ± 0.126
GlnIle: 1.886 ± 0.754
GlnLys: 2.229 ± 0.315
GlnLeu: 3.601 ± 0.703
GlnMet: 0.686 ± 0.251
GlnAsn: 1.372 ± 0.132
GlnPro: 0.514 ± 0.281
GlnGln: 2.058 ± 0.847
GlnArg: 1.886 ± 0.181
GlnSer: 2.058 ± 1.009
GlnThr: 2.229 ± 0.452
GlnVal: 3.258 ± 0.204
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.029 ± 0.357
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.886 ± 0.411
ArgCys: 0.686 ± 0.126
ArgAsp: 3.601 ± 0.864
ArgGlu: 2.401 ± 0.532
ArgPhe: 1.886 ± 0.411
ArgGly: 2.058 ± 0.886
ArgHis: 1.886 ± 0.359
ArgIle: 3.087 ± 0.511
ArgLys: 2.744 ± 0.285
ArgLeu: 6.002 ± 1.317
ArgMet: 1.029 ± 0.172
ArgAsn: 3.258 ± 1.456
ArgPro: 0.857 ± 0.459
ArgGln: 0.857 ± 0.205
ArgArg: 1.2 ± 0.819
ArgSer: 3.601 ± 0.477
ArgThr: 4.116 ± 0.69
ArgVal: 2.744 ± 0.457
ArgTrp: 0.686 ± 0.257
ArgTyr: 2.744 ± 0.127
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.116 ± 0.634
SerCys: 3.43 ± 1.024
SerAsp: 3.601 ± 0.703
SerGlu: 7.203 ± 1.646
SerPhe: 3.773 ± 0.666
SerGly: 6.345 ± 0.825
SerHis: 1.2 ± 0.195
SerIle: 4.287 ± 1.198
SerLys: 6.517 ± 1.281
SerLeu: 7.717 ± 1.119
SerMet: 2.744 ± 0.951
SerAsn: 4.116 ± 0.685
SerPro: 2.744 ± 0.285
SerGln: 2.058 ± 0.566
SerArg: 3.258 ± 0.615
SerSer: 10.461 ± 2.669
SerThr: 4.63 ± 1.109
SerVal: 5.659 ± 0.505
SerTrp: 1.029 ± 0.584
SerTyr: 2.744 ± 0.227
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.087 ± 0.463
ThrCys: 1.2 ± 0.322
ThrAsp: 2.058 ± 0.029
ThrGlu: 5.316 ± 1.113
ThrPhe: 1.543 ± 0.328
ThrGly: 2.744 ± 0.645
ThrHis: 1.2 ± 0.195
ThrIle: 4.459 ± 0.431
ThrLys: 4.116 ± 0.436
ThrLeu: 4.459 ± 0.712
ThrMet: 1.543 ± 0.303
ThrAsn: 1.886 ± 0.468
ThrPro: 2.572 ± 0.392
ThrGln: 2.058 ± 0.586
ThrArg: 2.915 ± 0.225
ThrSer: 7.374 ± 0.865
ThrThr: 2.572 ± 0.616
ThrVal: 6.002 ± 2.047
ThrTrp: 0.857 ± 0.205
ThrTyr: 2.744 ± 0.285
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.43 ± 0.76
ValCys: 1.543 ± 0.889
ValAsp: 3.773 ± 1.231
ValGlu: 4.459 ± 0.47
ValPhe: 1.886 ± 0.181
ValGly: 2.401 ± 0.436
ValHis: 0.857 ± 0.459
ValIle: 4.63 ± 0.444
ValLys: 6.517 ± 0.699
ValLeu: 7.546 ± 1.018
ValMet: 1.372 ± 0.294
ValAsn: 3.087 ± 0.684
ValPro: 2.058 ± 0.975
ValGln: 2.058 ± 0.77
ValArg: 3.43 ± 0.459
ValSer: 4.63 ± 0.62
ValThr: 2.915 ± 0.196
ValVal: 4.459 ± 0.523
ValTrp: 0.857 ± 0.568
ValTyr: 1.715 ± 0.1
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.686 ± 0.539
TrpCys: 0.343 ± 0.128
TrpAsp: 0.686 ± 0.493
TrpGlu: 0.857 ± 0.294
TrpPhe: 0.514 ± 0.337
TrpGly: 1.029 ± 0.504
TrpHis: 0.343 ± 0.128
TrpIle: 0.343 ± 0.188
TrpLys: 1.543 ± 0.596
TrpLeu: 1.2 ± 0.171
TrpMet: 0.514 ± 0.369
TrpAsn: 0.686 ± 0.126
TrpPro: 0.171 ± 0.208
TrpGln: 0.343 ± 0.415
TrpArg: 0.686 ± 0.257
TrpSer: 0.686 ± 0.126
TrpThr: 0.857 ± 0.459
TrpVal: 0.857 ± 0.758
TrpTrp: 0.171 ± 0.094
TrpTyr: 0.343 ± 0.264
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.715 ± 0.927
TyrCys: 0.514 ± 0.281
TyrAsp: 1.715 ± 0.683
TyrGlu: 1.2 ± 0.218
TyrPhe: 1.715 ± 0.918
TyrGly: 1.715 ± 0.274
TyrHis: 0.857 ± 0.205
TyrIle: 1.372 ± 0.475
TyrLys: 2.229 ± 1.324
TyrLeu: 4.116 ± 0.348
TyrMet: 0.686 ± 0.257
TyrAsn: 2.401 ± 0.532
TyrPro: 0.686 ± 0.257
TyrGln: 1.543 ± 0.303
TyrArg: 2.229 ± 0.452
TyrSer: 2.572 ± 0.442
TyrThr: 1.886 ± 0.709
TyrVal: 1.2 ± 0.218
TyrTrp: 0.857 ± 0.294
TyrTyr: 0.857 ± 0.19
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5832 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
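Protein-level bootstrapping can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The page does not describe the exact procedure, so the function names, the use of the sample standard deviation, the fixed seed, and the normalization by total dipeptide count are assumptions made for illustration.

```python
import random
import statistics


def frequency_permille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):   # overlapping adjacent pairs
            total += 1
            if a + b == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)            # seeded for reproducibility
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(frequency_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences (hypothetical, not the Estero Real virus proteins).
toy = ["MKLLSA", "MKLV", "LLLLKA"]
print(frequency_permille(toy, "LL"))
print(bootstrap_error(toy, "LL"))
```

With only three proteins, as in this proteome, each replicate can differ strongly from the others, which is consistent with the relatively large errors next to some values in the table.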

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski