Amino acid dipepetide frequency for Taiyuan leafhopper virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.214AlaAla: 5.214 ± 3.083
0.869AlaCys: 0.869 ± 0.305
2.39AlaAsp: 2.39 ± 0.555
4.562AlaGlu: 4.562 ± 0.641
2.607AlaPhe: 2.607 ± 0.448
4.345AlaGly: 4.345 ± 2.384
2.607AlaHis: 2.607 ± 0.93
2.607AlaIle: 2.607 ± 0.448
1.738AlaLys: 1.738 ± 0.248
7.386AlaLeu: 7.386 ± 1.567
2.39AlaMet: 2.39 ± 1.332
4.128AlaAsn: 4.128 ± 1.836
3.041AlaPro: 3.041 ± 0.635
2.172AlaGln: 2.172 ± 0.446
3.91AlaArg: 3.91 ± 3.089
5.214AlaSer: 5.214 ± 2.021
3.041AlaThr: 3.041 ± 0.329
4.779AlaVal: 4.779 ± 2.324
0.652AlaTrp: 0.652 ± 0.534
4.128AlaTyr: 4.128 ± 1.205
0.0AlaXaa: 0.0 ± 0.0
Cys
1.086CysAla: 1.086 ± 0.392
1.086CysCys: 1.086 ± 0.123
0.652CysAsp: 0.652 ± 0.189
0.0CysGlu: 0.0 ± 0.0
0.652CysPhe: 0.652 ± 0.248
0.0CysGly: 0.0 ± 0.0
0.652CysHis: 0.652 ± 0.33
1.738CysIle: 1.738 ± 0.501
0.652CysLys: 0.652 ± 0.6
1.738CysLeu: 1.738 ± 0.248
0.217CysMet: 0.217 ± 0.199
0.652CysAsn: 0.652 ± 0.248
0.869CysPro: 0.869 ± 0.299
1.303CysGln: 1.303 ± 0.628
1.738CysArg: 1.738 ± 0.507
1.521CysSer: 1.521 ± 0.324
1.738CysThr: 1.738 ± 0.501
0.869CysVal: 0.869 ± 0.299
0.0CysTrp: 0.0 ± 0.0
0.652CysTyr: 0.652 ± 0.218
0.0CysXaa: 0.0 ± 0.0
Asp
2.607AspAla: 2.607 ± 1.124
0.434AspCys: 0.434 ± 0.247
1.303AspAsp: 1.303 ± 0.66
3.476AspGlu: 3.476 ± 0.441
2.39AspPhe: 2.39 ± 0.621
1.521AspGly: 1.521 ± 0.324
1.303AspHis: 1.303 ± 0.449
5.214AspIle: 5.214 ± 0.655
2.607AspLys: 2.607 ± 0.52
5.431AspLeu: 5.431 ± 0.581
1.738AspMet: 1.738 ± 0.556
2.172AspAsn: 2.172 ± 1.186
2.39AspPro: 2.39 ± 0.807
2.824AspGln: 2.824 ± 0.691
2.607AspArg: 2.607 ± 0.448
1.955AspSer: 1.955 ± 0.56
4.345AspThr: 4.345 ± 0.778
3.041AspVal: 3.041 ± 0.809
0.869AspTrp: 0.869 ± 0.299
2.172AspTyr: 2.172 ± 0.389
0.0AspXaa: 0.0 ± 0.0
Glu
4.128GluAla: 4.128 ± 0.828
0.217GluCys: 0.217 ± 0.124
3.693GluAsp: 3.693 ± 0.806
4.562GluGlu: 4.562 ± 1.183
2.824GluPhe: 2.824 ± 0.816
3.693GluGly: 3.693 ± 0.775
1.086GluHis: 1.086 ± 0.223
2.607GluIle: 2.607 ± 0.535
2.824GluLys: 2.824 ± 1.081
5.648GluLeu: 5.648 ± 1.922
1.521GluMet: 1.521 ± 0.468
3.259GluAsn: 3.259 ± 0.896
1.521GluPro: 1.521 ± 0.719
2.607GluGln: 2.607 ± 0.789
2.824GluArg: 2.824 ± 1.324
4.128GluSer: 4.128 ± 0.77
1.955GluThr: 1.955 ± 0.765
3.041GluVal: 3.041 ± 0.701
0.434GluTrp: 0.434 ± 0.331
1.303GluTyr: 1.303 ± 0.696
0.0GluXaa: 0.0 ± 0.0
Phe
2.39PheAla: 2.39 ± 0.153
1.521PheCys: 1.521 ± 0.116
2.607PheAsp: 2.607 ± 0.475
2.824PheGlu: 2.824 ± 0.533
1.738PhePhe: 1.738 ± 0.61
1.303PheGly: 1.303 ± 0.498
1.955PheHis: 1.955 ± 0.672
1.086PheIle: 1.086 ± 0.42
1.521PheLys: 1.521 ± 0.464
3.476PheLeu: 3.476 ± 0.754
1.086PheMet: 1.086 ± 0.123
1.738PheAsn: 1.738 ± 0.203
2.172PhePro: 2.172 ± 0.581
1.521PheGln: 1.521 ± 0.324
3.476PheArg: 3.476 ± 1.128
3.041PheSer: 3.041 ± 0.554
1.955PheThr: 1.955 ± 0.636
1.521PheVal: 1.521 ± 0.71
0.652PheTrp: 0.652 ± 0.189
0.869PheTyr: 0.869 ± 0.494
0.0PheXaa: 0.0 ± 0.0
Gly
3.476GlyAla: 3.476 ± 0.787
0.869GlyCys: 0.869 ± 0.127
3.041GlyAsp: 3.041 ± 0.669
3.476GlyGlu: 3.476 ± 0.246
1.521GlyPhe: 1.521 ± 0.532
3.91GlyGly: 3.91 ± 1.472
1.955GlyHis: 1.955 ± 0.238
4.345GlyIle: 4.345 ± 1.131
2.39GlyLys: 2.39 ± 0.153
3.476GlyLeu: 3.476 ± 0.55
1.955GlyMet: 1.955 ± 0.217
2.172GlyAsn: 2.172 ± 1.468
1.738GlyPro: 1.738 ± 0.91
1.738GlyGln: 1.738 ± 0.248
1.955GlyArg: 1.955 ± 0.563
3.259GlySer: 3.259 ± 0.669
3.041GlyThr: 3.041 ± 0.547
3.693GlyVal: 3.693 ± 0.742
1.086GlyTrp: 1.086 ± 0.123
1.303GlyTyr: 1.303 ± 0.128
0.0GlyXaa: 0.0 ± 0.0
His
3.476HisAla: 3.476 ± 0.926
0.217HisCys: 0.217 ± 0.124
0.869HisAsp: 0.869 ± 0.494
1.521HisGlu: 1.521 ± 0.464
1.738HisPhe: 1.738 ± 0.749
1.303HisGly: 1.303 ± 0.378
2.172HisHis: 2.172 ± 0.65
3.041HisIle: 3.041 ± 1.064
1.086HisLys: 1.086 ± 0.318
2.824HisLeu: 2.824 ± 0.413
0.434HisMet: 0.434 ± 0.247
1.955HisAsn: 1.955 ± 0.303
1.955HisPro: 1.955 ± 0.605
1.955HisGln: 1.955 ± 0.567
0.869HisArg: 0.869 ± 0.501
1.521HisSer: 1.521 ± 0.459
1.086HisThr: 1.086 ± 0.478
2.172HisVal: 2.172 ± 0.389
0.869HisTrp: 0.869 ± 0.299
1.086HisTyr: 1.086 ± 0.472
0.0HisXaa: 0.0 ± 0.0
Ile
4.562IleAla: 4.562 ± 1.916
1.303IleCys: 1.303 ± 0.128
3.693IleAsp: 3.693 ± 1.062
2.607IleGlu: 2.607 ± 0.448
2.172IlePhe: 2.172 ± 1.236
3.476IleGly: 3.476 ± 0.285
1.521IleHis: 1.521 ± 0.459
3.041IleIle: 3.041 ± 0.329
4.345IleLys: 4.345 ± 0.801
5.648IleLeu: 5.648 ± 0.955
2.39IleMet: 2.39 ± 0.902
5.214IleAsn: 5.214 ± 1.153
4.997IlePro: 4.997 ± 1.426
1.955IleGln: 1.955 ± 0.5
5.431IleArg: 5.431 ± 0.615
4.345IleSer: 4.345 ± 1.33
3.476IleThr: 3.476 ± 0.622
3.693IleVal: 3.693 ± 0.724
0.652IleTrp: 0.652 ± 0.189
1.303IleTyr: 1.303 ± 0.912
0.0IleXaa: 0.0 ± 0.0
Lys
3.041LysAla: 3.041 ± 0.317
0.217LysCys: 0.217 ± 0.199
2.607LysAsp: 2.607 ± 0.52
1.086LysGlu: 1.086 ± 0.435
1.521LysPhe: 1.521 ± 0.642
1.521LysGly: 1.521 ± 0.618
0.869LysHis: 0.869 ± 0.282
2.607LysIle: 2.607 ± 0.605
1.738LysLys: 1.738 ± 0.203
5.866LysLeu: 5.866 ± 1.138
1.303LysMet: 1.303 ± 0.496
2.39LysAsn: 2.39 ± 0.911
0.869LysPro: 0.869 ± 0.524
1.521LysGln: 1.521 ± 0.642
1.521LysArg: 1.521 ± 0.464
3.041LysSer: 3.041 ± 0.996
2.39LysThr: 2.39 ± 0.487
3.041LysVal: 3.041 ± 0.781
0.652LysTrp: 0.652 ± 0.371
1.955LysTyr: 1.955 ± 0.421
0.0LysXaa: 0.0 ± 0.0
Leu
8.69LeuAla: 8.69 ± 1.055
3.041LeuCys: 3.041 ± 0.833
4.562LeuAsp: 4.562 ± 1.16
5.866LeuGlu: 5.866 ± 1.014
3.476LeuPhe: 3.476 ± 1.167
5.866LeuGly: 5.866 ± 0.711
2.607LeuHis: 2.607 ± 0.507
6.735LeuIle: 6.735 ± 0.959
4.562LeuLys: 4.562 ± 1.685
10.645LeuLeu: 10.645 ± 1.864
2.39LeuMet: 2.39 ± 0.344
6.3LeuAsn: 6.3 ± 0.494
6.3LeuPro: 6.3 ± 0.494
6.735LeuGln: 6.735 ± 0.542
4.779LeuArg: 4.779 ± 1.444
7.386LeuSer: 7.386 ± 0.425
7.169LeuThr: 7.169 ± 0.795
6.517LeuVal: 6.517 ± 0.566
1.086LeuTrp: 1.086 ± 0.123
4.345LeuTyr: 4.345 ± 0.873
0.0LeuXaa: 0.0 ± 0.0
Met
0.869MetAla: 0.869 ± 0.834
0.652MetCys: 0.652 ± 0.189
0.652MetAsp: 0.652 ± 0.371
1.521MetGlu: 1.521 ± 0.642
0.869MetPhe: 0.869 ± 0.299
2.39MetGly: 2.39 ± 0.555
0.869MetHis: 0.869 ± 0.282
1.303MetIle: 1.303 ± 0.454
2.172MetLys: 2.172 ± 0.392
3.693MetLeu: 3.693 ± 0.81
0.869MetMet: 0.869 ± 0.903
1.303MetAsn: 1.303 ± 0.742
0.869MetPro: 0.869 ± 0.299
1.303MetGln: 1.303 ± 0.378
1.955MetArg: 1.955 ± 0.567
1.955MetSer: 1.955 ± 0.457
2.39MetThr: 2.39 ± 0.456
1.303MetVal: 1.303 ± 0.847
1.086MetTrp: 1.086 ± 0.472
0.217MetTyr: 0.217 ± 0.124
0.0MetXaa: 0.0 ± 0.0
Asn
3.041AsnAla: 3.041 ± 2.099
0.434AsnCys: 0.434 ± 0.15
2.824AsnAsp: 2.824 ± 0.533
2.607AsnGlu: 2.607 ± 0.744
1.738AsnPhe: 1.738 ± 0.725
1.303AsnGly: 1.303 ± 0.461
0.869AsnHis: 0.869 ± 0.524
3.91AsnIle: 3.91 ± 1.165
2.824AsnLys: 2.824 ± 0.432
5.214AsnLeu: 5.214 ± 1.325
1.303AsnMet: 1.303 ± 0.378
2.39AsnAsn: 2.39 ± 0.372
3.693AsnPro: 3.693 ± 0.742
2.39AsnGln: 2.39 ± 0.434
4.128AsnArg: 4.128 ± 0.479
3.259AsnSer: 3.259 ± 1.089
2.39AsnThr: 2.39 ± 0.372
3.041AsnVal: 3.041 ± 1.306
1.303AsnTrp: 1.303 ± 0.496
3.041AsnTyr: 3.041 ± 0.329
0.0AsnXaa: 0.0 ± 0.0
Pro
3.259ProAla: 3.259 ± 1.513
1.086ProCys: 1.086 ± 0.392
3.476ProAsp: 3.476 ± 0.506
2.607ProGlu: 2.607 ± 0.423
2.39ProPhe: 2.39 ± 0.38
2.39ProGly: 2.39 ± 0.153
1.955ProHis: 1.955 ± 0.377
3.476ProIle: 3.476 ± 0.984
0.652ProLys: 0.652 ± 0.475
6.517ProLeu: 6.517 ± 0.51
0.869ProMet: 0.869 ± 0.337
1.955ProAsn: 1.955 ± 0.217
2.607ProPro: 2.607 ± 0.994
2.607ProGln: 2.607 ± 0.723
1.738ProArg: 1.738 ± 0.407
2.824ProSer: 2.824 ± 0.814
3.259ProThr: 3.259 ± 0.485
2.39ProVal: 2.39 ± 0.807
1.738ProTrp: 1.738 ± 0.415
1.521ProTyr: 1.521 ± 0.334
0.0ProXaa: 0.0 ± 0.0
Gln
2.824GlnAla: 2.824 ± 1.463
0.434GlnCys: 0.434 ± 0.15
1.521GlnAsp: 1.521 ± 0.719
3.91GlnGlu: 3.91 ± 0.96
2.172GlnPhe: 2.172 ± 0.783
1.955GlnGly: 1.955 ± 0.672
1.955GlnHis: 1.955 ± 0.605
4.779GlnIle: 4.779 ± 1.228
1.521GlnLys: 1.521 ± 0.793
6.083GlnLeu: 6.083 ± 1.109
0.869GlnMet: 0.869 ± 0.282
2.607GlnAsn: 2.607 ± 0.723
1.521GlnPro: 1.521 ± 1.078
1.955GlnGln: 1.955 ± 0.377
1.303GlnArg: 1.303 ± 0.128
1.738GlnSer: 1.738 ± 0.203
3.91GlnThr: 3.91 ± 0.833
3.91GlnVal: 3.91 ± 0.63
0.869GlnTrp: 0.869 ± 0.127
0.869GlnTyr: 0.869 ± 0.305
0.0GlnXaa: 0.0 ± 0.0
Arg
3.041ArgAla: 3.041 ± 1.085
1.086ArgCys: 1.086 ± 0.318
4.562ArgAsp: 4.562 ± 0.47
2.39ArgGlu: 2.39 ± 0.844
1.955ArgPhe: 1.955 ± 0.315
3.041ArgGly: 3.041 ± 1.056
1.955ArgHis: 1.955 ± 0.563
3.259ArgIle: 3.259 ± 0.578
1.086ArgLys: 1.086 ± 0.392
5.866ArgLeu: 5.866 ± 0.65
0.652ArgMet: 0.652 ± 0.33
2.824ArgAsn: 2.824 ± 0.472
2.607ArgPro: 2.607 ± 0.996
3.476ArgGln: 3.476 ± 0.937
4.128ArgArg: 4.128 ± 1.091
3.91ArgSer: 3.91 ± 1.343
3.259ArgThr: 3.259 ± 1.044
7.821ArgVal: 7.821 ± 1.424
0.217ArgTrp: 0.217 ± 0.124
1.521ArgTyr: 1.521 ± 0.324
0.0ArgXaa: 0.0 ± 0.0
Ser
4.779SerAla: 4.779 ± 0.794
0.869SerCys: 0.869 ± 0.337
3.041SerAsp: 3.041 ± 0.889
2.39SerGlu: 2.39 ± 1.63
1.738SerPhe: 1.738 ± 0.564
3.91SerGly: 3.91 ± 1.299
1.521SerHis: 1.521 ± 0.324
4.997SerIle: 4.997 ± 1.05
3.259SerLys: 3.259 ± 0.884
8.255SerLeu: 8.255 ± 2.669
1.955SerMet: 1.955 ± 0.989
3.041SerAsn: 3.041 ± 0.554
2.172SerPro: 2.172 ± 0.434
2.607SerGln: 2.607 ± 0.535
4.779SerArg: 4.779 ± 0.964
4.997SerSer: 4.997 ± 1.406
3.476SerThr: 3.476 ± 1.063
4.128SerVal: 4.128 ± 1.612
0.434SerTrp: 0.434 ± 0.15
1.738SerTyr: 1.738 ± 1.092
0.0SerXaa: 0.0 ± 0.0
Thr
4.345ThrAla: 4.345 ± 0.492
1.086ThrCys: 1.086 ± 0.223
2.172ThrAsp: 2.172 ± 0.629
3.041ThrGlu: 3.041 ± 0.554
1.738ThrPhe: 1.738 ± 0.203
2.824ThrGly: 2.824 ± 0.712
2.607ThrHis: 2.607 ± 0.535
3.476ThrIle: 3.476 ± 1.382
1.521ThrLys: 1.521 ± 0.883
7.604ThrLeu: 7.604 ± 1.838
2.39ThrMet: 2.39 ± 0.555
3.041ThrAsn: 3.041 ± 0.828
3.041ThrPro: 3.041 ± 0.648
3.693ThrGln: 3.693 ± 0.796
4.562ThrArg: 4.562 ± 1.326
3.476ThrSer: 3.476 ± 1.383
4.128ThrThr: 4.128 ± 1.611
3.041ThrVal: 3.041 ± 0.387
1.955ThrTrp: 1.955 ± 0.315
1.303ThrTyr: 1.303 ± 0.404
0.0ThrXaa: 0.0 ± 0.0
Val
4.345ValAla: 4.345 ± 1.664
1.521ValCys: 1.521 ± 0.34
3.693ValAsp: 3.693 ± 0.516
2.824ValGlu: 2.824 ± 0.585
2.172ValPhe: 2.172 ± 0.287
3.259ValGly: 3.259 ± 0.411
1.955ValHis: 1.955 ± 0.567
4.128ValIle: 4.128 ± 0.755
1.955ValLys: 1.955 ± 0.654
7.169ValLeu: 7.169 ± 1.791
2.172ValMet: 2.172 ± 0.392
3.476ValAsn: 3.476 ± 0.741
3.693ValPro: 3.693 ± 1.064
2.172ValGln: 2.172 ± 0.206
3.693ValArg: 3.693 ± 1.291
3.91ValSer: 3.91 ± 0.642
5.866ValThr: 5.866 ± 1.165
4.562ValVal: 4.562 ± 1.041
1.303ValTrp: 1.303 ± 0.361
1.955ValTyr: 1.955 ± 0.684
0.0ValXaa: 0.0 ± 0.0
Trp
0.869TrpAla: 0.869 ± 0.503
0.217TrpCys: 0.217 ± 0.124
1.521TrpAsp: 1.521 ± 0.34
0.869TrpGlu: 0.869 ± 0.305
0.652TrpPhe: 0.652 ± 0.248
1.521TrpGly: 1.521 ± 0.714
0.0TrpHis: 0.0 ± 0.0
1.303TrpIle: 1.303 ± 0.128
0.217TrpLys: 0.217 ± 0.124
1.521TrpLeu: 1.521 ± 0.459
0.869TrpMet: 0.869 ± 0.299
0.652TrpAsn: 0.652 ± 0.371
0.652TrpPro: 0.652 ± 0.248
0.869TrpGln: 0.869 ± 0.127
0.652TrpArg: 0.652 ± 0.33
1.303TrpSer: 1.303 ± 0.66
0.434TrpThr: 0.434 ± 0.398
1.086TrpVal: 1.086 ± 0.394
0.0TrpTrp: 0.0 ± 0.0
0.652TrpTyr: 0.652 ± 0.371
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.738TyrAla: 1.738 ± 0.622
0.652TyrCys: 0.652 ± 0.33
1.738TyrAsp: 1.738 ± 0.793
1.738TyrGlu: 1.738 ± 0.501
2.39TyrPhe: 2.39 ± 0.38
0.869TyrGly: 0.869 ± 0.492
1.738TyrHis: 1.738 ± 0.608
2.172TyrIle: 2.172 ± 0.65
1.086TyrLys: 1.086 ± 0.223
4.997TyrLeu: 4.997 ± 1.873
0.652TyrMet: 0.652 ± 0.193
0.434TyrAsn: 0.434 ± 0.246
2.607TyrPro: 2.607 ± 0.554
1.303TyrGln: 1.303 ± 0.509
2.607TyrArg: 2.607 ± 0.186
1.303TyrSer: 1.303 ± 0.628
1.738TyrThr: 1.738 ± 0.248
2.172TyrVal: 2.172 ± 0.55
0.0TyrTrp: 0.0 ± 0.0
1.086TyrTyr: 1.086 ± 0.547
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4604 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski