Amino acid dipeptide frequency for Xingshan nematode virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
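The calculation described above can be sketched in Python as follows. This is only an illustrative sketch, not the code behind this page: the function names, the one-letter residue codes, the choice to count overlapping pairs within each protein, and the normalisation by the total number of counted pairs are all assumptions, and the page's own normalisation or colour thresholds may differ.

```python
from collections import Counter

# The 20 standard residues in one-letter code; anything else is treated as Xaa ('X').
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted within each protein and the
    counts are divided by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to 'X' (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(value, expected=EXPECTED):
    """Label a per-mille value relative to the uniform expectation."""
    if value > expected:
        return "over"    # would be shown in red
    if value < expected:
        return "under"   # would be shown in blue
    return "expected"
```

For example, `classify(13.743)` labels the LeuLeu value from the table as overrepresented, while `classify(0.269)` labels AlaTrp as underrepresented.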

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
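As a small illustration of the unit conversion, the helpers below turn a per-mille value into a fraction or a percentage. The function names are hypothetical and chosen only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1% = 10 per mille)."""
    return value / 10.0


# LeuLeu is listed at 13.743 per mille in the table.
leu_leu_fraction = per_mille_to_fraction(13.743)
leu_leu_percent = per_mille_to_percent(13.743)
```

With these helpers, the LeuLeu value of 13.743‰ corresponds to a fraction of about 0.0137, or about 1.37%.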

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.425AlaAla: 2.425 ± 0.899
1.617AlaCys: 1.617 ± 0.787
6.198AlaAsp: 6.198 ± 1.059
1.617AlaGlu: 1.617 ± 0.371
3.234AlaPhe: 3.234 ± 0.746
2.425AlaGly: 2.425 ± 0.483
1.617AlaHis: 1.617 ± 0.599
4.312AlaIle: 4.312 ± 1.217
2.695AlaLys: 2.695 ± 0.665
5.659AlaLeu: 5.659 ± 0.99
1.078AlaMet: 1.078 ± 0.221
0.808AlaAsn: 0.808 ± 0.725
1.078AlaPro: 1.078 ± 0.53
2.425AlaGln: 2.425 ± 0.899
2.695AlaArg: 2.695 ± 0.446
3.773AlaSer: 3.773 ± 2.015
4.042AlaThr: 4.042 ± 0.646
4.581AlaVal: 4.581 ± 0.757
0.269AlaTrp: 0.269 ± 0.649
2.964AlaTyr: 2.964 ± 0.1
0.0AlaXaa: 0.0 ± 0.0
Cys
1.078CysAla: 1.078 ± 0.524
0.539CysCys: 0.539 ± 0.262
0.808CysAsp: 0.808 ± 0.237
1.078CysGlu: 1.078 ± 0.524
1.886CysPhe: 1.886 ± 0.439
2.964CysGly: 2.964 ± 1.442
0.0CysHis: 0.0 ± 0.0
0.808CysIle: 0.808 ± 0.393
0.808CysLys: 0.808 ± 0.393
2.156CysLeu: 2.156 ± 0.604
0.808CysMet: 0.808 ± 0.237
1.347CysAsn: 1.347 ± 0.539
1.886CysPro: 1.886 ± 0.484
0.0CysGln: 0.0 ± 0.0
1.347CysArg: 1.347 ± 0.539
0.808CysSer: 0.808 ± 0.237
0.539CysThr: 0.539 ± 0.262
2.156CysVal: 2.156 ± 0.443
0.0CysTrp: 0.0 ± 0.0
0.808CysTyr: 0.808 ± 0.393
0.0CysXaa: 0.0 ± 0.0
Asp
3.773AspAla: 3.773 ± 0.899
1.886AspCys: 1.886 ± 0.439
5.659AspAsp: 5.659 ± 1.451
2.964AspGlu: 2.964 ± 1.076
4.042AspPhe: 4.042 ± 1.185
4.581AspGly: 4.581 ± 1.011
1.617AspHis: 1.617 ± 0.697
4.581AspIle: 4.581 ± 1.011
4.581AspLys: 4.581 ± 1.757
6.198AspLeu: 6.198 ± 1.059
1.886AspMet: 1.886 ± 0.918
2.964AspAsn: 2.964 ± 1.442
1.886AspPro: 1.886 ± 0.568
1.617AspGln: 1.617 ± 0.371
6.198AspArg: 6.198 ± 1.69
5.389AspSer: 5.389 ± 1.341
2.695AspThr: 2.695 ± 0.553
8.084AspVal: 8.084 ± 1.145
0.808AspTrp: 0.808 ± 0.237
3.234AspTyr: 3.234 ± 1.875
0.0AspXaa: 0.0 ± 0.0
Glu
2.156GluAla: 2.156 ± 0.758
0.808GluCys: 0.808 ± 0.393
3.503GluAsp: 3.503 ± 0.699
2.425GluGlu: 2.425 ± 0.728
4.312GluPhe: 4.312 ± 1.627
3.773GluGly: 3.773 ± 0.37
1.078GluHis: 1.078 ± 0.53
2.964GluIle: 2.964 ± 0.547
4.312GluLys: 4.312 ± 0.183
4.581GluLeu: 4.581 ± 1.331
0.808GluMet: 0.808 ± 0.541
3.773GluAsn: 3.773 ± 0.752
0.808GluPro: 0.808 ± 0.393
1.078GluGln: 1.078 ± 0.221
2.695GluArg: 2.695 ± 0.191
3.503GluSer: 3.503 ± 0.905
2.425GluThr: 2.425 ± 0.312
2.964GluVal: 2.964 ± 0.547
0.0GluTrp: 0.0 ± 0.0
2.425GluTyr: 2.425 ± 0.364
0.0GluXaa: 0.0 ± 0.0
Phe
3.503PheAla: 3.503 ± 0.682
2.964PheCys: 2.964 ± 0.981
4.042PheAsp: 4.042 ± 1.185
3.234PheGlu: 3.234 ± 0.948
3.234PhePhe: 3.234 ± 1.395
4.312PheGly: 4.312 ± 2.499
0.269PheHis: 0.269 ± 0.131
3.234PheIle: 3.234 ± 0.659
3.503PheLys: 3.503 ± 0.778
6.198PheLeu: 6.198 ± 2.92
1.347PheMet: 1.347 ± 1.11
2.964PheAsn: 2.964 ± 0.1
2.156PhePro: 2.156 ± 0.443
2.156PheGln: 2.156 ± 0.772
2.695PheArg: 2.695 ± 1.562
8.623PheSer: 8.623 ± 0.727
3.503PheThr: 3.503 ± 0.682
7.276PheVal: 7.276 ± 2.899
0.808PheTrp: 0.808 ± 1.089
2.695PheTyr: 2.695 ± 2.267
0.0PheXaa: 0.0 ± 0.0
Gly
2.156GlyAla: 2.156 ± 0.772
1.617GlyCys: 1.617 ± 0.371
3.503GlyAsp: 3.503 ± 1.238
2.156GlyGlu: 2.156 ± 0.758
3.503GlyPhe: 3.503 ± 1.265
2.695GlyGly: 2.695 ± 0.446
0.539GlyHis: 0.539 ± 0.262
2.964GlyIle: 2.964 ± 0.825
3.503GlyLys: 3.503 ± 1.704
3.234GlyLeu: 3.234 ± 0.41
1.347GlyMet: 1.347 ± 0.656
1.617GlyAsn: 1.617 ± 0.787
1.078GlyPro: 1.078 ± 0.53
0.539GlyGln: 0.539 ± 0.793
2.425GlyArg: 2.425 ± 0.857
2.695GlySer: 2.695 ± 0.553
0.808GlyThr: 0.808 ± 0.675
3.773GlyVal: 3.773 ± 0.755
0.539GlyTrp: 0.539 ± 0.312
2.425GlyTyr: 2.425 ± 0.312
0.0GlyXaa: 0.0 ± 0.0
His
1.617HisAla: 1.617 ± 0.373
0.539HisCys: 0.539 ± 0.262
1.078HisAsp: 1.078 ± 0.524
0.808HisGlu: 0.808 ± 0.237
1.617HisPhe: 1.617 ± 0.697
0.0HisGly: 0.0 ± 0.0
0.539HisHis: 0.539 ± 0.312
0.539HisIle: 0.539 ± 0.833
1.886HisLys: 1.886 ± 1.235
1.347HisLeu: 1.347 ± 0.539
0.269HisMet: 0.269 ± 0.131
0.269HisAsn: 0.269 ± 0.131
0.0HisPro: 0.0 ± 0.0
1.347HisGln: 1.347 ± 1.922
1.078HisArg: 1.078 ± 0.524
1.347HisSer: 1.347 ± 0.276
0.808HisThr: 0.808 ± 0.393
1.886HisVal: 1.886 ± 0.918
0.0HisTrp: 0.0 ± 0.0
2.156HisTyr: 2.156 ± 0.439
0.0HisXaa: 0.0 ± 0.0
Ile
2.425IleAla: 2.425 ± 0.857
1.078IleCys: 1.078 ± 0.625
4.042IleAsp: 4.042 ± 0.829
3.234IleGlu: 3.234 ± 1.109
4.042IlePhe: 4.042 ± 0.962
2.156IleGly: 2.156 ± 0.443
1.078IleHis: 1.078 ± 0.221
2.695IleIle: 2.695 ± 1.189
3.234IleLys: 3.234 ± 0.743
7.815IleLeu: 7.815 ± 2.142
1.886IleMet: 1.886 ± 0.671
1.617IleAsn: 1.617 ± 0.474
2.964IlePro: 2.964 ± 1.006
2.156IleGln: 2.156 ± 0.316
1.347IleArg: 1.347 ± 0.539
4.581IleSer: 4.581 ± 0.471
3.234IleThr: 3.234 ± 0.659
4.85IleVal: 4.85 ± 1.159
0.539IleTrp: 0.539 ± 0.262
2.156IleTyr: 2.156 ± 0.758
0.0IleXaa: 0.0 ± 0.0
Lys
3.234LysAla: 3.234 ± 1.109
0.0LysCys: 0.0 ± 0.0
3.503LysAsp: 3.503 ± 1.238
2.964LysGlu: 2.964 ± 0.981
5.659LysPhe: 5.659 ± 1.007
1.347LysGly: 1.347 ± 0.656
0.808LysHis: 0.808 ± 0.393
4.581LysIle: 4.581 ± 0.408
2.695LysLys: 2.695 ± 0.854
4.581LysLeu: 4.581 ± 1.802
2.156LysMet: 2.156 ± 1.049
4.85LysAsn: 4.85 ± 1.455
2.425LysPro: 2.425 ± 2.279
1.347LysGln: 1.347 ± 0.656
4.042LysArg: 4.042 ± 0.125
2.964LysSer: 2.964 ± 1.525
2.964LysThr: 2.964 ± 0.981
5.928LysVal: 5.928 ± 2.885
0.269LysTrp: 0.269 ± 0.131
3.773LysTyr: 3.773 ± 0.64
0.0LysXaa: 0.0 ± 0.0
Leu
6.198LeuAla: 6.198 ± 1.575
1.617LeuCys: 1.617 ± 0.371
5.659LeuAsp: 5.659 ± 1.318
5.12LeuGlu: 5.12 ± 0.716
5.389LeuPhe: 5.389 ± 2.865
2.964LeuGly: 2.964 ± 0.842
1.617LeuHis: 1.617 ± 0.373
5.928LeuIle: 5.928 ± 1.073
4.581LeuLys: 4.581 ± 0.669
13.743LeuLeu: 13.743 ± 7.251
3.234LeuMet: 3.234 ± 1.198
5.12LeuAsn: 5.12 ± 0.345
2.964LeuPro: 2.964 ± 2.196
2.425LeuGln: 2.425 ± 0.899
7.276LeuArg: 7.276 ± 0.103
7.815LeuSer: 7.815 ± 1.216
5.389LeuThr: 5.389 ± 1.852
7.276LeuVal: 7.276 ± 1.779
0.808LeuTrp: 0.808 ± 0.237
3.234LeuTyr: 3.234 ± 0.743
0.0LeuXaa: 0.0 ± 0.0
Met
2.425MetAla: 2.425 ± 1.624
0.808MetCys: 0.808 ± 0.393
1.886MetAsp: 1.886 ± 0.918
1.617MetGlu: 1.617 ± 1.082
1.347MetPhe: 1.347 ± 0.55
0.808MetGly: 0.808 ± 0.237
0.808MetHis: 0.808 ± 0.541
1.347MetIle: 1.347 ± 0.276
1.886MetLys: 1.886 ± 0.918
2.425MetLeu: 2.425 ± 1.072
0.539MetMet: 0.539 ± 0.262
0.269MetAsn: 0.269 ± 0.131
0.539MetPro: 0.539 ± 0.262
0.0MetGln: 0.0 ± 0.0
1.886MetArg: 1.886 ± 0.32
2.156MetSer: 2.156 ± 0.443
1.886MetThr: 1.886 ± 0.918
1.886MetVal: 1.886 ± 0.671
0.269MetTrp: 0.269 ± 0.131
1.078MetTyr: 1.078 ± 0.524
0.0MetXaa: 0.0 ± 0.0
Asn
3.234AsnAla: 3.234 ± 0.743
0.539AsnCys: 0.539 ± 0.262
3.503AsnAsp: 3.503 ± 0.853
2.964AsnGlu: 2.964 ± 0.842
3.234AsnPhe: 3.234 ± 0.664
1.347AsnGly: 1.347 ± 0.458
1.347AsnHis: 1.347 ± 1.035
1.886AsnIle: 1.886 ± 1.021
2.695AsnLys: 2.695 ± 0.854
2.695AsnLeu: 2.695 ± 0.665
1.078AsnMet: 1.078 ± 0.53
1.617AsnAsn: 1.617 ± 0.371
0.808AsnPro: 0.808 ± 0.393
0.808AsnGln: 0.808 ± 0.237
2.695AsnArg: 2.695 ± 0.553
1.617AsnSer: 1.617 ± 0.373
1.886AsnThr: 1.886 ± 0.32
4.85AsnVal: 4.85 ± 1.422
0.539AsnTrp: 0.539 ± 0.262
1.617AsnTyr: 1.617 ± 0.371
0.0AsnXaa: 0.0 ± 0.0
Pro
2.156ProAla: 2.156 ± 1.656
0.0ProCys: 0.0 ± 0.0
2.156ProAsp: 2.156 ± 1.059
1.617ProGlu: 1.617 ± 0.371
2.425ProPhe: 2.425 ± 1.159
1.347ProGly: 1.347 ± 0.276
0.0ProHis: 0.0 ± 0.0
1.347ProIle: 1.347 ± 0.55
1.078ProLys: 1.078 ± 0.53
4.042ProLeu: 4.042 ± 1.007
0.539ProMet: 0.539 ± 0.262
0.808ProAsn: 0.808 ± 1.226
0.539ProPro: 0.539 ± 0.793
0.539ProGln: 0.539 ± 0.312
1.886ProArg: 1.886 ± 0.671
2.695ProSer: 2.695 ± 1.648
1.078ProThr: 1.078 ± 0.53
2.964ProVal: 2.964 ± 0.641
0.269ProTrp: 0.269 ± 0.131
2.156ProTyr: 2.156 ± 0.604
0.0ProXaa: 0.0 ± 0.0
Gln
2.156GlnAla: 2.156 ± 1.477
0.539GlnCys: 0.539 ± 0.262
1.347GlnAsp: 1.347 ± 1.035
1.078GlnGlu: 1.078 ± 0.53
1.617GlnPhe: 1.617 ± 0.937
0.808GlnGly: 0.808 ± 0.393
0.539GlnHis: 0.539 ± 0.262
1.078GlnIle: 1.078 ± 0.625
1.078GlnLys: 1.078 ± 0.524
1.617GlnLeu: 1.617 ± 0.599
0.808GlnMet: 0.808 ± 0.393
0.0GlnAsn: 0.0 ± 0.0
0.269GlnPro: 0.269 ± 0.131
0.539GlnGln: 0.539 ± 0.583
1.886GlnArg: 1.886 ± 0.484
1.886GlnSer: 1.886 ± 1.021
1.617GlnThr: 1.617 ± 0.599
2.964GlnVal: 2.964 ± 2.143
0.0GlnTrp: 0.0 ± 0.0
1.617GlnTyr: 1.617 ± 0.787
0.0GlnXaa: 0.0 ± 0.0
Arg
2.695ArgAla: 2.695 ± 0.191
1.617ArgCys: 1.617 ± 0.371
4.85ArgAsp: 4.85 ± 1.403
3.773ArgGlu: 3.773 ± 1.367
2.156ArgPhe: 2.156 ± 0.772
1.347ArgGly: 1.347 ± 0.458
2.156ArgHis: 2.156 ± 0.953
3.503ArgIle: 3.503 ± 0.853
2.695ArgLys: 2.695 ± 0.553
5.659ArgLeu: 5.659 ± 1.137
2.156ArgMet: 2.156 ± 0.604
3.503ArgAsn: 3.503 ± 0.288
1.078ArgPro: 1.078 ± 0.958
0.808ArgGln: 0.808 ± 0.541
3.234ArgArg: 3.234 ± 1.109
4.042ArgSer: 4.042 ± 0.873
3.773ArgThr: 3.773 ± 1.341
5.12ArgVal: 5.12 ± 1.03
0.0ArgTrp: 0.0 ± 0.0
3.773ArgTyr: 3.773 ± 0.879
0.0ArgXaa: 0.0 ± 0.0
Ser
4.581SerAla: 4.581 ± 1.92
1.617SerCys: 1.617 ± 0.787
6.467SerAsp: 6.467 ± 0.777
3.503SerGlu: 3.503 ± 2.237
5.389SerPhe: 5.389 ± 4.318
2.156SerGly: 2.156 ± 0.439
1.347SerHis: 1.347 ± 0.276
5.389SerIle: 5.389 ± 0.384
5.659SerLys: 5.659 ± 1.007
7.006SerLeu: 7.006 ± 0.779
1.347SerMet: 1.347 ± 0.308
4.042SerAsn: 4.042 ± 0.125
1.886SerPro: 1.886 ± 0.32
1.347SerGln: 1.347 ± 0.276
4.312SerArg: 4.312 ± 1.207
5.659SerSer: 5.659 ± 2.737
4.312SerThr: 4.312 ± 0.491
6.737SerVal: 6.737 ± 0.696
0.269SerTrp: 0.269 ± 0.131
3.773SerTyr: 3.773 ± 0.879
0.0SerXaa: 0.0 ± 0.0
Thr
3.773ThrAla: 3.773 ± 0.64
0.539ThrCys: 0.539 ± 0.262
2.425ThrAsp: 2.425 ± 0.728
3.773ThrGlu: 3.773 ± 0.64
6.467ThrPhe: 6.467 ± 1.774
2.425ThrGly: 2.425 ± 0.857
0.808ThrHis: 0.808 ± 0.725
3.234ThrIle: 3.234 ± 0.133
2.964ThrLys: 2.964 ± 1.442
6.198ThrLeu: 6.198 ± 0.879
0.539ThrMet: 0.539 ± 0.583
1.347ThrAsn: 1.347 ± 0.276
1.078ThrPro: 1.078 ± 0.958
0.808ThrGln: 0.808 ± 0.393
1.886ThrArg: 1.886 ± 0.671
2.964ThrSer: 2.964 ± 0.641
1.617ThrThr: 1.617 ± 0.697
3.503ThrVal: 3.503 ± 0.866
0.269ThrTrp: 0.269 ± 0.131
3.234ThrTyr: 3.234 ± 0.664
0.0ThrXaa: 0.0 ± 0.0
Val
3.773ValAla: 3.773 ± 0.64
2.156ValCys: 2.156 ± 1.25
9.701ValAsp: 9.701 ± 2.185
5.12ValGlu: 5.12 ± 2.017
5.389ValPhe: 5.389 ± 1.852
3.503ValGly: 3.503 ± 1.265
1.347ValHis: 1.347 ± 0.276
2.425ValIle: 2.425 ± 0.728
5.928ValLys: 5.928 ± 2.151
8.084ValLeu: 8.084 ± 4.085
2.425ValMet: 2.425 ± 0.857
2.695ValAsn: 2.695 ± 0.67
4.85ValPro: 4.85 ± 1.204
2.156ValGln: 2.156 ± 0.604
4.581ValArg: 4.581 ± 0.408
7.815ValSer: 7.815 ± 2.044
4.85ValThr: 4.85 ± 1.422
7.545ValVal: 7.545 ± 2.195
0.808ValTrp: 0.808 ± 0.393
4.312ValTyr: 4.312 ± 0.915
0.0ValXaa: 0.0 ± 0.0
Trp
0.808TrpAla: 0.808 ± 0.237
0.269TrpCys: 0.269 ± 0.131
0.269TrpAsp: 0.269 ± 0.131
0.0TrpGlu: 0.0 ± 0.0
0.539TrpPhe: 0.539 ± 0.262
0.808TrpGly: 0.808 ± 0.541
0.0TrpHis: 0.0 ± 0.0
0.269TrpIle: 0.269 ± 0.131
1.078TrpLys: 1.078 ± 0.221
1.347TrpLeu: 1.347 ± 0.458
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.269TrpGln: 0.269 ± 0.416
0.0TrpArg: 0.0 ± 0.0
0.808TrpSer: 0.808 ± 0.237
0.0TrpThr: 0.0 ± 0.0
0.269TrpVal: 0.269 ± 0.131
0.0TrpTrp: 0.0 ± 0.0
0.269TrpTyr: 0.269 ± 0.416
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.347TyrAla: 1.347 ± 0.656
1.078TyrCys: 1.078 ± 0.221
4.042TyrAsp: 4.042 ± 0.498
1.347TyrGlu: 1.347 ± 0.276
3.503TyrPhe: 3.503 ± 0.853
1.347TyrGly: 1.347 ± 0.55
1.617TyrHis: 1.617 ± 0.697
4.042TyrIle: 4.042 ± 1.023
3.234TyrLys: 3.234 ± 0.659
3.773TyrLeu: 3.773 ± 1.235
1.347TyrMet: 1.347 ± 0.55
1.347TyrAsn: 1.347 ± 0.828
1.347TyrPro: 1.347 ± 0.276
1.078TyrGln: 1.078 ± 0.625
3.773TyrArg: 3.773 ± 0.879
5.659TyrSer: 5.659 ± 0.59
2.425TyrThr: 2.425 ± 0.711
4.85TyrVal: 4.85 ± 0.804
0.539TyrTrp: 0.539 ± 0.312
2.425TyrTyr: 2.425 ± 0.483
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3712 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
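A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration rather than the database's actual procedure: the function names are hypothetical, whole proteins are resampled with replacement, and the reported error is taken to be the standard deviation of the replicate estimates, which the note above does not specify.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping pairs (assumed counting scheme)."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.stdev(estimates)
```

With only 3 proteins in this proteome, each replicate draws 3 proteins with replacement, so the spread of the estimates mainly reflects how much the proteins differ from one another.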

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski