Amino acid dipepetide frequency for Xinzhou nematode virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.522AlaAla: 2.522 ± 0.173
1.801AlaCys: 1.801 ± 0.357
3.963AlaAsp: 3.963 ± 0.35
4.323AlaGlu: 4.323 ± 1.074
1.801AlaPhe: 1.801 ± 0.357
6.124AlaGly: 6.124 ± 0.343
0.36AlaHis: 0.36 ± 0.364
3.963AlaIle: 3.963 ± 0.894
2.882AlaLys: 2.882 ± 0.897
5.403AlaLeu: 5.403 ± 1.614
2.522AlaMet: 2.522 ± 0.717
3.963AlaAsn: 3.963 ± 1.825
5.403AlaPro: 5.403 ± 0.561
2.161AlaGln: 2.161 ± 0.537
4.323AlaArg: 4.323 ± 0.014
2.882AlaSer: 2.882 ± 1.278
6.124AlaThr: 6.124 ± 1.832
5.764AlaVal: 5.764 ± 2.338
1.801AlaTrp: 1.801 ± 0.357
1.081AlaTyr: 1.081 ± 0.003
0.0AlaXaa: 0.0 ± 0.0
Cys
1.801CysAla: 1.801 ± 0.357
0.36CysCys: 0.36 ± 0.18
1.081CysAsp: 1.081 ± 0.54
0.72CysGlu: 0.72 ± 0.36
0.72CysPhe: 0.72 ± 0.36
0.72CysGly: 0.72 ± 0.36
0.36CysHis: 0.36 ± 0.18
0.36CysIle: 0.36 ± 0.364
0.36CysLys: 0.36 ± 0.18
1.801CysLeu: 1.801 ± 0.357
0.36CysMet: 0.36 ± 0.18
0.36CysAsn: 0.36 ± 0.18
1.081CysPro: 1.081 ± 0.547
1.441CysGln: 1.441 ± 0.367
1.441CysArg: 1.441 ± 0.721
0.72CysSer: 0.72 ± 0.36
0.36CysThr: 0.36 ± 0.18
1.801CysVal: 1.801 ± 0.357
0.36CysTrp: 0.36 ± 0.18
1.801CysTyr: 1.801 ± 0.357
0.0CysXaa: 0.0 ± 0.0
Asp
2.882AspAla: 2.882 ± 0.897
0.72AspCys: 0.72 ± 0.36
3.242AspAsp: 3.242 ± 0.534
3.242AspGlu: 3.242 ± 1.077
3.242AspPhe: 3.242 ± 1.077
1.801AspGly: 1.801 ± 0.187
0.72AspHis: 0.72 ± 0.184
3.963AspIle: 3.963 ± 0.738
1.801AspLys: 1.801 ± 0.357
6.124AspLeu: 6.124 ± 0.745
1.081AspMet: 1.081 ± 0.54
3.602AspAsn: 3.602 ± 0.17
3.602AspPro: 3.602 ± 1.462
1.801AspGln: 1.801 ± 0.187
1.801AspArg: 1.801 ± 0.901
3.963AspSer: 3.963 ± 0.35
1.441AspThr: 1.441 ± 0.721
4.683AspVal: 4.683 ± 0.921
0.72AspTrp: 0.72 ± 0.36
3.242AspTyr: 3.242 ± 1.077
0.0AspXaa: 0.0 ± 0.0
Glu
3.963GluAla: 3.963 ± 1.438
0.36GluCys: 0.36 ± 0.364
2.161GluAsp: 2.161 ± 0.007
3.963GluGlu: 3.963 ± 0.894
2.161GluPhe: 2.161 ± 0.537
2.161GluGly: 2.161 ± 1.081
1.801GluHis: 1.801 ± 0.357
3.602GluIle: 3.602 ± 2.005
2.161GluLys: 2.161 ± 1.081
3.963GluLeu: 3.963 ± 0.35
1.801GluMet: 1.801 ± 0.901
2.522GluAsn: 2.522 ± 0.173
1.801GluPro: 1.801 ± 0.187
1.441GluGln: 1.441 ± 0.367
3.242GluArg: 3.242 ± 1.077
5.043GluSer: 5.043 ± 2.522
5.043GluThr: 5.043 ± 1.285
6.844GluVal: 6.844 ± 0.384
0.72GluTrp: 0.72 ± 0.184
4.323GluTyr: 4.323 ± 1.074
0.0GluXaa: 0.0 ± 0.0
Phe
3.242PheAla: 3.242 ± 0.534
0.72PheCys: 0.72 ± 0.184
3.602PheAsp: 3.602 ± 0.714
2.522PheGlu: 2.522 ± 0.914
2.161PhePhe: 2.161 ± 0.007
2.882PheGly: 2.882 ± 0.897
2.161PheHis: 2.161 ± 0.551
4.323PheIle: 4.323 ± 1.618
2.522PheLys: 2.522 ± 0.717
6.124PheLeu: 6.124 ± 0.887
1.441PheMet: 1.441 ± 0.177
1.441PheAsn: 1.441 ± 0.721
0.72PhePro: 0.72 ± 0.184
1.801PheGln: 1.801 ± 0.731
3.602PheArg: 3.602 ± 0.17
2.882PheSer: 2.882 ± 1.278
0.72PheThr: 0.72 ± 0.36
3.963PheVal: 3.963 ± 0.894
1.081PheTrp: 1.081 ± 0.003
2.522PheTyr: 2.522 ± 0.173
0.0PheXaa: 0.0 ± 0.0
Gly
2.882GlyAla: 2.882 ± 0.19
1.081GlyCys: 1.081 ± 0.003
3.242GlyAsp: 3.242 ± 0.554
2.882GlyGlu: 2.882 ± 0.19
2.161GlyPhe: 2.161 ± 0.537
1.441GlyGly: 1.441 ± 0.367
0.72GlyHis: 0.72 ± 0.36
2.161GlyIle: 2.161 ± 0.007
3.602GlyLys: 3.602 ± 0.17
2.161GlyLeu: 2.161 ± 1.094
2.161GlyMet: 2.161 ± 0.007
2.161GlyAsn: 2.161 ± 0.007
2.882GlyPro: 2.882 ± 0.353
1.801GlyGln: 1.801 ± 0.187
1.441GlyArg: 1.441 ± 0.177
2.522GlySer: 2.522 ± 0.914
3.963GlyThr: 3.963 ± 0.894
3.602GlyVal: 3.602 ± 0.17
0.72GlyTrp: 0.72 ± 0.184
3.602GlyTyr: 3.602 ± 1.462
0.0GlyXaa: 0.0 ± 0.0
His
0.72HisAla: 0.72 ± 0.184
1.081HisCys: 1.081 ± 0.003
0.36HisAsp: 0.36 ± 0.18
0.36HisGlu: 0.36 ± 0.18
2.161HisPhe: 2.161 ± 0.537
1.801HisGly: 1.801 ± 0.357
0.72HisHis: 0.72 ± 0.36
1.441HisIle: 1.441 ± 0.721
0.0HisLys: 0.0 ± 0.0
2.522HisLeu: 2.522 ± 0.371
1.081HisMet: 1.081 ± 0.003
1.081HisAsn: 1.081 ± 0.003
0.36HisPro: 0.36 ± 0.364
1.801HisGln: 1.801 ± 0.187
1.081HisArg: 1.081 ± 0.003
2.161HisSer: 2.161 ± 0.007
1.441HisThr: 1.441 ± 0.177
2.522HisVal: 2.522 ± 0.173
0.0HisTrp: 0.0 ± 0.0
2.161HisTyr: 2.161 ± 0.537
0.0HisXaa: 0.0 ± 0.0
Ile
6.124IleAla: 6.124 ± 1.431
2.882IleCys: 2.882 ± 1.441
3.242IleAsp: 3.242 ± 0.534
2.161IleGlu: 2.161 ± 0.537
1.441IlePhe: 1.441 ± 0.721
3.242IleGly: 3.242 ± 2.186
2.161IleHis: 2.161 ± 0.007
2.161IleIle: 2.161 ± 0.537
2.161IleLys: 2.161 ± 1.081
4.323IleLeu: 4.323 ± 0.558
2.522IleMet: 2.522 ± 0.173
2.882IleAsn: 2.882 ± 1.441
3.963IlePro: 3.963 ± 0.738
2.522IleGln: 2.522 ± 0.914
5.043IleArg: 5.043 ± 0.347
6.484IleSer: 6.484 ± 1.652
2.882IleThr: 2.882 ± 0.19
6.844IleVal: 6.844 ± 1.472
1.441IleTrp: 1.441 ± 0.177
0.72IleTyr: 0.72 ± 0.184
0.0IleXaa: 0.0 ± 0.0
Lys
2.161LysAla: 2.161 ± 1.081
1.441LysCys: 1.441 ± 0.177
1.441LysAsp: 1.441 ± 0.177
2.882LysGlu: 2.882 ± 0.897
1.801LysPhe: 1.801 ± 0.901
1.801LysGly: 1.801 ± 0.901
1.801LysHis: 1.801 ± 0.901
5.764LysIle: 5.764 ± 0.163
1.801LysLys: 1.801 ± 0.357
3.602LysLeu: 3.602 ± 0.374
0.72LysMet: 0.72 ± 0.36
1.081LysAsn: 1.081 ± 0.54
1.081LysPro: 1.081 ± 0.547
1.801LysGln: 1.801 ± 0.357
1.801LysArg: 1.801 ± 0.901
1.801LysSer: 1.801 ± 0.187
2.882LysThr: 2.882 ± 0.353
2.161LysVal: 2.161 ± 1.094
0.72LysTrp: 0.72 ± 0.184
2.161LysTyr: 2.161 ± 1.081
0.0LysXaa: 0.0 ± 0.0
Leu
6.844LeuAla: 6.844 ± 0.384
0.36LeuCys: 0.36 ± 0.18
3.602LeuAsp: 3.602 ± 1.801
5.043LeuGlu: 5.043 ± 0.197
5.764LeuPhe: 5.764 ± 0.163
4.683LeuGly: 4.683 ± 0.921
1.441LeuHis: 1.441 ± 0.177
3.602LeuIle: 3.602 ± 1.462
3.963LeuLys: 3.963 ± 0.194
7.925LeuLeu: 7.925 ± 1.475
2.882LeuMet: 2.882 ± 0.353
3.242LeuAsn: 3.242 ± 0.554
4.323LeuPro: 4.323 ± 2.189
3.963LeuGln: 3.963 ± 0.194
3.242LeuArg: 3.242 ± 1.621
6.844LeuSer: 6.844 ± 1.472
5.403LeuThr: 5.403 ± 0.017
3.963LeuVal: 3.963 ± 1.438
0.72LeuTrp: 0.72 ± 0.184
2.882LeuTyr: 2.882 ± 0.19
0.0LeuXaa: 0.0 ± 0.0
Met
3.602MetAla: 3.602 ± 0.374
0.0MetCys: 0.0 ± 0.0
1.801MetAsp: 1.801 ± 0.901
2.522MetGlu: 2.522 ± 0.173
0.0MetPhe: 0.0 ± 0.0
1.081MetGly: 1.081 ± 0.003
0.36MetHis: 0.36 ± 0.18
0.36MetIle: 0.36 ± 0.18
2.161MetLys: 2.161 ± 0.537
2.161MetLeu: 2.161 ± 1.081
0.72MetMet: 0.72 ± 0.727
0.72MetAsn: 0.72 ± 0.184
2.161MetPro: 2.161 ± 0.007
3.602MetGln: 3.602 ± 0.714
0.72MetArg: 0.72 ± 0.36
2.522MetSer: 2.522 ± 0.173
2.882MetThr: 2.882 ± 0.19
1.441MetVal: 1.441 ± 0.911
0.36MetTrp: 0.36 ± 0.18
0.36MetTyr: 0.36 ± 0.364
0.0MetXaa: 0.0 ± 0.0
Asn
2.522AsnAla: 2.522 ± 0.371
1.081AsnCys: 1.081 ± 0.003
2.161AsnAsp: 2.161 ± 0.007
2.882AsnGlu: 2.882 ± 0.353
2.882AsnPhe: 2.882 ± 0.353
1.801AsnGly: 1.801 ± 1.818
1.441AsnHis: 1.441 ± 0.177
4.683AsnIle: 4.683 ± 1.254
2.522AsnLys: 2.522 ± 1.261
2.882AsnLeu: 2.882 ± 0.353
2.161AsnMet: 2.161 ± 0.537
1.801AsnAsn: 1.801 ± 0.357
2.882AsnPro: 2.882 ± 0.353
2.161AsnGln: 2.161 ± 0.007
2.522AsnArg: 2.522 ± 0.371
2.522AsnSer: 2.522 ± 0.173
4.323AsnThr: 4.323 ± 2.189
2.161AsnVal: 2.161 ± 1.094
0.36AsnTrp: 0.36 ± 0.18
1.081AsnTyr: 1.081 ± 0.003
0.0AsnXaa: 0.0 ± 0.0
Pro
2.161ProAla: 2.161 ± 0.551
0.0ProCys: 0.0 ± 0.0
2.161ProAsp: 2.161 ± 0.551
1.801ProGlu: 1.801 ± 0.357
3.963ProPhe: 3.963 ± 0.35
1.441ProGly: 1.441 ± 0.177
2.161ProHis: 2.161 ± 1.094
3.963ProIle: 3.963 ± 0.738
1.081ProLys: 1.081 ± 0.003
3.602ProLeu: 3.602 ± 0.374
1.441ProMet: 1.441 ± 0.225
1.801ProAsn: 1.801 ± 0.731
4.323ProPro: 4.323 ± 3.277
0.72ProGln: 0.72 ± 0.727
1.441ProArg: 1.441 ± 0.367
5.043ProSer: 5.043 ± 1.285
4.683ProThr: 4.683 ± 1.465
3.602ProVal: 3.602 ± 0.17
0.72ProTrp: 0.72 ± 0.184
3.602ProTyr: 3.602 ± 0.918
0.0ProXaa: 0.0 ± 0.0
Gln
3.602GlnAla: 3.602 ± 1.462
0.72GlnCys: 0.72 ± 0.36
2.161GlnAsp: 2.161 ± 1.094
4.323GlnGlu: 4.323 ± 0.53
2.161GlnPhe: 2.161 ± 0.007
1.081GlnGly: 1.081 ± 0.54
1.441GlnHis: 1.441 ± 0.721
2.161GlnIle: 2.161 ± 0.537
1.441GlnLys: 1.441 ± 0.721
2.522GlnLeu: 2.522 ± 1.458
1.801GlnMet: 1.801 ± 0.901
1.441GlnAsn: 1.441 ± 0.367
3.242GlnPro: 3.242 ± 1.098
2.161GlnGln: 2.161 ± 0.537
1.801GlnArg: 1.801 ± 0.187
2.522GlnSer: 2.522 ± 0.914
2.522GlnThr: 2.522 ± 1.458
4.323GlnVal: 4.323 ± 1.645
1.081GlnTrp: 1.081 ± 0.54
0.36GlnTyr: 0.36 ± 0.364
0.0GlnXaa: 0.0 ± 0.0
Arg
6.124ArgAla: 6.124 ± 0.887
1.081ArgCys: 1.081 ± 0.54
3.242ArgAsp: 3.242 ± 0.554
1.801ArgGlu: 1.801 ± 0.901
3.602ArgPhe: 3.602 ± 2.549
1.801ArgGly: 1.801 ± 0.187
1.081ArgHis: 1.081 ± 0.54
3.963ArgIle: 3.963 ± 0.194
3.242ArgLys: 3.242 ± 1.077
2.882ArgLeu: 2.882 ± 1.441
1.441ArgMet: 1.441 ± 0.177
2.522ArgAsn: 2.522 ± 0.717
1.081ArgPro: 1.081 ± 0.54
1.801ArgGln: 1.801 ± 0.357
3.963ArgArg: 3.963 ± 1.438
5.043ArgSer: 5.043 ± 2.522
1.441ArgThr: 1.441 ± 0.177
2.522ArgVal: 2.522 ± 0.173
1.081ArgTrp: 1.081 ± 0.003
1.441ArgTyr: 1.441 ± 1.455
0.0ArgXaa: 0.0 ± 0.0
Ser
4.683SerAla: 4.683 ± 0.166
0.72SerCys: 0.72 ± 0.36
3.602SerAsp: 3.602 ± 0.714
3.242SerGlu: 3.242 ± 1.077
4.683SerPhe: 4.683 ± 1.254
3.242SerGly: 3.242 ± 0.534
0.72SerHis: 0.72 ± 0.184
6.124SerIle: 6.124 ± 0.745
4.683SerLys: 4.683 ± 2.009
6.484SerLeu: 6.484 ± 1.108
1.801SerMet: 1.801 ± 1.195
4.323SerAsn: 4.323 ± 1.101
2.522SerPro: 2.522 ± 1.458
3.242SerGln: 3.242 ± 0.554
2.161SerArg: 2.161 ± 0.551
6.844SerSer: 6.844 ± 1.472
4.323SerThr: 4.323 ± 1.101
5.764SerVal: 5.764 ± 0.163
1.081SerTrp: 1.081 ± 0.54
2.882SerTyr: 2.882 ± 0.734
0.0SerXaa: 0.0 ± 0.0
Thr
5.043ThrAla: 5.043 ± 0.741
1.081ThrCys: 1.081 ± 0.54
2.882ThrAsp: 2.882 ± 0.19
4.323ThrGlu: 4.323 ± 1.101
3.602ThrPhe: 3.602 ± 0.374
2.522ThrGly: 2.522 ± 0.371
2.522ThrHis: 2.522 ± 0.173
3.963ThrIle: 3.963 ± 0.194
0.72ThrLys: 0.72 ± 0.184
7.205ThrLeu: 7.205 ± 1.836
1.441ThrMet: 1.441 ± 0.911
3.242ThrAsn: 3.242 ± 0.534
3.602ThrPro: 3.602 ± 0.17
2.161ThrGln: 2.161 ± 0.007
2.522ThrArg: 2.522 ± 0.371
4.683ThrSer: 4.683 ± 0.921
5.403ThrThr: 5.403 ± 1.105
2.882ThrVal: 2.882 ± 0.353
1.441ThrTrp: 1.441 ± 0.177
3.963ThrTyr: 3.963 ± 0.738
0.0ThrXaa: 0.0 ± 0.0
Val
4.683ValAla: 4.683 ± 0.71
0.72ValCys: 0.72 ± 0.36
4.683ValAsp: 4.683 ± 0.377
7.205ValGlu: 7.205 ± 0.204
3.602ValPhe: 3.602 ± 1.462
4.323ValGly: 4.323 ± 1.101
1.441ValHis: 1.441 ± 0.177
5.764ValIle: 5.764 ± 1.794
2.522ValLys: 2.522 ± 0.717
5.043ValLeu: 5.043 ± 0.197
1.081ValMet: 1.081 ± 0.003
3.963ValAsn: 3.963 ± 0.194
3.602ValPro: 3.602 ± 0.374
2.161ValGln: 2.161 ± 1.638
3.602ValArg: 3.602 ± 0.714
4.683ValSer: 4.683 ± 0.921
4.683ValThr: 4.683 ± 0.71
4.683ValVal: 4.683 ± 0.377
1.441ValTrp: 1.441 ± 0.367
2.882ValTyr: 2.882 ± 0.19
0.0ValXaa: 0.0 ± 0.0
Trp
1.081TrpAla: 1.081 ± 0.003
0.72TrpCys: 0.72 ± 0.184
1.441TrpAsp: 1.441 ± 0.177
2.161TrpGlu: 2.161 ± 0.537
1.081TrpPhe: 1.081 ± 0.54
0.36TrpGly: 0.36 ± 0.364
0.0TrpHis: 0.0 ± 0.0
1.081TrpIle: 1.081 ± 0.003
0.36TrpLys: 0.36 ± 0.18
1.081TrpLeu: 1.081 ± 0.003
0.72TrpMet: 0.72 ± 0.184
1.441TrpAsn: 1.441 ± 0.177
0.0TrpPro: 0.0 ± 0.0
0.36TrpGln: 0.36 ± 0.18
0.72TrpArg: 0.72 ± 0.184
1.081TrpSer: 1.081 ± 0.54
1.441TrpThr: 1.441 ± 0.177
0.72TrpVal: 0.72 ± 0.184
0.0TrpTrp: 0.0 ± 0.0
0.36TrpTyr: 0.36 ± 0.18
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.522TyrAla: 2.522 ± 0.717
1.081TyrCys: 1.081 ± 0.003
3.602TyrAsp: 3.602 ± 0.714
0.72TyrGlu: 0.72 ± 0.36
1.441TyrPhe: 1.441 ± 0.721
2.882TyrGly: 2.882 ± 0.19
1.441TyrHis: 1.441 ± 0.177
1.801TyrIle: 1.801 ± 0.731
0.72TyrLys: 0.72 ± 0.184
2.882TyrLeu: 2.882 ± 0.19
0.0TyrMet: 0.0 ± 0.0
3.242TyrAsn: 3.242 ± 0.554
1.081TyrPro: 1.081 ± 0.003
3.963TyrGln: 3.963 ± 1.281
4.683TyrArg: 4.683 ± 0.166
3.242TyrSer: 3.242 ± 1.642
3.242TyrThr: 3.242 ± 0.01
2.522TyrVal: 2.522 ± 0.371
0.36TyrTrp: 0.36 ± 0.18
2.161TyrTyr: 2.161 ± 0.551
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2777 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski