Amino acid dipepetide frequency for Xingshan nematode virus 6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.315AlaAla: 10.315 ± 3.717
0.573AlaCys: 0.573 ± 0.373
2.292AlaAsp: 2.292 ± 1.491
2.865AlaGlu: 2.865 ± 1.11
2.865AlaPhe: 2.865 ± 1.11
7.45AlaGly: 7.45 ± 1.383
1.719AlaHis: 1.719 ± 1.033
2.292AlaIle: 2.292 ± 1.118
3.438AlaLys: 3.438 ± 2.058
8.023AlaLeu: 8.023 ± 0.377
3.438AlaMet: 3.438 ± 1.354
1.719AlaAsn: 1.719 ± 0.416
2.292AlaPro: 2.292 ± 1.099
1.719AlaGln: 1.719 ± 0.416
8.023AlaArg: 8.023 ± 1.325
9.742AlaSer: 9.742 ± 0.913
4.011AlaThr: 4.011 ± 1.845
4.011AlaVal: 4.011 ± 1.156
5.731AlaTrp: 5.731 ± 2.425
1.146AlaTyr: 1.146 ± 0.948
0.0AlaXaa: 0.0 ± 0.0
Cys
1.146CysAla: 1.146 ± 0.745
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.573CysGlu: 0.573 ± 0.474
0.573CysPhe: 0.573 ± 0.373
1.146CysGly: 1.146 ± 0.245
1.719CysHis: 1.719 ± 1.218
0.0CysIle: 0.0 ± 0.0
0.573CysLys: 0.573 ± 1.244
2.292CysLeu: 2.292 ± 1.118
1.719CysMet: 1.719 ± 1.218
0.573CysAsn: 0.573 ± 0.474
2.292CysPro: 2.292 ± 1.696
0.0CysGln: 0.0 ± 0.0
1.719CysArg: 1.719 ± 1.422
0.573CysSer: 0.573 ± 1.244
2.865CysThr: 2.865 ± 0.572
1.719CysVal: 1.719 ± 0.416
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.585AspAla: 4.585 ± 1.502
0.573AspCys: 0.573 ± 0.474
4.011AspAsp: 4.011 ± 0.776
2.865AspGlu: 2.865 ± 1.11
1.146AspPhe: 1.146 ± 0.745
4.585AspGly: 4.585 ± 2.236
0.573AspHis: 0.573 ± 0.474
2.292AspIle: 2.292 ± 0.49
1.146AspLys: 1.146 ± 0.245
3.438AspLeu: 3.438 ± 0.697
1.719AspMet: 1.719 ± 0.656
1.146AspAsn: 1.146 ± 0.245
5.158AspPro: 5.158 ± 1.248
2.865AspGln: 2.865 ± 0.87
2.292AspArg: 2.292 ± 0.49
2.865AspSer: 2.865 ± 1.11
0.0AspThr: 0.0 ± 0.0
4.585AspVal: 4.585 ± 2.215
1.719AspTrp: 1.719 ± 1.422
2.865AspTyr: 2.865 ± 0.572
0.0AspXaa: 0.0 ± 0.0
Glu
5.731GluAla: 5.731 ± 1.566
1.146GluCys: 1.146 ± 1.173
1.719GluAsp: 1.719 ± 0.656
2.292GluGlu: 2.292 ± 1.896
1.146GluPhe: 1.146 ± 0.245
3.438GluGly: 3.438 ± 1.476
1.719GluHis: 1.719 ± 0.656
1.146GluIle: 1.146 ± 0.245
2.865GluLys: 2.865 ± 1.11
6.877GluLeu: 6.877 ± 1.964
1.146GluMet: 1.146 ± 0.245
1.146GluAsn: 1.146 ± 0.745
0.573GluPro: 0.573 ± 0.474
0.573GluGln: 0.573 ± 0.373
4.585GluArg: 4.585 ± 0.98
2.292GluSer: 2.292 ± 0.751
2.865GluThr: 2.865 ± 1.591
2.865GluVal: 2.865 ± 1.11
2.292GluTrp: 2.292 ± 0.49
1.719GluTyr: 1.719 ± 0.416
0.0GluXaa: 0.0 ± 0.0
Phe
2.292PheAla: 2.292 ± 0.751
1.719PheCys: 1.719 ± 1.033
0.573PheAsp: 0.573 ± 0.474
0.573PheGlu: 0.573 ± 0.474
0.573PhePhe: 0.573 ± 0.373
2.292PheGly: 2.292 ± 1.118
0.573PheHis: 0.573 ± 0.373
0.573PheIle: 0.573 ± 0.474
0.573PheLys: 0.573 ± 0.474
0.573PheLeu: 0.573 ± 0.474
1.719PheMet: 1.719 ± 0.416
0.0PheAsn: 0.0 ± 0.0
0.573PhePro: 0.573 ± 0.474
0.573PheGln: 0.573 ± 0.474
1.719PheArg: 1.719 ± 0.656
1.719PheSer: 1.719 ± 1.033
2.292PheThr: 2.292 ± 0.751
1.146PheVal: 1.146 ± 1.173
0.0PheTrp: 0.0 ± 0.0
1.719PheTyr: 1.719 ± 0.416
0.0PheXaa: 0.0 ± 0.0
Gly
3.438GlyAla: 3.438 ± 0.697
2.292GlyCys: 2.292 ± 2.347
4.585GlyAsp: 4.585 ± 1.502
1.719GlyGlu: 1.719 ± 1.118
1.719GlyPhe: 1.719 ± 0.656
8.023GlyGly: 8.023 ± 1.966
1.719GlyHis: 1.719 ± 0.416
2.292GlyIle: 2.292 ± 1.367
4.011GlyLys: 4.011 ± 1.156
10.888GlyLeu: 10.888 ± 1.523
4.011GlyMet: 4.011 ± 0.266
4.011GlyAsn: 4.011 ± 1.395
2.865GlyPro: 2.865 ± 0.841
2.292GlyGln: 2.292 ± 0.49
5.158GlyArg: 5.158 ± 1.712
6.877GlySer: 6.877 ± 0.219
4.011GlyThr: 4.011 ± 2.08
7.45GlyVal: 7.45 ± 2.794
4.011GlyTrp: 4.011 ± 0.684
3.438GlyTyr: 3.438 ± 1.145
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.146HisGlu: 1.146 ± 0.745
0.573HisPhe: 0.573 ± 1.244
2.865HisGly: 2.865 ± 2.363
2.292HisHis: 2.292 ± 0.751
1.146HisIle: 1.146 ± 0.245
0.0HisLys: 0.0 ± 0.0
4.011HisLeu: 4.011 ± 1.096
0.573HisMet: 0.573 ± 0.373
0.573HisAsn: 0.573 ± 0.373
0.0HisPro: 0.0 ± 0.0
1.146HisGln: 1.146 ± 0.745
3.438HisArg: 3.438 ± 0.997
0.573HisSer: 0.573 ± 0.474
1.146HisThr: 1.146 ± 0.245
2.292HisVal: 2.292 ± 1.896
0.573HisTrp: 0.573 ± 0.373
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.292IleAla: 2.292 ± 1.118
0.573IleCys: 0.573 ± 0.474
2.292IleAsp: 2.292 ± 0.751
2.292IleGlu: 2.292 ± 0.49
2.292IlePhe: 2.292 ± 0.751
1.146IleGly: 1.146 ± 0.245
0.573IleHis: 0.573 ± 0.474
1.146IleIle: 1.146 ± 0.745
1.146IleLys: 1.146 ± 0.245
5.158IleLeu: 5.158 ± 0.998
1.719IleMet: 1.719 ± 0.656
0.573IleAsn: 0.573 ± 1.244
2.865IlePro: 2.865 ± 2.16
0.573IleGln: 0.573 ± 0.373
1.146IleArg: 1.146 ± 1.242
2.865IleSer: 2.865 ± 0.841
2.865IleThr: 2.865 ± 1.11
1.719IleVal: 1.719 ± 0.656
2.292IleTrp: 2.292 ± 1.491
0.573IleTyr: 0.573 ± 0.474
0.0IleXaa: 0.0 ± 0.0
Lys
1.719LysAla: 1.719 ± 1.033
0.573LysCys: 0.573 ± 0.474
2.865LysAsp: 2.865 ± 0.572
4.011LysGlu: 4.011 ± 1.096
1.146LysPhe: 1.146 ± 0.745
4.011LysGly: 4.011 ± 1.771
0.0LysHis: 0.0 ± 0.0
1.719LysIle: 1.719 ± 1.118
1.719LysLys: 1.719 ± 0.656
4.011LysLeu: 4.011 ± 0.776
1.146LysMet: 1.146 ± 0.948
1.719LysAsn: 1.719 ± 0.416
0.573LysPro: 0.573 ± 0.373
1.146LysGln: 1.146 ± 2.488
2.865LysArg: 2.865 ± 0.572
2.865LysSer: 2.865 ± 1.11
2.865LysThr: 2.865 ± 0.572
5.731LysVal: 5.731 ± 1.145
2.292LysTrp: 2.292 ± 1.118
0.573LysTyr: 0.573 ± 0.474
0.0LysXaa: 0.0 ± 0.0
Leu
6.304LeuAla: 6.304 ± 1.068
1.719LeuCys: 1.719 ± 0.656
8.596LeuAsp: 8.596 ± 1.717
6.877LeuGlu: 6.877 ± 1.964
2.292LeuPhe: 2.292 ± 1.118
10.888LeuGly: 10.888 ± 1.966
3.438LeuHis: 3.438 ± 1.312
3.438LeuIle: 3.438 ± 0.997
5.158LeuLys: 5.158 ± 2.704
8.596LeuLeu: 8.596 ± 0.841
0.573LeuMet: 0.573 ± 0.474
2.865LeuAsn: 2.865 ± 0.572
5.731LeuPro: 5.731 ± 2.509
2.865LeuGln: 2.865 ± 0.87
8.023LeuArg: 8.023 ± 2.629
3.438LeuSer: 3.438 ± 1.312
4.585LeuThr: 4.585 ± 0.468
7.45LeuVal: 7.45 ± 1.296
1.146LeuTrp: 1.146 ± 0.245
3.438LeuTyr: 3.438 ± 0.735
0.0LeuXaa: 0.0 ± 0.0
Met
6.304MetAla: 6.304 ± 1.881
0.573MetCys: 0.573 ± 0.373
1.719MetAsp: 1.719 ± 0.416
1.719MetGlu: 1.719 ± 0.656
1.146MetPhe: 1.146 ± 1.242
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.573MetIle: 0.573 ± 0.474
1.719MetLys: 1.719 ± 0.656
2.292MetLeu: 2.292 ± 1.491
1.146MetMet: 1.146 ± 0.745
0.0MetAsn: 0.0 ± 0.0
1.719MetPro: 1.719 ± 0.656
1.719MetGln: 1.719 ± 0.656
2.292MetArg: 2.292 ± 0.49
2.292MetSer: 2.292 ± 0.49
2.292MetThr: 2.292 ± 1.491
2.865MetVal: 2.865 ± 0.841
0.0MetTrp: 0.0 ± 0.0
1.719MetTyr: 1.719 ± 1.422
0.0MetXaa: 0.0 ± 0.0
Asn
1.146AsnAla: 1.146 ± 0.745
0.573AsnCys: 0.573 ± 0.373
1.146AsnAsp: 1.146 ± 0.245
0.0AsnGlu: 0.0 ± 0.0
0.573AsnPhe: 0.573 ± 0.474
1.719AsnGly: 1.719 ± 1.218
0.0AsnHis: 0.0 ± 0.0
1.146AsnIle: 1.146 ± 0.245
1.719AsnLys: 1.719 ± 1.218
2.865AsnLeu: 2.865 ± 1.591
0.573AsnMet: 0.573 ± 0.474
2.292AsnAsn: 2.292 ± 1.367
2.865AsnPro: 2.865 ± 1.11
0.573AsnGln: 0.573 ± 0.373
4.585AsnArg: 4.585 ± 0.97
1.146AsnSer: 1.146 ± 1.173
0.573AsnThr: 0.573 ± 0.373
2.865AsnVal: 2.865 ± 0.572
1.146AsnTrp: 1.146 ± 0.745
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.438ProAla: 3.438 ± 2.058
1.146ProCys: 1.146 ± 1.242
4.011ProAsp: 4.011 ± 1.096
1.719ProGlu: 1.719 ± 1.118
0.0ProPhe: 0.0 ± 0.0
2.292ProGly: 2.292 ± 0.933
1.146ProHis: 1.146 ± 0.745
2.865ProIle: 2.865 ± 1.11
1.146ProLys: 1.146 ± 0.745
4.585ProLeu: 4.585 ± 4.548
0.573ProMet: 0.573 ± 0.474
1.146ProAsn: 1.146 ± 0.948
1.719ProPro: 1.719 ± 1.033
1.146ProGln: 1.146 ± 0.948
3.438ProArg: 3.438 ± 2.332
2.865ProSer: 2.865 ± 1.11
1.146ProThr: 1.146 ± 0.245
5.158ProVal: 5.158 ± 1.218
5.158ProTrp: 5.158 ± 0.499
0.573ProTyr: 0.573 ± 0.474
0.0ProXaa: 0.0 ± 0.0
Gln
2.865GlnAla: 2.865 ± 0.976
0.573GlnCys: 0.573 ± 0.373
2.292GlnAsp: 2.292 ± 0.751
0.573GlnGlu: 0.573 ± 0.474
0.573GlnPhe: 0.573 ± 0.474
2.292GlnGly: 2.292 ± 1.491
0.573GlnHis: 0.573 ± 0.474
0.573GlnIle: 0.573 ± 0.373
2.865GlnLys: 2.865 ± 1.863
2.865GlnLeu: 2.865 ± 1.587
0.0GlnMet: 0.0 ± 0.0
0.573GlnAsn: 0.573 ± 0.373
1.719GlnPro: 1.719 ± 1.218
1.719GlnGln: 1.719 ± 0.656
1.146GlnArg: 1.146 ± 0.948
1.146GlnSer: 1.146 ± 0.245
0.573GlnThr: 0.573 ± 0.373
2.292GlnVal: 2.292 ± 1.896
2.865GlnTrp: 2.865 ± 0.87
2.292GlnTyr: 2.292 ± 1.118
0.0GlnXaa: 0.0 ± 0.0
Arg
4.585ArgAla: 4.585 ± 1.521
2.292ArgCys: 2.292 ± 1.118
2.292ArgAsp: 2.292 ± 1.099
5.731ArgGlu: 5.731 ± 1.145
1.719ArgPhe: 1.719 ± 1.409
2.865ArgGly: 2.865 ± 3.667
0.573ArgHis: 0.573 ± 0.474
4.011ArgIle: 4.011 ± 0.735
1.719ArgLys: 1.719 ± 0.416
5.731ArgLeu: 5.731 ± 0.962
5.158ArgMet: 5.158 ± 0.589
1.719ArgAsn: 1.719 ± 2.39
2.292ArgPro: 2.292 ± 1.367
4.011ArgGln: 4.011 ± 1.096
5.158ArgArg: 5.158 ± 7.178
5.158ArgSer: 5.158 ± 1.218
5.158ArgThr: 5.158 ± 3.476
8.023ArgVal: 8.023 ± 1.93
2.292ArgTrp: 2.292 ± 1.118
3.438ArgTyr: 3.438 ± 0.832
0.0ArgXaa: 0.0 ± 0.0
Ser
6.877SerAla: 6.877 ± 0.219
1.146SerCys: 1.146 ± 0.948
3.438SerAsp: 3.438 ± 0.735
2.865SerGlu: 2.865 ± 1.11
1.146SerPhe: 1.146 ± 0.245
9.742SerGly: 9.742 ± 3.874
0.0SerHis: 0.0 ± 0.0
4.011SerIle: 4.011 ± 1.933
1.146SerLys: 1.146 ± 0.745
8.596SerLeu: 8.596 ± 2.081
1.719SerMet: 1.719 ± 1.118
1.719SerAsn: 1.719 ± 0.416
1.146SerPro: 1.146 ± 0.948
1.719SerGln: 1.719 ± 0.416
4.585SerArg: 4.585 ± 0.98
5.158SerSer: 5.158 ± 0.499
5.158SerThr: 5.158 ± 0.499
5.158SerVal: 5.158 ± 1.859
2.292SerTrp: 2.292 ± 0.933
3.438SerTyr: 3.438 ± 0.697
0.0SerXaa: 0.0 ± 0.0
Thr
2.865ThrAla: 2.865 ± 0.976
0.573ThrCys: 0.573 ± 0.373
4.011ThrAsp: 4.011 ± 0.776
2.292ThrGlu: 2.292 ± 0.933
0.573ThrPhe: 0.573 ± 0.474
6.304ThrGly: 6.304 ± 1.068
0.0ThrHis: 0.0 ± 0.0
1.719ThrIle: 1.719 ± 0.416
0.573ThrLys: 0.573 ± 0.474
3.438ThrLeu: 3.438 ± 0.832
1.146ThrMet: 1.146 ± 0.745
0.573ThrAsn: 0.573 ± 0.373
3.438ThrPro: 3.438 ± 0.735
0.573ThrGln: 0.573 ± 0.373
5.731ThrArg: 5.731 ± 0.962
8.023ThrSer: 8.023 ± 1.325
5.731ThrThr: 5.731 ± 0.272
5.731ThrVal: 5.731 ± 2.22
2.865ThrTrp: 2.865 ± 0.841
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
8.023ValAla: 8.023 ± 1.791
1.146ValCys: 1.146 ± 0.245
3.438ValAsp: 3.438 ± 2.058
5.158ValGlu: 5.158 ± 1.859
1.146ValPhe: 1.146 ± 0.948
9.742ValGly: 9.742 ± 1.648
2.292ValHis: 2.292 ± 0.933
2.292ValIle: 2.292 ± 0.49
4.011ValLys: 4.011 ± 0.684
4.011ValLeu: 4.011 ± 1.315
2.292ValMet: 2.292 ± 0.49
2.865ValAsn: 2.865 ± 1.863
4.011ValPro: 4.011 ± 2.079
2.865ValGln: 2.865 ± 0.572
5.731ValArg: 5.731 ± 3.095
8.023ValSer: 8.023 ± 2.968
5.731ValThr: 5.731 ± 1.54
4.011ValVal: 4.011 ± 3.319
2.292ValTrp: 2.292 ± 0.933
3.438ValTyr: 3.438 ± 0.697
0.0ValXaa: 0.0 ± 0.0
Trp
5.158TrpAla: 5.158 ± 1.33
1.719TrpCys: 1.719 ± 0.656
0.573TrpAsp: 0.573 ± 0.373
2.292TrpGlu: 2.292 ± 0.933
0.573TrpPhe: 0.573 ± 0.474
1.719TrpGly: 1.719 ± 2.44
1.719TrpHis: 1.719 ± 1.218
1.719TrpIle: 1.719 ± 0.656
3.438TrpLys: 3.438 ± 0.735
6.304TrpLeu: 6.304 ± 3.026
1.146TrpMet: 1.146 ± 0.301
1.719TrpAsn: 1.719 ± 1.118
0.573TrpPro: 0.573 ± 0.373
1.719TrpGln: 1.719 ± 1.118
1.146TrpArg: 1.146 ± 0.245
1.719TrpSer: 1.719 ± 0.656
0.573TrpThr: 0.573 ± 0.373
4.585TrpVal: 4.585 ± 1.521
1.146TrpTrp: 1.146 ± 0.745
1.146TrpTyr: 1.146 ± 0.745
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.585TyrAla: 4.585 ± 0.98
0.573TyrCys: 0.573 ± 0.373
0.573TyrAsp: 0.573 ± 0.373
0.573TyrGlu: 0.573 ± 0.373
0.0TyrPhe: 0.0 ± 0.0
2.865TyrGly: 2.865 ± 0.572
1.719TyrHis: 1.719 ± 0.416
1.146TyrIle: 1.146 ± 0.948
4.585TyrLys: 4.585 ± 0.98
2.865TyrLeu: 2.865 ± 1.587
0.0TyrMet: 0.0 ± 0.0
0.573TyrAsn: 0.573 ± 0.373
2.865TyrPro: 2.865 ± 0.87
0.573TyrGln: 0.573 ± 0.474
1.146TyrArg: 1.146 ± 1.242
1.719TyrSer: 1.719 ± 1.218
1.719TyrThr: 1.719 ± 0.416
2.865TyrVal: 2.865 ± 0.841
0.573TyrTrp: 0.573 ± 0.373
1.146TyrTyr: 1.146 ± 0.245
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1746 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski