Amino acid dipepetide frequency for Wenzhou tombus-like virus 10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.784AlaAla: 10.784 ± 4.191
0.719AlaCys: 0.719 ± 0.623
2.876AlaAsp: 2.876 ± 0.631
2.157AlaGlu: 2.157 ± 0.213
6.47AlaPhe: 6.47 ± 0.403
1.438AlaGly: 1.438 ± 0.837
0.719AlaHis: 0.719 ± 0.623
2.157AlaIle: 2.157 ± 1.255
4.313AlaLys: 4.313 ± 1.657
5.751AlaLeu: 5.751 ± 0.821
2.876AlaMet: 2.876 ± 1.272
0.719AlaAsn: 0.719 ± 0.418
5.751AlaPro: 5.751 ± 0.221
2.876AlaGln: 2.876 ± 0.631
5.032AlaArg: 5.032 ± 0.844
4.313AlaSer: 4.313 ± 0.426
5.751AlaThr: 5.751 ± 0.821
3.595AlaVal: 3.595 ± 0.008
0.0AlaTrp: 0.0 ± 0.0
6.47AlaTyr: 6.47 ± 0.639
0.0AlaXaa: 0.0 ± 0.0
Cys
2.157CysAla: 2.157 ± 1.255
2.876CysCys: 2.876 ± 0.41
0.719CysAsp: 0.719 ± 0.418
1.438CysGlu: 1.438 ± 0.837
2.157CysPhe: 2.157 ± 0.213
1.438CysGly: 1.438 ± 0.205
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.719CysLys: 0.719 ± 0.623
2.157CysLeu: 2.157 ± 1.87
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.438CysPro: 1.438 ± 1.247
2.157CysGln: 2.157 ± 0.829
1.438CysArg: 1.438 ± 0.837
6.47CysSer: 6.47 ± 3.528
1.438CysThr: 1.438 ± 0.205
4.313CysVal: 4.313 ± 0.426
0.0CysTrp: 0.0 ± 0.0
1.438CysTyr: 1.438 ± 0.837
0.0CysXaa: 0.0 ± 0.0
Asp
6.47AspAla: 6.47 ± 1.444
2.157AspCys: 2.157 ± 0.213
5.032AspAsp: 5.032 ± 0.197
1.438AspGlu: 1.438 ± 0.205
3.595AspPhe: 3.595 ± 1.05
2.876AspGly: 2.876 ± 0.41
0.0AspHis: 0.0 ± 0.0
2.157AspIle: 2.157 ± 0.213
2.876AspLys: 2.876 ± 0.41
3.595AspLeu: 3.595 ± 1.05
3.595AspMet: 3.595 ± 1.034
2.157AspAsn: 2.157 ± 1.255
2.876AspPro: 2.876 ± 1.452
0.719AspGln: 0.719 ± 0.418
2.876AspArg: 2.876 ± 1.452
2.876AspSer: 2.876 ± 1.673
2.157AspThr: 2.157 ± 0.829
2.157AspVal: 2.157 ± 0.829
0.719AspTrp: 0.719 ± 0.418
4.313AspTyr: 4.313 ± 1.468
0.0AspXaa: 0.0 ± 0.0
Glu
1.438GluAla: 1.438 ± 0.205
0.0GluCys: 0.0 ± 0.0
2.157GluAsp: 2.157 ± 0.829
4.313GluGlu: 4.313 ± 1.657
2.157GluPhe: 2.157 ± 0.829
2.157GluGly: 2.157 ± 0.213
2.876GluHis: 2.876 ± 1.452
0.719GluIle: 0.719 ± 0.623
0.719GluLys: 0.719 ± 0.418
2.876GluLeu: 2.876 ± 0.631
2.157GluMet: 2.157 ± 0.213
2.157GluAsn: 2.157 ± 1.255
0.719GluPro: 0.719 ± 0.418
2.157GluGln: 2.157 ± 0.213
3.595GluArg: 3.595 ± 1.05
2.876GluSer: 2.876 ± 0.631
1.438GluThr: 1.438 ± 1.247
3.595GluVal: 3.595 ± 2.091
0.0GluTrp: 0.0 ± 0.0
1.438GluTyr: 1.438 ± 1.247
0.0GluXaa: 0.0 ± 0.0
Phe
5.032PheAla: 5.032 ± 1.886
1.438PheCys: 1.438 ± 1.247
5.751PheAsp: 5.751 ± 0.821
2.876PheGlu: 2.876 ± 0.41
2.157PhePhe: 2.157 ± 1.255
5.032PheGly: 5.032 ± 0.197
1.438PheHis: 1.438 ± 0.205
3.595PheIle: 3.595 ± 1.05
4.313PheLys: 4.313 ± 1.657
3.595PheLeu: 3.595 ± 1.034
2.157PheMet: 2.157 ± 0.829
0.719PheAsn: 0.719 ± 0.418
2.157PhePro: 2.157 ± 0.213
2.157PheGln: 2.157 ± 1.255
2.876PheArg: 2.876 ± 0.631
4.313PheSer: 4.313 ± 0.616
2.876PheThr: 2.876 ± 1.673
5.032PheVal: 5.032 ± 0.197
0.0PheTrp: 0.0 ± 0.0
0.719PheTyr: 0.719 ± 0.623
0.0PheXaa: 0.0 ± 0.0
Gly
1.438GlyAla: 1.438 ± 1.247
3.595GlyCys: 3.595 ± 1.05
4.313GlyAsp: 4.313 ± 0.616
1.438GlyGlu: 1.438 ± 0.837
2.157GlyPhe: 2.157 ± 0.213
1.438GlyGly: 1.438 ± 0.837
0.719GlyHis: 0.719 ± 0.418
1.438GlyIle: 1.438 ± 0.837
2.157GlyLys: 2.157 ± 1.255
4.313GlyLeu: 4.313 ± 0.426
0.719GlyMet: 0.719 ± 0.623
5.751GlyAsn: 5.751 ± 0.221
0.0GlyPro: 0.0 ± 0.0
1.438GlyGln: 1.438 ± 0.205
4.313GlyArg: 4.313 ± 1.657
5.032GlySer: 5.032 ± 0.197
1.438GlyThr: 1.438 ± 0.205
3.595GlyVal: 3.595 ± 1.05
0.719GlyTrp: 0.719 ± 0.418
2.876GlyTyr: 2.876 ± 1.452
0.0GlyXaa: 0.0 ± 0.0
His
2.876HisAla: 2.876 ± 0.41
0.0HisCys: 0.0 ± 0.0
0.719HisAsp: 0.719 ± 0.418
0.719HisGlu: 0.719 ± 0.418
2.876HisPhe: 2.876 ± 0.41
0.719HisGly: 0.719 ± 0.623
0.719HisHis: 0.719 ± 0.623
2.157HisIle: 2.157 ± 0.213
0.719HisLys: 0.719 ± 0.418
0.719HisLeu: 0.719 ± 0.418
2.157HisMet: 2.157 ± 0.213
2.157HisAsn: 2.157 ± 0.213
2.157HisPro: 2.157 ± 0.213
1.438HisGln: 1.438 ± 1.247
0.719HisArg: 0.719 ± 0.623
0.719HisSer: 0.719 ± 0.623
2.876HisThr: 2.876 ± 0.41
1.438HisVal: 1.438 ± 0.205
0.0HisTrp: 0.0 ± 0.0
1.438HisTyr: 1.438 ± 0.205
0.0HisXaa: 0.0 ± 0.0
Ile
2.157IleAla: 2.157 ± 0.829
2.876IleCys: 2.876 ± 0.631
2.157IleAsp: 2.157 ± 0.829
0.719IleGlu: 0.719 ± 0.418
2.876IlePhe: 2.876 ± 0.41
0.719IleGly: 0.719 ± 0.418
0.0IleHis: 0.0 ± 0.0
2.876IleIle: 2.876 ± 1.673
2.157IleLys: 2.157 ± 0.829
4.313IleLeu: 4.313 ± 0.616
0.719IleMet: 0.719 ± 0.418
1.438IleAsn: 1.438 ± 0.837
3.595IlePro: 3.595 ± 0.008
0.719IleGln: 0.719 ± 0.418
2.157IleArg: 2.157 ± 0.213
3.595IleSer: 3.595 ± 0.008
2.876IleThr: 2.876 ± 0.631
4.313IleVal: 4.313 ± 1.657
1.438IleTrp: 1.438 ± 0.205
2.876IleTyr: 2.876 ± 0.41
0.0IleXaa: 0.0 ± 0.0
Lys
2.876LysAla: 2.876 ± 0.631
2.157LysCys: 2.157 ± 0.829
0.719LysAsp: 0.719 ± 0.623
4.313LysGlu: 4.313 ± 0.426
1.438LysPhe: 1.438 ± 1.247
3.595LysGly: 3.595 ± 1.05
3.595LysHis: 3.595 ± 1.034
1.438LysIle: 1.438 ± 1.247
1.438LysLys: 1.438 ± 0.837
2.876LysLeu: 2.876 ± 0.631
2.876LysMet: 2.876 ± 1.452
3.595LysAsn: 3.595 ± 1.034
0.0LysPro: 0.0 ± 0.0
2.876LysGln: 2.876 ± 0.631
3.595LysArg: 3.595 ± 1.034
3.595LysSer: 3.595 ± 0.008
3.595LysThr: 3.595 ± 1.05
0.719LysVal: 0.719 ± 0.418
2.157LysTrp: 2.157 ± 0.213
0.719LysTyr: 0.719 ± 0.623
0.0LysXaa: 0.0 ± 0.0
Leu
6.47LeuAla: 6.47 ± 0.403
1.438LeuCys: 1.438 ± 0.205
3.595LeuAsp: 3.595 ± 1.05
4.313LeuGlu: 4.313 ± 3.741
2.876LeuPhe: 2.876 ± 0.631
7.908LeuGly: 7.908 ± 1.65
2.876LeuHis: 2.876 ± 1.452
5.032LeuIle: 5.032 ± 0.844
4.313LeuLys: 4.313 ± 0.616
10.065LeuLeu: 10.065 ± 1.689
1.438LeuMet: 1.438 ± 0.205
5.032LeuAsn: 5.032 ± 0.197
2.876LeuPro: 2.876 ± 0.631
2.876LeuGln: 2.876 ± 0.631
5.751LeuArg: 5.751 ± 0.221
6.47LeuSer: 6.47 ± 2.486
3.595LeuThr: 3.595 ± 2.076
3.595LeuVal: 3.595 ± 1.05
0.719LeuTrp: 0.719 ± 0.418
3.595LeuTyr: 3.595 ± 0.008
0.0LeuXaa: 0.0 ± 0.0
Met
2.876MetAla: 2.876 ± 0.631
2.157MetCys: 2.157 ± 1.87
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
1.438MetPhe: 1.438 ± 0.205
0.719MetGly: 0.719 ± 0.418
1.438MetHis: 1.438 ± 0.205
2.157MetIle: 2.157 ± 0.829
0.0MetLys: 0.0 ± 0.0
5.032MetLeu: 5.032 ± 2.281
0.719MetMet: 0.719 ± 0.418
1.438MetAsn: 1.438 ± 0.205
1.438MetPro: 1.438 ± 0.837
0.719MetGln: 0.719 ± 0.418
2.157MetArg: 2.157 ± 0.213
2.157MetSer: 2.157 ± 0.829
0.719MetThr: 0.719 ± 0.418
1.438MetVal: 1.438 ± 0.205
0.0MetTrp: 0.0 ± 0.0
1.438MetTyr: 1.438 ± 0.837
0.0MetXaa: 0.0 ± 0.0
Asn
2.876AsnAla: 2.876 ± 1.452
2.157AsnCys: 2.157 ± 0.213
2.876AsnAsp: 2.876 ± 0.631
0.719AsnGlu: 0.719 ± 0.418
2.876AsnPhe: 2.876 ± 0.41
3.595AsnGly: 3.595 ± 2.091
0.0AsnHis: 0.0 ± 0.0
2.876AsnIle: 2.876 ± 0.41
4.313AsnLys: 4.313 ± 0.426
2.876AsnLeu: 2.876 ± 1.452
0.719AsnMet: 0.719 ± 0.418
2.157AsnAsn: 2.157 ± 0.213
2.876AsnPro: 2.876 ± 1.673
1.438AsnGln: 1.438 ± 0.205
2.876AsnArg: 2.876 ± 1.673
3.595AsnSer: 3.595 ± 0.008
0.719AsnThr: 0.719 ± 0.418
3.595AsnVal: 3.595 ± 1.05
0.0AsnTrp: 0.0 ± 0.0
0.719AsnTyr: 0.719 ± 0.418
0.0AsnXaa: 0.0 ± 0.0
Pro
2.157ProAla: 2.157 ± 0.213
2.157ProCys: 2.157 ± 0.829
3.595ProAsp: 3.595 ± 1.034
0.719ProGlu: 0.719 ± 0.623
0.0ProPhe: 0.0 ± 0.0
2.157ProGly: 2.157 ± 0.829
1.438ProHis: 1.438 ± 0.837
3.595ProIle: 3.595 ± 1.034
2.876ProLys: 2.876 ± 1.673
7.189ProLeu: 7.189 ± 1.058
0.0ProMet: 0.0 ± 0.0
1.438ProAsn: 1.438 ± 0.205
3.595ProPro: 3.595 ± 2.091
2.157ProGln: 2.157 ± 0.213
2.157ProArg: 2.157 ± 0.829
5.751ProSer: 5.751 ± 1.263
5.751ProThr: 5.751 ± 1.263
2.876ProVal: 2.876 ± 0.631
0.0ProTrp: 0.0 ± 0.0
0.719ProTyr: 0.719 ± 0.418
0.0ProXaa: 0.0 ± 0.0
Gln
0.719GlnAla: 0.719 ± 0.418
0.719GlnCys: 0.719 ± 0.418
0.0GlnAsp: 0.0 ± 0.0
0.719GlnGlu: 0.719 ± 0.623
5.032GlnPhe: 5.032 ± 1.886
0.719GlnGly: 0.719 ± 0.623
0.0GlnHis: 0.0 ± 0.0
1.438GlnIle: 1.438 ± 0.837
0.719GlnLys: 0.719 ± 0.418
2.876GlnLeu: 2.876 ± 0.41
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
4.313GlnPro: 4.313 ± 0.616
2.876GlnGln: 2.876 ± 0.41
3.595GlnArg: 3.595 ± 1.034
4.313GlnSer: 4.313 ± 0.426
2.157GlnThr: 2.157 ± 0.213
0.719GlnVal: 0.719 ± 0.623
0.0GlnTrp: 0.0 ± 0.0
2.876GlnTyr: 2.876 ± 0.631
0.0GlnXaa: 0.0 ± 0.0
Arg
2.876ArgAla: 2.876 ± 0.41
0.719ArgCys: 0.719 ± 0.418
3.595ArgAsp: 3.595 ± 1.05
3.595ArgGlu: 3.595 ± 0.008
5.032ArgPhe: 5.032 ± 1.239
3.595ArgGly: 3.595 ± 1.05
1.438ArgHis: 1.438 ± 0.205
2.876ArgIle: 2.876 ± 0.631
2.157ArgLys: 2.157 ± 0.213
5.751ArgLeu: 5.751 ± 2.904
0.0ArgMet: 0.0 ± 0.0
4.313ArgAsn: 4.313 ± 0.426
2.157ArgPro: 2.157 ± 0.213
2.157ArgGln: 2.157 ± 0.213
7.189ArgArg: 7.189 ± 0.016
4.313ArgSer: 4.313 ± 0.426
2.876ArgThr: 2.876 ± 1.452
7.189ArgVal: 7.189 ± 0.016
0.719ArgTrp: 0.719 ± 0.623
2.876ArgTyr: 2.876 ± 0.41
0.0ArgXaa: 0.0 ± 0.0
Ser
6.47SerAla: 6.47 ± 0.403
2.876SerCys: 2.876 ± 0.41
2.876SerAsp: 2.876 ± 0.631
4.313SerGlu: 4.313 ± 0.616
9.346SerPhe: 9.346 ± 0.229
4.313SerGly: 4.313 ± 0.616
2.876SerHis: 2.876 ± 1.452
5.751SerIle: 5.751 ± 1.863
4.313SerLys: 4.313 ± 1.657
7.908SerLeu: 7.908 ± 0.608
2.876SerMet: 2.876 ± 0.41
2.876SerAsn: 2.876 ± 1.673
2.157SerPro: 2.157 ± 0.213
2.876SerGln: 2.876 ± 0.41
3.595SerArg: 3.595 ± 2.076
11.503SerSer: 11.503 ± 0.442
2.876SerThr: 2.876 ± 0.631
5.751SerVal: 5.751 ± 1.263
0.719SerTrp: 0.719 ± 0.418
2.876SerTyr: 2.876 ± 0.631
0.0SerXaa: 0.0 ± 0.0
Thr
2.876ThrAla: 2.876 ± 0.41
0.0ThrCys: 0.0 ± 0.0
3.595ThrAsp: 3.595 ± 0.008
2.876ThrGlu: 2.876 ± 1.673
2.876ThrPhe: 2.876 ± 1.673
1.438ThrGly: 1.438 ± 0.205
2.157ThrHis: 2.157 ± 1.255
1.438ThrIle: 1.438 ± 0.205
4.313ThrLys: 4.313 ± 0.616
4.313ThrLeu: 4.313 ± 0.426
2.876ThrMet: 2.876 ± 0.981
2.876ThrAsn: 2.876 ± 1.452
7.189ThrPro: 7.189 ± 0.016
0.719ThrGln: 0.719 ± 0.623
2.876ThrArg: 2.876 ± 0.41
5.751ThrSer: 5.751 ± 0.821
2.876ThrThr: 2.876 ± 0.631
1.438ThrVal: 1.438 ± 0.837
2.157ThrTrp: 2.157 ± 0.829
2.157ThrTyr: 2.157 ± 0.829
0.0ThrXaa: 0.0 ± 0.0
Val
5.032ValAla: 5.032 ± 0.844
2.876ValCys: 2.876 ± 0.41
5.032ValAsp: 5.032 ± 0.844
2.157ValGlu: 2.157 ± 1.255
2.157ValPhe: 2.157 ± 1.87
1.438ValGly: 1.438 ± 0.205
1.438ValHis: 1.438 ± 0.205
1.438ValIle: 1.438 ± 0.205
4.313ValLys: 4.313 ± 0.616
5.032ValLeu: 5.032 ± 1.886
0.719ValMet: 0.719 ± 0.418
2.157ValAsn: 2.157 ± 0.213
2.876ValPro: 2.876 ± 0.41
0.719ValGln: 0.719 ± 0.418
5.032ValArg: 5.032 ± 1.886
9.346ValSer: 9.346 ± 0.229
4.313ValThr: 4.313 ± 1.468
4.313ValVal: 4.313 ± 0.426
0.719ValTrp: 0.719 ± 0.418
3.595ValTyr: 3.595 ± 1.05
0.0ValXaa: 0.0 ± 0.0
Trp
2.157TrpAla: 2.157 ± 0.213
0.0TrpCys: 0.0 ± 0.0
2.157TrpAsp: 2.157 ± 0.829
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.719TrpLys: 0.719 ± 0.418
0.719TrpLeu: 0.719 ± 0.623
0.719TrpMet: 0.719 ± 0.418
0.719TrpAsn: 0.719 ± 0.418
0.719TrpPro: 0.719 ± 0.418
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.719TrpThr: 0.719 ± 0.623
2.157TrpVal: 2.157 ± 0.213
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.751TyrAla: 5.751 ± 1.263
1.438TyrCys: 1.438 ± 0.205
2.876TyrAsp: 2.876 ± 0.41
0.719TyrGlu: 0.719 ± 0.418
0.719TyrPhe: 0.719 ± 0.418
3.595TyrGly: 3.595 ± 0.008
3.595TyrHis: 3.595 ± 2.091
1.438TyrIle: 1.438 ± 1.247
0.719TyrLys: 0.719 ± 0.623
2.876TyrLeu: 2.876 ± 1.452
0.0TyrMet: 0.0 ± 0.0
2.157TyrAsn: 2.157 ± 1.255
1.438TyrPro: 1.438 ± 0.837
0.719TyrGln: 0.719 ± 0.623
3.595TyrArg: 3.595 ± 1.034
2.157TyrSer: 2.157 ± 0.829
5.751TyrThr: 5.751 ± 0.221
2.876TyrVal: 2.876 ± 1.673
0.719TyrTrp: 0.719 ± 0.623
0.719TyrTyr: 0.719 ± 0.418
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1392 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski