Amino acid dipeptide frequency for Sclerophthora macrospora virus B

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
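
As a minimal sketch of how such frequencies could be computed, the Python snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice to count pairs only within (not across) proteins are assumptions made for illustration; the page does not state the exact counting procedure. One-letter codes are used here, whereas the table uses three-letter codes.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs, counted within one protein only (assumption).
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the proteome described on this page.
toy = ["MALSAL", "ALSU"]
freqs = dipeptide_per_mille(toy)
print(freqs["AL"])  # 375.0 (3 of the 8 pairs)
```

By construction, the frequencies over all observed pairs sum to 1000‰.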

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
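
The expected value and the red/blue distinction can be sketched in a few lines of Python. The function names and the simple greater-than/less-than rule are assumptions for illustration; the page does not say whether the colouring uses the 400- or the 441-pair baseline, nor whether any tolerance is applied.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"    # would be shown in red
    if observed_per_mille < expected:
        return "under"   # would be shown in blue
    return "expected"

print(expected_per_mille())   # 2.5 (i.e. 0.25%)
print(classify(13.956))       # over  (LeuSer in the table)
print(classify(0.607))        # under (AlaCys in the table)
```

With Xaa included, `expected_per_mille(21)` gives roughly 2.27‰ instead of 2.5‰.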

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
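
For instance, the unit conversion can be written as the following small Python sketch (the helper names are hypothetical):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

# LeuSer is listed as 13.956 per mille in the table.
fraction = per_mille_to_fraction(13.956)   # about 0.013956
percent = per_mille_to_percent(13.956)     # about 1.3956
```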

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.461AlaAla: 5.461 ± 1.416
0.607AlaCys: 0.607 ± 1.119
4.248AlaAsp: 4.248 ± 0.821
3.034AlaGlu: 3.034 ± 1.616
4.248AlaPhe: 4.248 ± 0.621
3.641AlaGly: 3.641 ± 3.828
1.214AlaHis: 1.214 ± 0.647
4.248AlaIle: 4.248 ± 2.063
3.034AlaLys: 3.034 ± 0.175
4.248AlaLeu: 4.248 ± 0.821
1.214AlaMet: 1.214 ± 0.647
3.034AlaAsn: 3.034 ± 2.709
3.641AlaPro: 3.641 ± 0.944
1.82AlaGln: 1.82 ± 0.472
3.034AlaArg: 3.034 ± 0.175
11.529AlaSer: 11.529 ± 5.393
5.461AlaThr: 5.461 ± 0.026
7.888AlaVal: 7.888 ± 0.123
1.82AlaTrp: 1.82 ± 0.97
2.427AlaTyr: 2.427 ± 1.591
0.0AlaXaa: 0.0 ± 0.0
Cys
0.607CysAla: 0.607 ± 1.119
1.214CysCys: 1.214 ± 0.647
0.607CysAsp: 0.607 ± 0.323
1.214CysGlu: 1.214 ± 0.647
0.607CysPhe: 0.607 ± 0.323
0.607CysGly: 0.607 ± 0.323
0.607CysHis: 0.607 ± 0.323
0.607CysIle: 0.607 ± 0.323
0.0CysLys: 0.0 ± 0.0
2.427CysLeu: 2.427 ± 1.293
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.214CysPro: 1.214 ± 2.237
1.214CysGln: 1.214 ± 0.647
1.214CysArg: 1.214 ± 0.647
1.82CysSer: 1.82 ± 0.97
0.0CysThr: 0.0 ± 0.0
1.214CysVal: 1.214 ± 0.647
0.607CysTrp: 0.607 ± 0.323
1.214CysTyr: 1.214 ± 0.795
0.0CysXaa: 0.0 ± 0.0
Asp
4.248AspAla: 4.248 ± 0.821
0.607AspCys: 0.607 ± 0.323
2.427AspAsp: 2.427 ± 1.293
2.427AspGlu: 2.427 ± 1.293
0.607AspPhe: 0.607 ± 0.323
3.641AspGly: 3.641 ± 1.94
0.0AspHis: 0.0 ± 0.0
2.427AspIle: 2.427 ± 0.149
2.427AspLys: 2.427 ± 0.149
5.461AspLeu: 5.461 ± 1.468
1.214AspMet: 1.214 ± 2.237
1.214AspAsn: 1.214 ± 0.795
1.214AspPro: 1.214 ± 0.647
1.82AspGln: 1.82 ± 0.472
4.854AspArg: 4.854 ± 2.586
4.854AspSer: 4.854 ± 1.144
2.427AspThr: 2.427 ± 1.293
4.854AspVal: 4.854 ± 2.586
1.214AspTrp: 1.214 ± 0.647
1.214AspTyr: 1.214 ± 0.647
0.0AspXaa: 0.0 ± 0.0
Glu
4.248GluAla: 4.248 ± 0.621
0.0GluCys: 0.0 ± 0.0
4.854GluAsp: 4.854 ± 2.586
2.427GluGlu: 2.427 ± 1.293
3.034GluPhe: 3.034 ± 1.616
2.427GluGly: 2.427 ± 1.591
0.607GluHis: 0.607 ± 0.323
3.641GluIle: 3.641 ± 1.94
3.034GluLys: 3.034 ± 1.616
5.461GluLeu: 5.461 ± 1.468
0.607GluMet: 0.607 ± 0.323
0.607GluAsn: 0.607 ± 0.323
3.034GluPro: 3.034 ± 1.616
1.82GluGln: 1.82 ± 0.97
4.248GluArg: 4.248 ± 0.821
4.248GluSer: 4.248 ± 0.821
1.214GluThr: 1.214 ± 0.795
3.034GluVal: 3.034 ± 0.175
1.82GluTrp: 1.82 ± 0.97
2.427GluTyr: 2.427 ± 1.293
0.0GluXaa: 0.0 ± 0.0
Phe
2.427PheAla: 2.427 ± 1.293
2.427PheCys: 2.427 ± 0.149
3.034PheAsp: 3.034 ± 1.616
2.427PheGlu: 2.427 ± 0.149
1.82PhePhe: 1.82 ± 0.97
1.214PheGly: 1.214 ± 0.795
0.607PheHis: 0.607 ± 1.119
1.214PheIle: 1.214 ± 0.647
0.607PheLys: 0.607 ± 1.119
4.248PheLeu: 4.248 ± 0.821
1.214PheMet: 1.214 ± 0.647
2.427PheAsn: 2.427 ± 1.591
1.82PhePro: 1.82 ± 0.97
1.82PheGln: 1.82 ± 0.97
3.034PheArg: 3.034 ± 1.616
4.248PheSer: 4.248 ± 0.821
3.034PheThr: 3.034 ± 1.267
4.854PheVal: 4.854 ± 0.297
0.0PheTrp: 0.0 ± 0.0
0.607PheTyr: 0.607 ± 0.323
0.0PheXaa: 0.0 ± 0.0
Gly
5.461GlyAla: 5.461 ± 2.858
1.214GlyCys: 1.214 ± 0.795
4.248GlyAsp: 4.248 ± 0.621
3.034GlyGlu: 3.034 ± 0.175
1.214GlyPhe: 1.214 ± 2.237
3.034GlyGly: 3.034 ± 0.175
1.214GlyHis: 1.214 ± 0.647
3.034GlyIle: 3.034 ± 1.616
3.034GlyLys: 3.034 ± 0.175
7.888GlyLeu: 7.888 ± 0.123
3.034GlyMet: 3.034 ± 1.616
1.82GlyAsn: 1.82 ± 0.472
2.427GlyPro: 2.427 ± 1.591
2.427GlyGln: 2.427 ± 3.032
4.854GlyArg: 4.854 ± 0.297
2.427GlySer: 2.427 ± 1.293
3.641GlyThr: 3.641 ± 0.944
5.461GlyVal: 5.461 ± 7.183
2.427GlyTrp: 2.427 ± 1.293
4.854GlyTyr: 4.854 ± 1.144
0.0GlyXaa: 0.0 ± 0.0
His
1.214HisAla: 1.214 ± 0.795
0.607HisCys: 0.607 ± 0.323
1.214HisAsp: 1.214 ± 0.647
1.82HisGlu: 1.82 ± 0.97
0.607HisPhe: 0.607 ± 0.323
0.0HisGly: 0.0 ± 0.0
1.82HisHis: 1.82 ± 0.472
1.214HisIle: 1.214 ± 0.795
1.214HisLys: 1.214 ± 0.647
1.214HisLeu: 1.214 ± 0.795
0.607HisMet: 0.607 ± 0.323
1.214HisAsn: 1.214 ± 0.647
1.214HisPro: 1.214 ± 0.647
0.607HisGln: 0.607 ± 0.323
0.0HisArg: 0.0 ± 0.0
1.214HisSer: 1.214 ± 0.647
1.214HisThr: 1.214 ± 0.647
1.82HisVal: 1.82 ± 0.97
0.0HisTrp: 0.0 ± 0.0
1.82HisTyr: 1.82 ± 0.97
0.0HisXaa: 0.0 ± 0.0
Ile
4.854IleAla: 4.854 ± 3.181
0.607IleCys: 0.607 ± 0.323
1.214IleAsp: 1.214 ± 0.647
2.427IleGlu: 2.427 ± 1.293
1.82IlePhe: 1.82 ± 0.472
3.641IleGly: 3.641 ± 0.944
1.214IleHis: 1.214 ± 0.647
5.461IleIle: 5.461 ± 1.468
3.034IleLys: 3.034 ± 1.616
1.82IleLeu: 1.82 ± 0.97
0.607IleMet: 0.607 ± 0.323
2.427IleAsn: 2.427 ± 0.149
2.427IlePro: 2.427 ± 0.149
1.214IleGln: 1.214 ± 0.647
1.82IleArg: 1.82 ± 0.97
1.214IleSer: 1.214 ± 0.647
4.248IleThr: 4.248 ± 4.946
4.854IleVal: 4.854 ± 3.181
0.0IleTrp: 0.0 ± 0.0
3.034IleTyr: 3.034 ± 0.175
0.0IleXaa: 0.0 ± 0.0
Lys
1.214LysAla: 1.214 ± 0.647
1.214LysCys: 1.214 ± 0.647
1.214LysAsp: 1.214 ± 0.647
1.214LysGlu: 1.214 ± 0.647
3.034LysPhe: 3.034 ± 1.616
3.034LysGly: 3.034 ± 0.175
1.214LysHis: 1.214 ± 0.647
4.854LysIle: 4.854 ± 1.144
3.034LysLys: 3.034 ± 5.593
1.82LysLeu: 1.82 ± 0.97
0.0LysMet: 0.0 ± 0.0
2.427LysAsn: 2.427 ± 1.591
2.427LysPro: 2.427 ± 0.149
0.607LysGln: 0.607 ± 1.119
3.641LysArg: 3.641 ± 1.94
4.854LysSer: 4.854 ± 0.297
3.641LysThr: 3.641 ± 2.386
3.034LysVal: 3.034 ± 0.175
0.0LysTrp: 0.0 ± 0.0
2.427LysTyr: 2.427 ± 1.293
0.0LysXaa: 0.0 ± 0.0
Leu
7.282LeuAla: 7.282 ± 0.446
0.607LeuCys: 0.607 ± 0.323
4.248LeuAsp: 4.248 ± 2.263
7.282LeuGlu: 7.282 ± 0.996
1.214LeuPhe: 1.214 ± 0.647
3.641LeuGly: 3.641 ± 0.498
3.034LeuHis: 3.034 ± 1.616
3.641LeuIle: 3.641 ± 1.94
4.854LeuLys: 4.854 ± 1.144
6.675LeuLeu: 6.675 ± 3.556
0.0LeuMet: 0.0 ± 0.0
3.641LeuAsn: 3.641 ± 0.498
4.854LeuPro: 4.854 ± 3.181
2.427LeuGln: 2.427 ± 0.149
5.461LeuArg: 5.461 ± 1.468
13.956LeuSer: 13.956 ± 1.668
1.82LeuThr: 1.82 ± 0.97
9.709LeuVal: 9.709 ± 2.037
0.607LeuTrp: 0.607 ± 0.323
4.248LeuTyr: 4.248 ± 2.263
0.0LeuXaa: 0.0 ± 0.0
Met
1.214MetAla: 1.214 ± 0.647
0.607MetCys: 0.607 ± 0.323
0.607MetAsp: 0.607 ± 0.323
0.607MetGlu: 0.607 ± 0.323
0.0MetPhe: 0.0 ± 0.0
0.607MetGly: 0.607 ± 0.323
0.0MetHis: 0.0 ± 0.0
1.214MetIle: 1.214 ± 0.795
0.607MetLys: 0.607 ± 0.323
1.82MetLeu: 1.82 ± 0.472
2.427MetMet: 2.427 ± 0.149
1.214MetAsn: 1.214 ± 0.795
1.214MetPro: 1.214 ± 0.647
0.607MetGln: 0.607 ± 0.323
2.427MetArg: 2.427 ± 1.293
1.82MetSer: 1.82 ± 0.97
1.214MetThr: 1.214 ± 2.237
0.607MetVal: 0.607 ± 0.323
0.0MetTrp: 0.0 ± 0.0
0.607MetTyr: 0.607 ± 1.119
0.0MetXaa: 0.0 ± 0.0
Asn
3.034AsnAla: 3.034 ± 4.151
0.607AsnCys: 0.607 ± 0.323
1.214AsnAsp: 1.214 ± 0.795
1.214AsnGlu: 1.214 ± 0.795
3.641AsnPhe: 3.641 ± 1.94
1.214AsnGly: 1.214 ± 0.647
0.0AsnHis: 0.0 ± 0.0
2.427AsnIle: 2.427 ± 3.032
0.607AsnLys: 0.607 ± 1.119
1.82AsnLeu: 1.82 ± 0.472
0.0AsnMet: 0.0 ± 0.265
2.427AsnAsn: 2.427 ± 3.032
0.0AsnPro: 0.0 ± 0.0
0.0AsnGln: 0.0 ± 0.0
3.034AsnArg: 3.034 ± 0.175
4.248AsnSer: 4.248 ± 2.063
3.034AsnThr: 3.034 ± 2.709
5.461AsnVal: 5.461 ± 1.416
0.607AsnTrp: 0.607 ± 0.323
2.427AsnTyr: 2.427 ± 0.149
0.0AsnXaa: 0.0 ± 0.0
Pro
3.034ProAla: 3.034 ± 1.616
0.0ProCys: 0.0 ± 0.0
1.214ProAsp: 1.214 ± 0.795
4.248ProGlu: 4.248 ± 0.621
3.034ProPhe: 3.034 ± 0.175
5.461ProGly: 5.461 ± 1.416
1.82ProHis: 1.82 ± 0.97
1.82ProIle: 1.82 ± 0.472
1.82ProLys: 1.82 ± 0.472
1.82ProLeu: 1.82 ± 0.97
0.607ProMet: 0.607 ± 0.323
1.214ProAsn: 1.214 ± 2.237
3.034ProPro: 3.034 ± 1.267
1.82ProGln: 1.82 ± 1.914
1.82ProArg: 1.82 ± 0.472
4.854ProSer: 4.854 ± 1.739
4.248ProThr: 4.248 ± 2.063
4.248ProVal: 4.248 ± 0.621
0.0ProTrp: 0.0 ± 0.0
1.214ProTyr: 1.214 ± 0.647
0.0ProXaa: 0.0 ± 0.0
Gln
1.82GlnAla: 1.82 ± 0.472
0.0GlnCys: 0.0 ± 0.0
0.607GlnAsp: 0.607 ± 0.323
1.214GlnGlu: 1.214 ± 0.647
0.607GlnPhe: 0.607 ± 0.323
4.248GlnGly: 4.248 ± 0.621
0.607GlnHis: 0.607 ± 0.323
0.0GlnIle: 0.0 ± 0.0
0.607GlnLys: 0.607 ± 0.323
4.248GlnLeu: 4.248 ± 0.821
0.607GlnMet: 0.607 ± 1.119
1.214GlnAsn: 1.214 ± 0.795
1.214GlnPro: 1.214 ± 0.795
0.0GlnGln: 0.0 ± 0.0
3.034GlnArg: 3.034 ± 1.616
0.607GlnSer: 0.607 ± 0.323
2.427GlnThr: 2.427 ± 1.591
3.034GlnVal: 3.034 ± 0.175
0.607GlnTrp: 0.607 ± 0.323
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.248ArgAla: 4.248 ± 0.621
1.82ArgCys: 1.82 ± 0.97
3.034ArgAsp: 3.034 ± 1.616
3.641ArgGlu: 3.641 ± 1.94
5.461ArgPhe: 5.461 ± 2.91
6.068ArgGly: 6.068 ± 3.233
1.82ArgHis: 1.82 ± 0.472
1.82ArgIle: 1.82 ± 1.914
2.427ArgLys: 2.427 ± 0.149
2.427ArgLeu: 2.427 ± 0.149
0.607ArgMet: 0.607 ± 0.323
1.82ArgAsn: 1.82 ± 0.472
1.82ArgPro: 1.82 ± 0.97
2.427ArgGln: 2.427 ± 0.149
2.427ArgArg: 2.427 ± 1.293
5.461ArgSer: 5.461 ± 2.91
3.641ArgThr: 3.641 ± 1.94
4.854ArgVal: 4.854 ± 1.739
3.034ArgTrp: 3.034 ± 1.616
4.854ArgTyr: 4.854 ± 1.144
0.0ArgXaa: 0.0 ± 0.0
Ser
10.316SerAla: 10.316 ± 1.17
2.427SerCys: 2.427 ± 0.149
2.427SerAsp: 2.427 ± 0.149
6.675SerGlu: 6.675 ± 2.114
4.248SerPhe: 4.248 ± 3.504
6.068SerGly: 6.068 ± 3.976
1.214SerHis: 1.214 ± 0.647
2.427SerIle: 2.427 ± 1.591
6.068SerLys: 6.068 ± 3.233
13.35SerLeu: 13.35 ± 1.345
2.427SerMet: 2.427 ± 1.591
2.427SerAsn: 2.427 ± 0.149
7.888SerPro: 7.888 ± 0.123
1.82SerGln: 1.82 ± 0.97
4.248SerArg: 4.248 ± 0.821
5.461SerSer: 5.461 ± 2.91
4.248SerThr: 4.248 ± 0.821
6.675SerVal: 6.675 ± 0.672
1.214SerTrp: 1.214 ± 0.795
3.034SerTyr: 3.034 ± 0.175
0.0SerXaa: 0.0 ± 0.0
Thr
5.461ThrAla: 5.461 ± 2.858
0.0ThrCys: 0.0 ± 0.0
3.641ThrAsp: 3.641 ± 0.498
1.214ThrGlu: 1.214 ± 0.647
2.427ThrPhe: 2.427 ± 0.149
3.641ThrGly: 3.641 ± 0.944
0.607ThrHis: 0.607 ± 0.323
1.214ThrIle: 1.214 ± 0.795
3.034ThrLys: 3.034 ± 1.267
8.495ThrLeu: 8.495 ± 1.242
0.0ThrMet: 0.0 ± 0.0
3.641ThrAsn: 3.641 ± 2.386
2.427ThrPro: 2.427 ± 1.591
2.427ThrGln: 2.427 ± 1.293
4.854ThrArg: 4.854 ± 0.297
6.675ThrSer: 6.675 ± 0.769
2.427ThrThr: 2.427 ± 0.149
3.034ThrVal: 3.034 ± 4.151
1.82ThrTrp: 1.82 ± 0.472
0.607ThrTyr: 0.607 ± 0.323
0.0ThrXaa: 0.0 ± 0.0
Val
7.888ValAla: 7.888 ± 3.007
1.214ValCys: 1.214 ± 0.647
5.461ValAsp: 5.461 ± 0.026
3.034ValGlu: 3.034 ± 0.175
4.248ValPhe: 4.248 ± 0.621
9.102ValGly: 9.102 ± 5.244
1.82ValHis: 1.82 ± 0.97
4.854ValIle: 4.854 ± 0.297
2.427ValLys: 2.427 ± 0.149
5.461ValLeu: 5.461 ± 0.026
1.82ValMet: 1.82 ± 0.97
2.427ValAsn: 2.427 ± 1.591
3.641ValPro: 3.641 ± 2.386
1.214ValGln: 1.214 ± 0.647
4.248ValArg: 4.248 ± 0.821
9.709ValSer: 9.709 ± 2.037
7.282ValThr: 7.282 ± 1.888
7.282ValVal: 7.282 ± 3.33
0.0ValTrp: 0.0 ± 0.0
3.034ValTyr: 3.034 ± 1.267
0.0ValXaa: 0.0 ± 0.0
Trp
0.607TrpAla: 0.607 ± 0.323
0.0TrpCys: 0.0 ± 0.0
1.214TrpAsp: 1.214 ± 0.647
2.427TrpGlu: 2.427 ± 1.293
0.607TrpPhe: 0.607 ± 0.323
1.214TrpGly: 1.214 ± 0.647
0.0TrpHis: 0.0 ± 0.0
0.607TrpIle: 0.607 ± 1.119
0.607TrpLys: 0.607 ± 0.323
1.214TrpLeu: 1.214 ± 0.647
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.214TrpArg: 1.214 ± 0.647
3.034TrpSer: 3.034 ± 0.175
1.214TrpThr: 1.214 ± 0.647
1.214TrpVal: 1.214 ± 0.647
0.607TrpTrp: 0.607 ± 0.323
1.214TrpTyr: 1.214 ± 0.647
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.214TyrAla: 1.214 ± 0.647
1.214TyrCys: 1.214 ± 0.647
2.427TyrAsp: 2.427 ± 1.293
1.214TyrGlu: 1.214 ± 0.647
0.607TyrPhe: 0.607 ± 0.323
4.248TyrGly: 4.248 ± 0.621
1.214TyrHis: 1.214 ± 0.795
1.214TyrIle: 1.214 ± 0.647
2.427TyrLys: 2.427 ± 0.149
7.282TyrLeu: 7.282 ± 2.438
1.82TyrMet: 1.82 ± 0.213
2.427TyrAsn: 2.427 ± 1.293
1.82TyrPro: 1.82 ± 0.472
0.607TyrGln: 0.607 ± 0.323
4.248TyrArg: 4.248 ± 0.621
2.427TyrSer: 2.427 ± 0.149
1.214TyrThr: 1.214 ± 0.647
3.034TyrVal: 3.034 ± 0.175
0.607TyrTrp: 0.607 ± 0.323
1.82TyrTyr: 1.82 ± 0.97
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1649 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
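
One way such a protein-level bootstrap could look is sketched below in Python: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. Taking the error as the sample standard deviation, the fixed seed, and the function names are all assumptions; the page only states that 100 bootstrap rounds at the protein level were used.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over all given proteins."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences, not the two proteins of this proteome.
toy = ["MALSALSV", "GLVALLSA"]
error = bootstrap_error(toy, "LS")
```

With only two proteins, as in this proteome, each replicate is one of very few distinct combinations, so the estimated errors should be read with caution.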

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski