Amino acid dipepetide frequency for Maize streak virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.247AlaAla: 3.247 ± 2.336
0.0AlaCys: 0.0 ± 0.0
3.247AlaAsp: 3.247 ± 2.404
0.0AlaGlu: 0.0 ± 0.0
0.0AlaPhe: 0.0 ± 0.0
1.623AlaGly: 1.623 ± 1.202
3.247AlaHis: 3.247 ± 0.991
6.494AlaIle: 6.494 ± 4.068
1.623AlaLys: 1.623 ± 1.202
6.494AlaLeu: 6.494 ± 1.625
1.623AlaMet: 1.623 ± 1.991
0.0AlaAsn: 0.0 ± 0.0
8.117AlaPro: 8.117 ± 4.93
1.623AlaGln: 1.623 ± 1.202
4.87AlaArg: 4.87 ± 1.545
8.117AlaSer: 8.117 ± 2.778
3.247AlaThr: 3.247 ± 2.237
1.623AlaVal: 1.623 ± 2.525
1.623AlaTrp: 1.623 ± 1.202
3.247AlaTyr: 3.247 ± 0.991
0.0AlaXaa: 0.0 ± 0.0
Cys
1.623CysAla: 1.623 ± 1.119
0.0CysCys: 0.0 ± 0.0
1.623CysAsp: 1.623 ± 1.202
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
1.623CysHis: 1.623 ± 1.202
1.623CysIle: 1.623 ± 2.525
3.247CysLys: 3.247 ± 2.404
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.623CysAsn: 1.623 ± 1.119
1.623CysPro: 1.623 ± 1.119
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.623CysTyr: 1.623 ± 1.119
0.0CysXaa: 0.0 ± 0.0
Asp
4.87AspAla: 4.87 ± 1.897
0.0AspCys: 0.0 ± 0.0
3.247AspAsp: 3.247 ± 2.237
4.87AspGlu: 4.87 ± 1.897
1.623AspPhe: 1.623 ± 1.202
6.494AspGly: 6.494 ± 1.981
0.0AspHis: 0.0 ± 0.0
6.494AspIle: 6.494 ± 2.75
1.623AspLys: 1.623 ± 1.119
6.494AspLeu: 6.494 ± 1.351
0.0AspMet: 0.0 ± 0.0
0.0AspAsn: 0.0 ± 0.0
1.623AspPro: 1.623 ± 2.525
1.623AspGln: 1.623 ± 1.119
1.623AspArg: 1.623 ± 2.525
1.623AspSer: 1.623 ± 1.119
4.87AspThr: 4.87 ± 3.605
1.623AspVal: 1.623 ± 1.119
3.247AspTrp: 3.247 ± 2.237
3.247AspTyr: 3.247 ± 0.991
0.0AspXaa: 0.0 ± 0.0
Glu
8.117GluAla: 8.117 ± 1.94
0.0GluCys: 0.0 ± 0.0
1.623GluAsp: 1.623 ± 1.119
3.247GluGlu: 3.247 ± 2.241
1.623GluPhe: 1.623 ± 1.119
1.623GluGly: 1.623 ± 1.202
0.0GluHis: 0.0 ± 0.0
3.247GluIle: 3.247 ± 2.237
3.247GluLys: 3.247 ± 0.991
4.87GluLeu: 4.87 ± 1.545
0.0GluMet: 0.0 ± 0.0
1.623GluAsn: 1.623 ± 1.119
4.87GluPro: 4.87 ± 2.485
0.0GluGln: 0.0 ± 0.0
1.623GluArg: 1.623 ± 1.119
1.623GluSer: 1.623 ± 1.119
3.247GluThr: 3.247 ± 2.404
1.623GluVal: 1.623 ± 2.525
0.0GluTrp: 0.0 ± 0.0
3.247GluTyr: 3.247 ± 2.237
0.0GluXaa: 0.0 ± 0.0
Phe
1.623PheAla: 1.623 ± 1.202
0.0PheCys: 0.0 ± 0.0
1.623PheAsp: 1.623 ± 1.119
3.247PheGlu: 3.247 ± 2.237
1.623PhePhe: 1.623 ± 1.119
1.623PheGly: 1.623 ± 2.525
3.247PheHis: 3.247 ± 0.991
3.247PheIle: 3.247 ± 2.237
1.623PheLys: 1.623 ± 1.202
3.247PheLeu: 3.247 ± 2.237
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
3.247PhePro: 3.247 ± 2.237
0.0PheGln: 0.0 ± 0.0
0.0PheArg: 0.0 ± 0.0
0.0PheSer: 0.0 ± 0.0
4.87PheThr: 4.87 ± 3.605
6.494PheVal: 6.494 ± 4.671
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.247GlyAla: 3.247 ± 2.404
0.0GlyCys: 0.0 ± 0.0
1.623GlyAsp: 1.623 ± 1.202
3.247GlyGlu: 3.247 ± 2.336
1.623GlyPhe: 1.623 ± 1.119
8.117GlyGly: 8.117 ± 4.475
0.0GlyHis: 0.0 ± 0.0
1.623GlyIle: 1.623 ± 1.202
1.623GlyLys: 1.623 ± 1.202
1.623GlyLeu: 1.623 ± 1.202
1.623GlyMet: 1.623 ± 1.911
12.987GlyAsn: 12.987 ± 3.962
3.247GlyPro: 3.247 ± 2.241
4.87GlyGln: 4.87 ± 1.545
4.87GlyArg: 4.87 ± 1.545
6.494GlySer: 6.494 ± 1.625
3.247GlyThr: 3.247 ± 0.991
8.117GlyVal: 8.117 ± 6.009
1.623GlyTrp: 1.623 ± 2.525
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.623HisAla: 1.623 ± 1.119
1.623HisCys: 1.623 ± 1.119
0.0HisAsp: 0.0 ± 0.0
1.623HisGlu: 1.623 ± 1.119
1.623HisPhe: 1.623 ± 1.202
1.623HisGly: 1.623 ± 1.202
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
1.623HisLys: 1.623 ± 1.202
1.623HisLeu: 1.623 ± 1.119
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
4.87HisPro: 4.87 ± 2.485
0.0HisGln: 0.0 ± 0.0
3.247HisArg: 3.247 ± 0.991
1.623HisSer: 1.623 ± 1.202
1.623HisThr: 1.623 ± 1.202
3.247HisVal: 3.247 ± 2.241
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
1.623IleAla: 1.623 ± 1.202
3.247IleCys: 3.247 ± 2.241
1.623IleAsp: 1.623 ± 1.119
0.0IleGlu: 0.0 ± 0.0
3.247IlePhe: 3.247 ± 2.336
1.623IleGly: 1.623 ± 1.202
0.0IleHis: 0.0 ± 0.0
3.247IleIle: 3.247 ± 2.237
1.623IleLys: 1.623 ± 1.119
4.87IleLeu: 4.87 ± 4.642
3.247IleMet: 3.247 ± 2.237
3.247IleAsn: 3.247 ± 0.991
3.247IlePro: 3.247 ± 2.241
8.117IleGln: 8.117 ± 2.598
0.0IleArg: 0.0 ± 0.0
3.247IleSer: 3.247 ± 2.237
3.247IleThr: 3.247 ± 2.404
1.623IleVal: 1.623 ± 1.119
0.0IleTrp: 0.0 ± 0.0
4.87IleTyr: 4.87 ± 4.642
0.0IleXaa: 0.0 ± 0.0
Lys
8.117LysAla: 8.117 ± 4.475
0.0LysCys: 0.0 ± 0.0
4.87LysAsp: 4.87 ± 1.738
1.623LysGlu: 1.623 ± 1.119
1.623LysPhe: 1.623 ± 1.202
3.247LysGly: 3.247 ± 0.991
0.0LysHis: 0.0 ± 0.0
0.0LysIle: 0.0 ± 0.0
8.117LysLys: 8.117 ± 2.778
4.87LysLeu: 4.87 ± 3.356
0.0LysMet: 0.0 ± 0.0
1.623LysAsn: 1.623 ± 1.202
6.494LysPro: 6.494 ± 2.75
1.623LysGln: 1.623 ± 1.119
8.117LysArg: 8.117 ± 4.184
4.87LysSer: 4.87 ± 1.738
0.0LysThr: 0.0 ± 0.0
4.87LysVal: 4.87 ± 1.897
0.0LysTrp: 0.0 ± 0.0
1.623LysTyr: 1.623 ± 1.202
0.0LysXaa: 0.0 ± 0.0
Leu
1.623LeuAla: 1.623 ± 1.119
3.247LeuCys: 3.247 ± 0.991
1.623LeuAsp: 1.623 ± 1.119
1.623LeuGlu: 1.623 ± 2.525
4.87LeuPhe: 4.87 ± 1.738
4.87LeuGly: 4.87 ± 1.738
6.494LeuHis: 6.494 ± 2.75
8.117LeuIle: 8.117 ± 6.592
4.87LeuLys: 4.87 ± 1.545
8.117LeuLeu: 8.117 ± 1.94
0.0LeuMet: 0.0 ± 0.0
0.0LeuAsn: 0.0 ± 0.0
1.623LeuPro: 1.623 ± 2.525
8.117LeuGln: 8.117 ± 3.822
1.623LeuArg: 1.623 ± 2.525
3.247LeuSer: 3.247 ± 2.241
3.247LeuThr: 3.247 ± 0.991
6.494LeuVal: 6.494 ± 1.625
3.247LeuTrp: 3.247 ± 2.241
6.494LeuTyr: 6.494 ± 1.625
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
3.247MetAsp: 3.247 ± 2.241
1.623MetGlu: 1.623 ± 1.202
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.623MetIle: 1.623 ± 1.202
0.0MetLys: 0.0 ± 0.0
1.623MetLeu: 1.623 ± 1.119
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
1.623MetGln: 1.623 ± 1.119
0.0MetArg: 0.0 ± 0.0
3.247MetSer: 3.247 ± 0.991
1.623MetThr: 1.623 ± 1.119
1.623MetVal: 1.623 ± 1.119
1.623MetTrp: 1.623 ± 1.202
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.247AsnAla: 3.247 ± 2.404
0.0AsnCys: 0.0 ± 0.0
0.0AsnAsp: 0.0 ± 0.0
1.623AsnGlu: 1.623 ± 1.119
0.0AsnPhe: 0.0 ± 0.0
1.623AsnGly: 1.623 ± 1.202
0.0AsnHis: 0.0 ± 0.0
3.247AsnIle: 3.247 ± 2.237
1.623AsnLys: 1.623 ± 1.119
3.247AsnLeu: 3.247 ± 2.237
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
4.87AsnPro: 4.87 ± 2.485
3.247AsnGln: 3.247 ± 2.404
4.87AsnArg: 4.87 ± 1.897
3.247AsnSer: 3.247 ± 2.237
4.87AsnThr: 4.87 ± 1.545
4.87AsnVal: 4.87 ± 1.897
0.0AsnTrp: 0.0 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
1.623ProCys: 1.623 ± 1.202
3.247ProAsp: 3.247 ± 2.237
6.494ProGlu: 6.494 ± 4.474
1.623ProPhe: 1.623 ± 2.525
6.494ProGly: 6.494 ± 4.671
3.247ProHis: 3.247 ± 2.237
1.623ProIle: 1.623 ± 2.525
1.623ProLys: 1.623 ± 1.202
3.247ProLeu: 3.247 ± 2.237
0.0ProMet: 0.0 ± 0.0
4.87ProAsn: 4.87 ± 2.485
3.247ProPro: 3.247 ± 2.404
1.623ProGln: 1.623 ± 2.525
3.247ProArg: 3.247 ± 2.241
6.494ProSer: 6.494 ± 3.018
11.364ProThr: 11.364 ± 5.612
3.247ProVal: 3.247 ± 2.241
0.0ProTrp: 0.0 ± 0.0
1.623ProTyr: 1.623 ± 1.119
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
0.0GlnCys: 0.0 ± 0.0
1.623GlnAsp: 1.623 ± 1.119
1.623GlnGlu: 1.623 ± 1.119
1.623GlnPhe: 1.623 ± 1.119
6.494GlnGly: 6.494 ± 3.135
0.0GlnHis: 0.0 ± 0.0
3.247GlnIle: 3.247 ± 0.991
1.623GlnLys: 1.623 ± 1.119
0.0GlnLeu: 0.0 ± 0.0
1.623GlnMet: 1.623 ± 1.074
0.0GlnAsn: 0.0 ± 0.0
3.247GlnPro: 3.247 ± 2.237
1.623GlnGln: 1.623 ± 1.119
3.247GlnArg: 3.247 ± 2.404
3.247GlnSer: 3.247 ± 2.241
4.87GlnThr: 4.87 ± 1.897
1.623GlnVal: 1.623 ± 1.119
4.87GlnTrp: 4.87 ± 1.897
1.623GlnTyr: 1.623 ± 1.119
0.0GlnXaa: 0.0 ± 0.0
Arg
3.247ArgAla: 3.247 ± 0.991
0.0ArgCys: 0.0 ± 0.0
3.247ArgAsp: 3.247 ± 2.241
4.87ArgGlu: 4.87 ± 1.738
3.247ArgPhe: 3.247 ± 0.991
8.117ArgGly: 8.117 ± 2.405
1.623ArgHis: 1.623 ± 1.202
1.623ArgIle: 1.623 ± 1.202
4.87ArgLys: 4.87 ± 1.738
1.623ArgLeu: 1.623 ± 1.202
0.0ArgMet: 0.0 ± 0.0
4.87ArgAsn: 4.87 ± 1.545
0.0ArgPro: 0.0 ± 0.0
1.623ArgGln: 1.623 ± 1.119
4.87ArgArg: 4.87 ± 4.714
3.247ArgSer: 3.247 ± 2.336
3.247ArgThr: 3.247 ± 0.991
1.623ArgVal: 1.623 ± 2.525
3.247ArgTrp: 3.247 ± 0.991
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
6.494SerAla: 6.494 ± 1.351
1.623SerCys: 1.623 ± 1.202
8.117SerAsp: 8.117 ± 2.778
6.494SerGlu: 6.494 ± 1.351
3.247SerPhe: 3.247 ± 2.241
4.87SerGly: 4.87 ± 3.605
3.247SerHis: 3.247 ± 2.241
1.623SerIle: 1.623 ± 1.119
9.74SerLys: 9.74 ± 4.914
1.623SerLeu: 1.623 ± 1.202
1.623SerMet: 1.623 ± 1.119
4.87SerAsn: 4.87 ± 1.897
1.623SerPro: 1.623 ± 1.119
1.623SerGln: 1.623 ± 1.119
4.87SerArg: 4.87 ± 1.738
9.74SerSer: 9.74 ± 6.711
6.494SerThr: 6.494 ± 1.625
8.117SerVal: 8.117 ± 3.822
3.247SerTrp: 3.247 ± 2.336
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.87ThrAla: 4.87 ± 4.714
1.623ThrCys: 1.623 ± 1.119
4.87ThrAsp: 4.87 ± 1.897
1.623ThrGlu: 1.623 ± 2.525
4.87ThrPhe: 4.87 ± 1.738
3.247ThrGly: 3.247 ± 2.336
0.0ThrHis: 0.0 ± 0.0
0.0ThrIle: 0.0 ± 0.0
4.87ThrLys: 4.87 ± 1.897
6.494ThrLeu: 6.494 ± 4.671
3.247ThrMet: 3.247 ± 2.404
0.0ThrAsn: 0.0 ± 0.0
3.247ThrPro: 3.247 ± 2.404
1.623ThrGln: 1.623 ± 1.202
1.623ThrArg: 1.623 ± 1.202
12.987ThrSer: 12.987 ± 4.309
6.494ThrThr: 6.494 ± 3.504
1.623ThrVal: 1.623 ± 1.202
1.623ThrTrp: 1.623 ± 1.202
4.87ThrTyr: 4.87 ± 1.897
0.0ThrXaa: 0.0 ± 0.0
Val
4.87ValAla: 4.87 ± 4.714
1.623ValCys: 1.623 ± 1.202
6.494ValAsp: 6.494 ± 1.625
0.0ValGlu: 0.0 ± 0.0
0.0ValPhe: 0.0 ± 0.0
8.117ValGly: 8.117 ± 2.405
1.623ValHis: 1.623 ± 2.525
1.623ValIle: 1.623 ± 1.202
1.623ValLys: 1.623 ± 1.202
6.494ValLeu: 6.494 ± 4.483
1.623ValMet: 1.623 ± 1.202
3.247ValAsn: 3.247 ± 2.237
4.87ValPro: 4.87 ± 4.714
1.623ValGln: 1.623 ± 1.119
6.494ValArg: 6.494 ± 2.75
6.494ValSer: 6.494 ± 2.75
1.623ValThr: 1.623 ± 1.202
1.623ValVal: 1.623 ± 1.202
0.0ValTrp: 0.0 ± 0.0
1.623ValTyr: 1.623 ± 1.202
0.0ValXaa: 0.0 ± 0.0
Trp
1.623TrpAla: 1.623 ± 1.119
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.623TrpGlu: 1.623 ± 1.119
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
4.87TrpLys: 4.87 ± 1.897
6.494TrpLeu: 6.494 ± 1.981
0.0TrpMet: 0.0 ± 0.0
1.623TrpAsn: 1.623 ± 1.202
3.247TrpPro: 3.247 ± 2.404
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
3.247TrpSer: 3.247 ± 5.05
0.0TrpThr: 0.0 ± 0.0
1.623TrpVal: 1.623 ± 2.525
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.623TyrAla: 1.623 ± 1.202
0.0TyrCys: 0.0 ± 0.0
3.247TyrAsp: 3.247 ± 0.991
0.0TyrGlu: 0.0 ± 0.0
3.247TyrPhe: 3.247 ± 0.991
0.0TyrGly: 0.0 ± 0.0
1.623TyrHis: 1.623 ± 1.202
3.247TyrIle: 3.247 ± 2.237
1.623TyrLys: 1.623 ± 1.202
6.494TyrLeu: 6.494 ± 4.068
1.623TyrMet: 1.623 ± 1.119
0.0TyrAsn: 0.0 ± 0.0
1.623TyrPro: 1.623 ± 1.119
1.623TyrGln: 1.623 ± 1.119
0.0TyrArg: 0.0 ± 0.0
6.494TyrSer: 6.494 ± 1.981
1.623TyrThr: 1.623 ± 2.525
0.0TyrVal: 0.0 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (617 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski