Amino acid dipeptide frequency for Maize streak virus genotype A (isolate Nigeria) (MSV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, with every amino acid equally likely, each of the 400 dipeptides would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
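The counting described above can be sketched in Python. This is a minimal illustration, not the database's own code: it assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that one-letter codes are used, and that any letter outside the 20 standard ones is mapped to X (Xaa). The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Expected frequency if all 400 standard dipeptides were equally likely.
uniform_per_mille = 1000.0 / len(STANDARD) ** 2  # 2.5 per mille = 0.25%

toy = ["MKKLA", "AKK"]  # hypothetical sequences, not MSV proteins
freqs = dipeptide_per_mille(toy)
over = sorted(dp for dp, f in freqs.items() if f > uniform_per_mille)
```

Dipeptides whose frequency exceeds `uniform_per_mille` would correspond to the overrepresented entries and those below it to the underrepresented ones, under the uniform-expectation assumption stated in the text.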

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
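As a small worked example of the unit conversion, the sketch below converts the SerSer value from the table (13.32‰) to a fraction and a percentage, and compares it with the 0.25% (2.5‰) expected for a uniformly random dipeptide. The helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ser_ser = 13.32          # SerSer frequency from the table, in per mille
uniform = 2.5            # 0.25% expressed in per mille

fraction = per_mille_to_fraction(ser_ser)   # about 0.01332
percent = fraction * 100                    # about 1.332 %
fold_over_uniform = ser_ser / uniform       # about 5.3 times the uniform value
```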

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.074 ± 2.072
AlaCys: 3.074 ± 0.462
AlaAsp: 2.049 ± 1.763
AlaGlu: 0.0 ± 0.0
AlaPhe: 0.0 ± 0.0
AlaGly: 3.074 ± 2.644
AlaHis: 3.074 ± 0.462
AlaIle: 5.123 ± 1.35
AlaLys: 2.049 ± 0.772
AlaLeu: 5.123 ± 2.108
AlaMet: 0.0 ± 0.744
AlaAsn: 6.148 ± 2.474
AlaPro: 4.098 ± 2.916
AlaGln: 1.025 ± 0.871
AlaArg: 5.123 ± 0.715
AlaSer: 4.098 ± 1.074
AlaThr: 3.074 ± 0.462
AlaVal: 4.098 ± 1.262
AlaTrp: 1.025 ± 0.881
AlaTyr: 2.049 ± 0.954
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.049 ± 0.772
CysCys: 0.0 ± 0.0
CysAsp: 1.025 ± 0.881
CysGlu: 0.0 ± 0.0
CysPhe: 1.025 ± 1.23
CysGly: 0.0 ± 0.0
CysHis: 1.025 ± 0.881
CysIle: 1.025 ± 0.74
CysLys: 1.025 ± 0.881
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 2.049 ± 0.772
CysPro: 3.074 ± 1.237
CysGln: 3.074 ± 1.237
CysArg: 1.025 ± 0.881
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 1.025 ± 0.74
CysTyr: 1.025 ± 0.871
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.074 ± 0.462
AspCys: 0.0 ± 0.0
AspAsp: 2.049 ± 0.889
AspGlu: 5.123 ± 1.452
AspPhe: 1.025 ± 0.74
AspGly: 4.098 ± 1.176
AspHis: 0.0 ± 0.0
AspIle: 6.148 ± 1.275
AspLys: 0.0 ± 0.0
AspLeu: 5.123 ± 1.309
AspMet: 0.0 ± 0.0
AspAsn: 0.0 ± 0.0
AspPro: 1.025 ± 1.23
AspGln: 1.025 ± 0.871
AspArg: 3.074 ± 1.198
AspSer: 6.148 ± 1.609
AspThr: 2.049 ± 1.763
AspVal: 0.0 ± 0.0
AspTrp: 6.148 ± 2.474
AspTyr: 3.074 ± 0.462
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.148 ± 2.316
GluCys: 0.0 ± 0.0
GluAsp: 3.074 ± 1.237
GluGlu: 5.123 ± 1.55
GluPhe: 2.049 ± 0.772
GluGly: 1.025 ± 0.881
GluHis: 0.0 ± 0.0
GluIle: 6.148 ± 2.316
GluLys: 5.123 ± 1.924
GluLeu: 7.172 ± 0.882
GluMet: 1.025 ± 0.74
GluAsn: 3.074 ± 1.47
GluPro: 3.074 ± 1.47
GluGln: 0.0 ± 0.0
GluArg: 3.074 ± 1.237
GluSer: 3.074 ± 1.47
GluThr: 2.049 ± 1.763
GluVal: 3.074 ± 1.167
GluTrp: 0.0 ± 0.0
GluTyr: 5.123 ± 1.924
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.025 ± 0.881
PheCys: 1.025 ± 0.74
PheAsp: 4.098 ± 1.544
PheGlu: 5.123 ± 1.924
PhePhe: 2.049 ± 0.772
PheGly: 1.025 ± 1.23
PheHis: 3.074 ± 0.462
PheIle: 3.074 ± 1.237
PheLys: 2.049 ± 0.889
PheLeu: 4.098 ± 1.544
PheMet: 0.0 ± 0.0
PheAsn: 1.025 ± 0.881
PhePro: 2.049 ± 0.772
PheGln: 2.049 ± 0.772
PheArg: 0.0 ± 0.0
PheSer: 2.049 ± 0.772
PheThr: 4.098 ± 1.176
PheVal: 4.098 ± 2.916
PheTrp: 0.0 ± 0.0
PheTyr: 1.025 ± 1.23
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.049 ± 1.763
GlyCys: 1.025 ± 0.74
GlyAsp: 1.025 ± 0.881
GlyGlu: 2.049 ± 1.165
GlyPhe: 2.049 ± 0.772
GlyGly: 5.123 ± 3.472
GlyHis: 0.0 ± 0.0
GlyIle: 0.0 ± 0.0
GlyLys: 4.098 ± 1.887
GlyLeu: 3.074 ± 2.644
GlyMet: 1.025 ± 1.031
GlyAsn: 4.098 ± 2.83
GlyPro: 3.074 ± 1.167
GlyGln: 4.098 ± 1.751
GlyArg: 2.049 ± 1.458
GlySer: 5.123 ± 2.027
GlyThr: 5.123 ± 1.35
GlyVal: 6.148 ± 3.315
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 5.123 ± 0.918
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 1.025 ± 0.881
HisGly: 1.025 ± 0.881
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 3.074 ± 0.462
HisLeu: 2.049 ± 0.772
HisMet: 0.0 ± 0.0
HisAsn: 1.025 ± 0.74
HisPro: 4.098 ± 1.544
HisGln: 1.025 ± 0.871
HisArg: 3.074 ± 0.462
HisSer: 1.025 ± 0.74
HisThr: 1.025 ± 0.881
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.098 ± 1.176
IleCys: 1.025 ± 1.23
IleAsp: 1.025 ± 0.881
IleGlu: 0.0 ± 0.0
IlePhe: 4.098 ± 0.903
IleGly: 1.025 ± 0.881
IleHis: 0.0 ± 0.0
IleIle: 4.098 ± 1.887
IleLys: 0.0 ± 0.0
IleLeu: 7.172 ± 2.315
IleMet: 3.074 ± 1.237
IleAsn: 5.123 ± 0.918
IlePro: 8.197 ± 1.719
IleGln: 8.197 ± 1.231
IleArg: 0.0 ± 0.0
IleSer: 5.123 ± 2.18
IleThr: 1.025 ± 0.881
IleVal: 3.074 ± 2.221
IleTrp: 2.049 ± 0.772
IleTyr: 3.074 ± 1.378
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.074 ± 2.072
LysCys: 2.049 ± 0.772
LysAsp: 5.123 ± 2.18
LysGlu: 9.221 ± 3.409
LysPhe: 2.049 ± 0.889
LysGly: 0.0 ± 0.0
LysHis: 0.0 ± 0.0
LysIle: 1.025 ± 0.881
LysLys: 8.197 ± 2.724
LysLeu: 4.098 ± 1.544
LysMet: 0.0 ± 0.0
LysAsn: 1.025 ± 0.881
LysPro: 7.172 ± 1.05
LysGln: 2.049 ± 1.481
LysArg: 6.148 ± 5.288
LysSer: 9.221 ± 2.47
LysThr: 0.0 ± 0.0
LysVal: 2.049 ± 0.889
LysTrp: 1.025 ± 0.74
LysTyr: 3.074 ± 1.237
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.074 ± 1.237
LeuCys: 5.123 ± 0.918
LeuAsp: 0.0 ± 0.0
LeuGlu: 2.049 ± 0.772
LeuPhe: 3.074 ± 0.462
LeuGly: 5.123 ± 0.918
LeuHis: 4.098 ± 1.544
LeuIle: 4.098 ± 3.729
LeuLys: 5.123 ± 1.35
LeuLeu: 7.172 ± 2.928
LeuMet: 0.0 ± 0.0
LeuAsn: 0.0 ± 0.0
LeuPro: 3.074 ± 1.237
LeuGln: 10.246 ± 2.625
LeuArg: 1.025 ± 1.23
LeuSer: 3.074 ± 1.167
LeuThr: 4.098 ± 1.176
LeuVal: 5.123 ± 0.715
LeuTrp: 1.025 ± 1.23
LeuTyr: 5.123 ± 2.564
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.049 ± 0.772
MetCys: 0.0 ± 0.0
MetAsp: 2.049 ± 1.493
MetGlu: 1.025 ± 0.881
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 5.123 ± 0.918
MetLys: 2.049 ± 0.772
MetLeu: 1.025 ± 0.871
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 2.049 ± 0.772
MetSer: 2.049 ± 0.889
MetThr: 1.025 ± 0.74
MetVal: 1.025 ± 0.881
MetTrp: 1.025 ± 0.881
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.098 ± 0.903
AsnCys: 1.025 ± 0.74
AsnAsp: 2.049 ± 0.772
AsnGlu: 3.074 ± 1.237
AsnPhe: 0.0 ± 0.0
AsnGly: 3.074 ± 0.462
AsnHis: 0.0 ± 0.0
AsnIle: 7.172 ± 0.882
AsnLys: 2.049 ± 0.772
AsnLeu: 1.025 ± 0.871
AsnMet: 1.025 ± 0.881
AsnAsn: 1.025 ± 0.74
AsnPro: 7.172 ± 2.315
AsnGln: 1.025 ± 0.881
AsnArg: 5.123 ± 0.918
AsnSer: 1.025 ± 0.74
AsnThr: 4.098 ± 1.176
AsnVal: 2.049 ± 0.889
AsnTrp: 1.025 ± 0.881
AsnTyr: 1.025 ± 0.74
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.049 ± 1.458
ProCys: 2.049 ± 0.889
ProAsp: 3.074 ± 0.462
ProGlu: 10.246 ± 4.361
ProPhe: 6.148 ± 1.737
ProGly: 6.148 ± 2.396
ProHis: 2.049 ± 0.772
ProIle: 1.025 ± 1.23
ProLys: 5.123 ± 1.924
ProLeu: 2.049 ± 0.772
ProMet: 0.0 ± 0.0
ProAsn: 5.123 ± 1.55
ProPro: 6.148 ± 1.141
ProGln: 2.049 ± 1.458
ProArg: 3.074 ± 1.167
ProSer: 9.221 ± 2.213
ProThr: 7.172 ± 1.877
ProVal: 2.049 ± 0.772
ProTrp: 1.025 ± 1.23
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.049 ± 1.458
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 4.098 ± 2.29
GlnPhe: 2.049 ± 0.772
GlnGly: 2.049 ± 2.46
GlnHis: 1.025 ± 0.881
GlnIle: 1.025 ± 0.881
GlnLys: 3.074 ± 1.237
GlnLeu: 2.049 ± 1.481
GlnMet: 4.098 ± 1.133
GlnAsn: 2.049 ± 1.165
GlnPro: 4.098 ± 1.698
GlnGln: 1.025 ± 0.871
GlnArg: 2.049 ± 0.889
GlnSer: 4.098 ± 1.887
GlnThr: 5.123 ± 2.027
GlnVal: 1.025 ± 0.871
GlnTrp: 2.049 ± 0.954
GlnTyr: 2.049 ± 0.772
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.049 ± 0.954
ArgCys: 0.0 ± 0.0
ArgAsp: 6.148 ± 1.737
ArgGlu: 3.074 ± 0.462
ArgPhe: 3.074 ± 0.462
ArgGly: 5.123 ± 2.027
ArgHis: 1.025 ± 0.881
ArgIle: 3.074 ± 0.462
ArgLys: 4.098 ± 0.782
ArgLeu: 1.025 ± 0.881
ArgMet: 0.0 ± 0.0
ArgAsn: 3.074 ± 0.462
ArgPro: 1.025 ± 0.881
ArgGln: 3.074 ± 1.167
ArgArg: 1.025 ± 0.881
ArgSer: 5.123 ± 2.601
ArgThr: 2.049 ± 0.889
ArgVal: 5.123 ± 2.108
ArgTrp: 3.074 ± 0.462
ArgTyr: 1.025 ± 0.881
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.148 ± 1.609
SerCys: 0.0 ± 0.0
SerAsp: 10.246 ± 1.97
SerGlu: 3.074 ± 0.462
SerPhe: 1.025 ± 1.23
SerGly: 3.074 ± 2.072
SerHis: 5.123 ± 2.18
SerIle: 2.049 ± 0.772
SerLys: 8.197 ± 1.46
SerLeu: 5.123 ± 1.452
SerMet: 3.074 ± 1.128
SerAsn: 6.148 ± 2.093
SerPro: 7.172 ± 2.659
SerGln: 1.025 ± 1.23
SerArg: 6.148 ± 0.835
SerSer: 13.32 ± 3.908
SerThr: 6.148 ± 0.835
SerVal: 4.098 ± 1.074
SerTrp: 1.025 ± 0.881
SerTyr: 1.025 ± 0.74
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.049 ± 1.458
ThrCys: 1.025 ± 0.871
ThrAsp: 2.049 ± 1.763
ThrGlu: 4.098 ± 1.262
ThrPhe: 5.123 ± 0.918
ThrGly: 4.098 ± 1.751
ThrHis: 0.0 ± 0.0
ThrIle: 2.049 ± 0.889
ThrLys: 3.074 ± 0.462
ThrLeu: 4.098 ± 2.424
ThrMet: 1.025 ± 0.881
ThrAsn: 2.049 ± 0.772
ThrPro: 3.074 ± 1.608
ThrGln: 1.025 ± 0.881
ThrArg: 2.049 ± 0.889
ThrSer: 7.172 ± 2.008
ThrThr: 2.049 ± 1.763
ThrVal: 1.025 ± 0.881
ThrTrp: 2.049 ± 0.889
ThrTyr: 4.098 ± 1.176
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.074 ± 2.55
ValCys: 1.025 ± 0.881
ValAsp: 3.074 ± 1.516
ValGlu: 0.0 ± 0.0
ValPhe: 2.049 ± 0.772
ValGly: 5.123 ± 1.714
ValHis: 2.049 ± 1.481
ValIle: 1.025 ± 0.881
ValLys: 3.074 ± 0.462
ValLeu: 2.049 ± 2.46
ValMet: 1.025 ± 0.881
ValAsn: 3.074 ± 1.237
ValPro: 4.098 ± 3.729
ValGln: 1.025 ± 0.74
ValArg: 7.172 ± 1.632
ValSer: 3.074 ± 1.601
ValThr: 2.049 ± 1.763
ValVal: 3.074 ± 0.462
ValTrp: 0.0 ± 0.0
ValTyr: 1.025 ± 0.881
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.025 ± 0.74
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 2.049 ± 0.772
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 2.049 ± 0.772
TrpLys: 4.098 ± 2.436
TrpLeu: 4.098 ± 1.176
TrpMet: 2.049 ± 0.772
TrpAsn: 0.0 ± 0.0
TrpPro: 1.025 ± 0.881
TrpGln: 1.025 ± 0.74
TrpArg: 0.0 ± 0.0
TrpSer: 5.123 ± 0.715
TrpThr: 0.0 ± 0.0
TrpVal: 1.025 ± 1.23
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.025 ± 0.74
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.025 ± 0.881
TyrCys: 0.0 ± 0.0
TyrAsp: 1.025 ± 0.881
TyrGlu: 0.0 ± 0.0
TyrPhe: 5.123 ± 1.117
TyrGly: 1.025 ± 0.74
TyrHis: 1.025 ± 0.881
TyrIle: 6.148 ± 2.474
TyrLys: 1.025 ± 0.881
TyrLeu: 4.098 ± 2.27
TyrMet: 2.049 ± 0.954
TyrAsn: 2.049 ± 1.481
TyrPro: 3.074 ± 0.462
TyrGln: 1.025 ± 1.23
TyrArg: 0.0 ± 0.0
TyrSer: 4.098 ± 0.782
TyrThr: 1.025 ± 0.871
TyrVal: 0.0 ± 0.0
TyrTrp: 1.025 ± 0.74
TyrTyr: 1.025 ± 1.23
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (977 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
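One way such a protein-level bootstrap could work is sketched below. The page does not describe the exact procedure, so this is an assumption: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the sample standard deviation of those values is taken as the error. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (same number as the input).
        sample = [rng.choice(proteins) for _ in proteins]
        replicates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(replicates)


toy = ["MSSKA", "ASSSL", "MKLV", "SSA"]  # hypothetical sequences, one-letter codes
point = dipeptide_freq(toy, "SS")
error = bootstrap_error(toy, "SS")
```

Under this scheme, a dipeptide that occurs in none of the proteins has a frequency of zero in every resample, so its estimated error is zero as well.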

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski