Amino acid dipeptide frequency for Ustilaginoidea virens partitivirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
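As an illustration of the calculation described above, the following minimal sketch counts overlapping dipeptides in a set of protein sequences and reports them in per mille. The function name `dipeptide_permille` and the toy sequences are hypothetical and are not taken from Proteome-pI; the exact counting convention used by the database may differ.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return overlapping dipeptide frequencies (per mille) for a list of sequences.

    Hypothetical helper for illustration only; dipeptides are counted
    within each sequence and never across sequence boundaries.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (not the real proteome): 4 + 3 = 7 dipeptides in total.
toy_sequences = ["MKLLA", "LLRA"]
toy_freqs = dipeptide_permille(toy_sequences)

# Uniform expectation for 400 equally likely dipeptides, in per mille.
uniform_expectation = 1000.0 / 400
```

In the toy input, `LL` occurs twice among seven dipeptides, far above the uniform expectation of 2.5‰, which is the kind of deviation the table highlights.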

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
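The unit conversion can be sketched as follows. The helper names are hypothetical, and the comparison against a uniform 2.5‰ baseline is an assumption based on the 400-dipeptide expectation stated on this page; the page does not specify the exact threshold behind the red and blue marking.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille (assumed baseline).
EXPECTED_PERMILLE = 1000.0 / 400


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def representation(value_permille, expected=EXPECTED_PERMILLE):
    """Label a value relative to the assumed uniform baseline."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table on this page.
leu_arg = 15.504  # LeuArg
ala_ile = 0.0     # AlaIle

leu_arg_fraction = permille_to_fraction(leu_arg)
leu_arg_label = representation(leu_arg)
ala_ile_label = representation(ala_ile)
```

Under this assumption, LeuArg (15.504‰, a fraction of about 0.0155) would count as overrepresented and AlaIle (0.0‰) as underrepresented.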

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.967 ± 1.249
AlaCys: 1.107 ± 0.707
AlaAsp: 4.43 ± 0.582
AlaGlu: 1.107 ± 0.998
AlaPhe: 4.43 ± 0.582
AlaGly: 4.43 ± 0.582
AlaHis: 1.107 ± 0.998
AlaIle: 0.0 ± 0.0
AlaLys: 2.215 ± 1.414
AlaLeu: 4.43 ± 0.582
AlaMet: 2.215 ± 0.291
AlaAsn: 0.0 ± 0.0
AlaPro: 5.537 ± 1.579
AlaGln: 1.107 ± 0.707
AlaArg: 4.43 ± 3.992
AlaSer: 3.322 ± 1.289
AlaThr: 4.43 ± 0.582
AlaVal: 5.537 ± 0.126
AlaTrp: 1.107 ± 0.707
AlaTyr: 4.43 ± 2.287
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.107 ± 0.707
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 2.215 ± 1.414
CysSer: 1.107 ± 0.707
CysThr: 1.107 ± 0.707
CysVal: 2.215 ± 1.414
CysTrp: 0.0 ± 0.0
CysTyr: 1.107 ± 0.707
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.322 ± 2.121
AspCys: 1.107 ± 0.707
AspAsp: 3.322 ± 0.416
AspGlu: 4.43 ± 0.582
AspPhe: 3.322 ± 1.289
AspGly: 4.43 ± 2.287
AspHis: 0.0 ± 0.0
AspIle: 4.43 ± 1.123
AspLys: 3.322 ± 0.416
AspLeu: 2.215 ± 0.291
AspMet: 0.0 ± 0.0
AspAsn: 1.107 ± 0.707
AspPro: 3.322 ± 0.416
AspGln: 1.107 ± 0.707
AspArg: 0.0 ± 0.0
AspSer: 7.752 ± 1.54
AspThr: 5.537 ± 1.831
AspVal: 7.752 ± 0.165
AspTrp: 1.107 ± 0.998
AspTyr: 2.215 ± 1.414
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.215 ± 0.291
GluCys: 0.0 ± 0.0
GluAsp: 3.322 ± 0.416
GluGlu: 2.215 ± 0.291
GluPhe: 3.322 ± 0.416
GluGly: 2.215 ± 0.291
GluHis: 2.215 ± 0.291
GluIle: 2.215 ± 1.996
GluLys: 2.215 ± 0.291
GluLeu: 6.645 ± 0.872
GluMet: 1.107 ± 0.707
GluAsn: 1.107 ± 0.707
GluPro: 2.215 ± 0.291
GluGln: 1.107 ± 0.998
GluArg: 4.43 ± 0.582
GluSer: 2.215 ± 1.996
GluThr: 2.215 ± 0.291
GluVal: 2.215 ± 1.996
GluTrp: 2.215 ± 1.414
GluTyr: 3.322 ± 0.416
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.215 ± 0.291
PheCys: 0.0 ± 0.0
PheAsp: 5.537 ± 1.831
PheGlu: 3.322 ± 1.289
PhePhe: 3.322 ± 2.121
PheGly: 8.859 ± 1.163
PheHis: 2.215 ± 1.414
PheIle: 3.322 ± 1.289
PheLys: 5.537 ± 3.536
PheLeu: 3.322 ± 1.289
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 2.215 ± 1.414
PheGln: 3.322 ± 0.416
PheArg: 1.107 ± 0.707
PheSer: 8.859 ± 0.542
PheThr: 6.645 ± 0.872
PheVal: 1.107 ± 0.998
PheTrp: 0.0 ± 0.0
PheTyr: 2.215 ± 1.996
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.322 ± 1.289
GlyCys: 0.0 ± 0.0
GlyAsp: 5.537 ± 1.831
GlyGlu: 2.215 ± 0.291
GlyPhe: 5.537 ± 0.126
GlyGly: 3.322 ± 0.416
GlyHis: 1.107 ± 0.998
GlyIle: 1.107 ± 0.707
GlyLys: 3.322 ± 1.289
GlyLeu: 12.182 ± 0.958
GlyMet: 2.215 ± 0.291
GlyAsn: 1.107 ± 0.998
GlyPro: 2.215 ± 1.414
GlyGln: 1.107 ± 0.998
GlyArg: 3.322 ± 2.121
GlySer: 7.752 ± 1.87
GlyThr: 1.107 ± 0.707
GlyVal: 6.645 ± 0.872
GlyTrp: 0.0 ± 0.0
GlyTyr: 5.537 ± 1.831
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.107 ± 0.707
HisCys: 0.0 ± 0.0
HisAsp: 1.107 ± 0.707
HisGlu: 1.107 ± 0.998
HisPhe: 4.43 ± 1.123
HisGly: 1.107 ± 0.707
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 1.107 ± 0.998
HisLeu: 2.215 ± 0.291
HisMet: 0.0 ± 0.0
HisAsn: 1.107 ± 0.998
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 1.107 ± 0.707
HisSer: 0.0 ± 0.0
HisThr: 3.322 ± 0.416
HisVal: 2.215 ± 1.414
HisTrp: 1.107 ± 0.707
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.322 ± 2.994
IleCys: 1.107 ± 0.707
IleAsp: 3.322 ± 0.416
IleGlu: 2.215 ± 0.291
IlePhe: 1.107 ± 0.707
IleGly: 2.215 ± 1.996
IleHis: 2.215 ± 0.291
IleIle: 2.215 ± 1.414
IleLys: 2.215 ± 1.414
IleLeu: 7.752 ± 3.245
IleMet: 0.0 ± 0.0
IleAsn: 2.215 ± 0.291
IlePro: 6.645 ± 2.577
IleGln: 2.215 ± 0.291
IleArg: 3.322 ± 0.416
IleSer: 1.107 ± 0.707
IleThr: 3.322 ± 2.121
IleVal: 2.215 ± 0.291
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.107 ± 0.707
LysCys: 0.0 ± 0.0
LysAsp: 1.107 ± 0.998
LysGlu: 4.43 ± 0.582
LysPhe: 2.215 ± 1.996
LysGly: 2.215 ± 1.414
LysHis: 3.322 ± 0.416
LysIle: 0.0 ± 0.0
LysLys: 1.107 ± 0.707
LysLeu: 4.43 ± 1.123
LysMet: 0.0 ± 0.0
LysAsn: 2.215 ± 0.291
LysPro: 0.0 ± 0.0
LysGln: 2.215 ± 1.414
LysArg: 4.43 ± 1.123
LysSer: 6.645 ± 2.538
LysThr: 6.645 ± 0.833
LysVal: 0.0 ± 0.0
LysTrp: 0.0 ± 0.0
LysTyr: 1.107 ± 0.998
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.859 ± 1.163
LeuCys: 0.0 ± 0.0
LeuAsp: 5.537 ± 0.126
LeuGlu: 5.537 ± 3.284
LeuPhe: 6.645 ± 4.282
LeuGly: 3.322 ± 2.121
LeuHis: 1.107 ± 0.707
LeuIle: 6.645 ± 2.538
LeuLys: 4.43 ± 1.123
LeuLeu: 12.182 ± 0.958
LeuMet: 0.0 ± 0.0
LeuAsn: 2.215 ± 0.291
LeuPro: 5.537 ± 1.831
LeuGln: 4.43 ± 0.582
LeuArg: 15.504 ± 1.375
LeuSer: 11.074 ± 1.454
LeuThr: 2.215 ± 0.291
LeuVal: 8.859 ± 1.163
LeuTrp: 1.107 ± 0.707
LeuTyr: 2.215 ± 1.414
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.107 ± 0.998
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 1.107 ± 0.707
MetIle: 0.0 ± 0.0
MetLys: 4.43 ± 2.287
MetLeu: 1.107 ± 0.707
MetMet: 1.107 ± 0.707
MetAsn: 1.107 ± 0.707
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 1.107 ± 0.998
MetThr: 2.215 ± 1.414
MetVal: 1.107 ± 0.998
MetTrp: 0.0 ± 0.0
MetTyr: 1.107 ± 0.707
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.107 ± 0.707
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 1.107 ± 0.998
AsnPhe: 4.43 ± 2.287
AsnGly: 4.43 ± 1.123
AsnHis: 1.107 ± 0.998
AsnIle: 2.215 ± 0.291
AsnLys: 1.107 ± 0.707
AsnLeu: 4.43 ± 1.123
AsnMet: 0.0 ± 0.0
AsnAsn: 1.107 ± 0.707
AsnPro: 1.107 ± 0.707
AsnGln: 2.215 ± 1.996
AsnArg: 0.0 ± 0.0
AsnSer: 1.107 ± 0.707
AsnThr: 2.215 ± 1.414
AsnVal: 2.215 ± 1.414
AsnTrp: 2.215 ± 1.414
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.43 ± 0.582
ProCys: 1.107 ± 0.707
ProAsp: 2.215 ± 1.996
ProGlu: 1.107 ± 0.707
ProPhe: 1.107 ± 0.707
ProGly: 5.537 ± 0.126
ProHis: 2.215 ± 1.414
ProIle: 3.322 ± 0.416
ProLys: 1.107 ± 0.707
ProLeu: 11.074 ± 1.454
ProMet: 0.0 ± 0.0
ProAsn: 3.322 ± 0.416
ProPro: 2.215 ± 0.291
ProGln: 0.0 ± 0.0
ProArg: 3.322 ± 1.289
ProSer: 4.43 ± 0.582
ProThr: 2.215 ± 1.414
ProVal: 4.43 ± 0.582
ProTrp: 1.107 ± 0.707
ProTyr: 1.107 ± 0.707
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.0 ± 0.0
GlnCys: 0.0 ± 0.0
GlnAsp: 2.215 ± 1.996
GlnGlu: 2.215 ± 1.414
GlnPhe: 2.215 ± 1.996
GlnGly: 1.107 ± 0.707
GlnHis: 0.0 ± 0.0
GlnIle: 3.322 ± 0.416
GlnLys: 0.0 ± 0.0
GlnLeu: 2.215 ± 0.291
GlnMet: 1.107 ± 0.998
GlnAsn: 2.215 ± 0.291
GlnPro: 2.215 ± 0.291
GlnGln: 1.107 ± 0.707
GlnArg: 4.43 ± 0.582
GlnSer: 3.322 ± 2.994
GlnThr: 4.43 ± 1.123
GlnVal: 1.107 ± 0.998
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.107 ± 0.998
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.43 ± 0.582
ArgCys: 2.215 ± 1.414
ArgAsp: 2.215 ± 1.414
ArgGlu: 2.215 ± 1.996
ArgPhe: 4.43 ± 2.828
ArgGly: 4.43 ± 0.582
ArgHis: 1.107 ± 0.707
ArgIle: 5.537 ± 0.126
ArgLys: 1.107 ± 0.707
ArgLeu: 9.967 ± 4.659
ArgMet: 1.107 ± 1.54
ArgAsn: 1.107 ± 0.998
ArgPro: 1.107 ± 0.707
ArgGln: 5.537 ± 1.579
ArgArg: 2.215 ± 1.996
ArgSer: 2.215 ± 1.414
ArgThr: 1.107 ± 0.707
ArgVal: 3.322 ± 1.289
ArgTrp: 1.107 ± 0.707
ArgTyr: 8.859 ± 2.868
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.645 ± 2.577
SerCys: 1.107 ± 0.707
SerAsp: 6.645 ± 2.577
SerGlu: 5.537 ± 1.831
SerPhe: 6.645 ± 2.538
SerGly: 7.752 ± 1.54
SerHis: 2.215 ± 1.414
SerIle: 6.645 ± 0.833
SerLys: 3.322 ± 2.994
SerLeu: 6.645 ± 2.577
SerMet: 1.107 ± 0.998
SerAsn: 3.322 ± 0.416
SerPro: 5.537 ± 0.126
SerGln: 3.322 ± 2.994
SerArg: 1.107 ± 0.707
SerSer: 2.215 ± 0.291
SerThr: 7.752 ± 3.575
SerVal: 1.107 ± 0.998
SerTrp: 0.0 ± 0.0
SerTyr: 2.215 ± 1.996
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.43 ± 0.582
ThrCys: 0.0 ± 0.0
ThrAsp: 2.215 ± 1.414
ThrGlu: 4.43 ± 0.582
ThrPhe: 4.43 ± 1.123
ThrGly: 6.645 ± 2.538
ThrHis: 0.0 ± 0.0
ThrIle: 2.215 ± 0.291
ThrLys: 1.107 ± 0.707
ThrLeu: 4.43 ± 0.582
ThrMet: 1.107 ± 0.511
ThrAsn: 3.322 ± 2.121
ThrPro: 6.645 ± 0.872
ThrGln: 2.215 ± 1.414
ThrArg: 11.074 ± 3.661
ThrSer: 5.537 ± 3.284
ThrThr: 5.537 ± 3.284
ThrVal: 2.215 ± 1.996
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.107 ± 0.707
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.322 ± 1.289
ValCys: 0.0 ± 0.0
ValAsp: 4.43 ± 1.123
ValGlu: 2.215 ± 1.414
ValPhe: 2.215 ± 1.414
ValGly: 2.215 ± 0.291
ValHis: 0.0 ± 0.0
ValIle: 3.322 ± 2.994
ValLys: 2.215 ± 0.291
ValLeu: 4.43 ± 2.287
ValMet: 1.107 ± 0.707
ValAsn: 2.215 ± 1.414
ValPro: 6.645 ± 0.833
ValGln: 1.107 ± 0.998
ValArg: 3.322 ± 2.994
ValSer: 7.752 ± 5.28
ValThr: 3.322 ± 1.289
ValVal: 1.107 ± 0.707
ValTrp: 1.107 ± 0.707
ValTyr: 4.43 ± 2.828
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 1.107 ± 0.707
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.107 ± 0.707
TrpLys: 1.107 ± 0.707
TrpLeu: 1.107 ± 0.707
TrpMet: 2.215 ± 1.414
TrpAsn: 1.107 ± 0.707
TrpPro: 0.0 ± 0.0
TrpGln: 1.107 ± 0.998
TrpArg: 1.107 ± 0.707
TrpSer: 1.107 ± 0.707
TrpThr: 1.107 ± 0.707
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.43 ± 2.287
TyrCys: 1.107 ± 0.707
TyrAsp: 5.537 ± 3.536
TyrGlu: 3.322 ± 0.416
TyrPhe: 3.322 ± 2.121
TyrGly: 4.43 ± 2.287
TyrHis: 0.0 ± 0.0
TyrIle: 1.107 ± 0.707
TyrLys: 2.215 ± 1.414
TyrLeu: 6.645 ± 2.577
TyrMet: 0.0 ± 0.0
TyrAsn: 1.107 ± 0.998
TyrPro: 2.215 ± 0.291
TyrGln: 1.107 ± 0.998
TyrArg: 0.0 ± 0.0
TyrSer: 2.215 ± 0.291
TyrThr: 2.215 ± 0.291
TyrVal: 1.107 ± 0.707
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.215 ± 0.291
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (904 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
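One way such an error estimate could be obtained is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those values is reported. This is only an assumption about what "bootstrapping at the protein level" means here; the function names, the seed, and the toy sequences are hypothetical, and the actual Proteome-pI procedure may differ.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences):
    """Overlapping dipeptide frequencies (per mille); hypothetical helper."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of a dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the input, with replacement.
        resample = [rng.choice(sequences) for _ in sequences]
        estimates.append(dipeptide_permille(resample).get(dipeptide, 0.0))
    return statistics.pstdev(estimates)


# Toy input (not the real proteome).
toy_sequences = ["MKLLA", "LLRA"]
toy_error = bootstrap_error(toy_sequences, "LL")
```

With only two proteins, as in this proteome, each resample can take just a few distinct compositions, so error estimates of this kind are necessarily coarse.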

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski