Amino acid dipeptide frequency for Red clover powdery mildew-associated totivirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
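As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues and reports each pair in per mille. It is not the code behind this page: the function name `dipeptide_permille` and the toy sequences are hypothetical, the sketch uses one-letter codes while the table uses three-letter codes, and it assumes pairs are counted within each protein (how the database treats protein boundaries is not stated here).

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of width 2 along the sequence; windows overlap,
        # so a protein of length n contributes n - 1 dipeptides.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_permille(["MKVLAAGK", "AAGL"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The two toy sequences contribute 7 and 3 pairs, so a pair seen twice (such as AA) comes out at 200‰.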

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰); multiply a value by 10⁻³ to obtain the corresponding fraction.
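The unit conversion and the over/under-representation rule can be sketched as follows. The threshold of 1000/400 = 2.5‰ follows from the 0.25% figure above; whether the page colours cells by exactly this comparison is an assumption, and the names `permille_to_fraction` and `classify` are hypothetical.

```python
STANDARD_PAIRS = 20 * 20                      # 400 dipeptides of the 20 standard amino acids
EXPECTED_PERMILLE = 1000.0 / STANDARD_PAIRS   # 2.5 per mille, i.e. 0.25%


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"      # would be marked in red
    if value_permille < expected:
        return "under"     # would be marked in blue
    return "expected"


# Two values taken from the table: LeuAla and AlaCys.
print(classify(10.332), permille_to_fraction(10.332))
print(classify(0.738), permille_to_fraction(0.738))
```

If Xaa were counted as a 21st residue, the uniform expectation would instead be 1000/441, roughly 2.27‰; that value can be passed through the `expected` parameter.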

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.952 ± 1.712
AlaCys: 0.738 ± 0.573
AlaAsp: 4.428 ± 0.567
AlaGlu: 4.428 ± 1.435
AlaPhe: 0.738 ± 0.573
AlaGly: 2.214 ± 0.718
AlaHis: 0.738 ± 0.573
AlaIle: 2.952 ± 1.291
AlaLys: 3.69 ± 2.14
AlaLeu: 7.38 ± 0.277
AlaMet: 1.476 ± 0.312
AlaAsn: 6.642 ± 0.85
AlaPro: 2.952 ± 1.291
AlaGln: 7.38 ± 1.725
AlaArg: 4.428 ± 1.568
AlaSer: 0.0 ± 0.0
AlaThr: 5.166 ± 0.995
AlaVal: 6.642 ± 3.853
AlaTrp: 1.476 ± 0.856
AlaTyr: 2.952 ± 0.29
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.476 ± 0.856
CysCys: 0.0 ± 0.0
CysAsp: 2.214 ± 1.284
CysGlu: 0.0 ± 0.0
CysPhe: 0.738 ± 0.573
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 2.214 ± 0.718
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.738 ± 0.428
CysGln: 0.0 ± 0.0
CysArg: 0.738 ± 0.573
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 2.214 ± 0.718
CysTrp: 0.0 ± 0.0
CysTyr: 1.476 ± 0.145
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.428 ± 0.567
AspCys: 0.738 ± 0.573
AspAsp: 7.38 ± 1.725
AspGlu: 5.166 ± 0.006
AspPhe: 5.166 ± 1.996
AspGly: 6.642 ± 1.152
AspHis: 0.0 ± 0.0
AspIle: 5.166 ± 0.006
AspLys: 2.214 ± 0.283
AspLeu: 5.166 ± 0.995
AspMet: 2.952 ± 0.29
AspAsn: 3.69 ± 0.139
AspPro: 0.738 ± 0.573
AspGln: 2.952 ± 0.29
AspArg: 2.214 ± 0.718
AspSer: 5.904 ± 0.579
AspThr: 3.69 ± 0.862
AspVal: 4.428 ± 0.434
AspTrp: 2.214 ± 0.283
AspTyr: 0.738 ± 0.428
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.166 ± 1.007
GluCys: 0.0 ± 0.0
GluAsp: 0.738 ± 0.573
GluGlu: 2.214 ± 0.283
GluPhe: 0.738 ± 0.428
GluGly: 4.428 ± 1.568
GluHis: 0.0 ± 0.0
GluIle: 2.952 ± 1.291
GluLys: 1.476 ± 0.145
GluLeu: 6.642 ± 1.851
GluMet: 2.214 ± 0.283
GluAsn: 2.214 ± 0.283
GluPro: 2.214 ± 1.719
GluGln: 2.214 ± 0.283
GluArg: 3.69 ± 1.863
GluSer: 3.69 ± 0.862
GluThr: 5.166 ± 1.007
GluVal: 5.166 ± 2.008
GluTrp: 2.214 ± 0.283
GluTyr: 0.738 ± 0.428
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.738 ± 0.573
PheCys: 1.476 ± 0.856
PheAsp: 1.476 ± 0.145
PheGlu: 2.214 ± 0.718
PhePhe: 0.0 ± 0.0
PheGly: 2.952 ± 0.29
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 1.476 ± 1.146
PheLeu: 2.952 ± 0.29
PheMet: 1.476 ± 0.145
PheAsn: 3.69 ± 0.139
PhePro: 0.738 ± 0.573
PheGln: 0.0 ± 0.0
PheArg: 2.952 ± 0.29
PheSer: 3.69 ± 0.862
PheThr: 3.69 ± 0.862
PheVal: 1.476 ± 0.145
PheTrp: 0.0 ± 0.0
PheTyr: 0.738 ± 0.573
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.952 ± 0.711
GlyCys: 0.738 ± 0.428
GlyAsp: 8.118 ± 1.706
GlyGlu: 4.428 ± 1.435
GlyPhe: 0.738 ± 0.573
GlyGly: 5.904 ± 0.579
GlyHis: 1.476 ± 1.146
GlyIle: 2.952 ± 0.29
GlyLys: 1.476 ± 0.856
GlyLeu: 7.38 ± 0.277
GlyMet: 3.69 ± 1.14
GlyAsn: 2.952 ± 1.712
GlyPro: 0.0 ± 0.0
GlyGln: 0.738 ± 0.428
GlyArg: 1.476 ± 0.856
GlySer: 2.214 ± 0.283
GlyThr: 3.69 ± 0.862
GlyVal: 5.904 ± 1.58
GlyTrp: 0.738 ± 0.428
GlyTyr: 2.952 ± 0.29
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.738 ± 0.573
HisCys: 0.0 ± 0.0
HisAsp: 1.476 ± 0.856
HisGlu: 0.0 ± 0.0
HisPhe: 0.738 ± 0.573
HisGly: 2.214 ± 0.283
HisHis: 1.476 ± 0.145
HisIle: 1.476 ± 0.145
HisLys: 2.952 ± 1.712
HisLeu: 0.738 ± 0.573
HisMet: 2.214 ± 0.283
HisAsn: 0.738 ± 0.428
HisPro: 1.476 ± 1.146
HisGln: 0.0 ± 0.0
HisArg: 2.952 ± 0.711
HisSer: 2.952 ± 0.711
HisThr: 1.476 ± 1.146
HisVal: 0.738 ± 0.428
HisTrp: 0.0 ± 0.0
HisTyr: 1.476 ± 0.145
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.166 ± 2.997
IleCys: 1.476 ± 0.856
IleAsp: 2.952 ± 0.29
IleGlu: 4.428 ± 1.568
IlePhe: 2.214 ± 0.718
IleGly: 4.428 ± 0.567
IleHis: 0.738 ± 0.428
IleIle: 3.69 ± 0.139
IleLys: 6.642 ± 1.851
IleLeu: 3.69 ± 1.863
IleMet: 1.476 ± 0.856
IleAsn: 4.428 ± 0.434
IlePro: 2.214 ± 1.284
IleGln: 0.0 ± 0.0
IleArg: 3.69 ± 1.14
IleSer: 3.69 ± 1.14
IleThr: 7.38 ± 3.727
IleVal: 1.476 ± 0.145
IleTrp: 0.738 ± 0.573
IleTyr: 0.738 ± 0.573
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.642 ± 1.851
LysCys: 0.0 ± 0.0
LysAsp: 2.952 ± 0.29
LysGlu: 2.952 ± 1.291
LysPhe: 0.738 ± 0.573
LysGly: 1.476 ± 0.856
LysHis: 3.69 ± 1.14
LysIle: 4.428 ± 0.567
LysLys: 5.904 ± 0.422
LysLeu: 4.428 ± 2.569
LysMet: 2.952 ± 1.712
LysAsn: 2.952 ± 1.291
LysPro: 2.214 ± 0.283
LysGln: 2.952 ± 0.711
LysArg: 2.952 ± 0.711
LysSer: 3.69 ± 1.14
LysThr: 2.952 ± 0.711
LysVal: 5.904 ± 1.58
LysTrp: 0.738 ± 0.428
LysTyr: 2.214 ± 0.283
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.332 ± 1.014
LeuCys: 0.738 ± 0.428
LeuAsp: 8.118 ± 2.298
LeuGlu: 6.642 ± 0.151
LeuPhe: 4.428 ± 0.567
LeuGly: 2.952 ± 0.29
LeuHis: 3.69 ± 0.139
LeuIle: 5.904 ± 1.423
LeuLys: 5.166 ± 0.006
LeuLeu: 7.38 ± 3.727
LeuMet: 4.428 ± 0.434
LeuAsn: 3.69 ± 1.14
LeuPro: 2.214 ± 0.718
LeuGln: 1.476 ± 0.145
LeuArg: 7.38 ± 1.725
LeuSer: 2.214 ± 1.719
LeuThr: 2.214 ± 1.284
LeuVal: 2.952 ± 0.711
LeuTrp: 0.738 ± 0.428
LeuTyr: 5.166 ± 0.006
LeuXaa: 0.0 ± 0.0
Met
MetAla: 7.38 ± 0.277
MetCys: 0.0 ± 0.0
MetAsp: 2.214 ± 0.283
MetGlu: 0.0 ± 0.0
MetPhe: 1.476 ± 0.145
MetGly: 2.952 ± 0.711
MetHis: 3.69 ± 1.14
MetIle: 2.214 ± 0.283
MetLys: 2.952 ± 0.711
MetLeu: 4.428 ± 1.568
MetMet: 1.476 ± 0.856
MetAsn: 1.476 ± 0.145
MetPro: 1.476 ± 0.856
MetGln: 2.952 ± 0.29
MetArg: 2.214 ± 1.284
MetSer: 0.738 ± 0.573
MetThr: 4.428 ± 0.567
MetVal: 0.738 ± 0.573
MetTrp: 0.0 ± 0.0
MetTyr: 2.214 ± 0.718
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.214 ± 1.719
AsnCys: 0.738 ± 0.573
AsnAsp: 3.69 ± 1.863
AsnGlu: 2.952 ± 0.29
AsnPhe: 0.738 ± 0.573
AsnGly: 2.952 ± 0.711
AsnHis: 0.0 ± 0.0
AsnIle: 5.166 ± 0.006
AsnLys: 5.166 ± 0.995
AsnLeu: 2.214 ± 0.718
AsnMet: 2.952 ± 1.712
AsnAsn: 1.476 ± 0.856
AsnPro: 0.738 ± 0.573
AsnGln: 0.738 ± 0.573
AsnArg: 2.952 ± 0.711
AsnSer: 3.69 ± 1.14
AsnThr: 2.952 ± 0.29
AsnVal: 4.428 ± 1.568
AsnTrp: 0.738 ± 0.573
AsnTyr: 1.476 ± 0.856
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.738 ± 0.573
ProCys: 0.738 ± 0.573
ProAsp: 2.214 ± 0.283
ProGlu: 1.476 ± 0.145
ProPhe: 2.214 ± 0.718
ProGly: 1.476 ± 0.145
ProHis: 0.738 ± 0.573
ProIle: 0.0 ± 0.0
ProLys: 2.952 ± 0.711
ProLeu: 0.738 ± 0.573
ProMet: 0.0 ± 0.0
ProAsn: 1.476 ± 0.145
ProPro: 0.738 ± 0.428
ProGln: 0.0 ± 0.0
ProArg: 2.214 ± 1.719
ProSer: 5.166 ± 0.006
ProThr: 1.476 ± 0.856
ProVal: 4.428 ± 0.434
ProTrp: 0.738 ± 0.428
ProTyr: 2.214 ± 1.719
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.214 ± 1.284
GlnCys: 0.0 ± 0.0
GlnAsp: 1.476 ± 1.146
GlnGlu: 1.476 ± 0.145
GlnPhe: 0.738 ± 0.573
GlnGly: 0.0 ± 0.0
GlnHis: 2.214 ± 0.283
GlnIle: 2.214 ± 0.718
GlnLys: 1.476 ± 0.145
GlnLeu: 3.69 ± 0.862
GlnMet: 0.738 ± 0.428
GlnAsn: 1.476 ± 1.146
GlnPro: 1.476 ± 0.145
GlnGln: 0.0 ± 0.0
GlnArg: 1.476 ± 0.856
GlnSer: 1.476 ± 0.145
GlnThr: 2.214 ± 1.284
GlnVal: 2.214 ± 0.718
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.214 ± 1.284
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.952 ± 0.711
ArgCys: 0.0 ± 0.0
ArgAsp: 2.214 ± 1.284
ArgGlu: 2.214 ± 0.718
ArgPhe: 1.476 ± 1.146
ArgGly: 2.952 ± 0.29
ArgHis: 2.214 ± 0.283
ArgIle: 7.38 ± 2.279
ArgLys: 1.476 ± 0.856
ArgLeu: 8.118 ± 0.705
ArgMet: 3.69 ± 1.863
ArgAsn: 2.214 ± 1.284
ArgPro: 2.952 ± 0.29
ArgGln: 2.214 ± 0.283
ArgArg: 2.214 ± 1.284
ArgSer: 7.38 ± 0.724
ArgThr: 3.69 ± 1.14
ArgVal: 1.476 ± 0.856
ArgTrp: 0.738 ± 0.573
ArgTyr: 1.476 ± 1.146
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.952 ± 1.291
SerCys: 2.214 ± 0.718
SerAsp: 3.69 ± 1.14
SerGlu: 5.166 ± 0.006
SerPhe: 2.952 ± 1.291
SerGly: 4.428 ± 0.434
SerHis: 1.476 ± 0.145
SerIle: 0.738 ± 0.428
SerLys: 5.166 ± 2.008
SerLeu: 3.69 ± 1.863
SerMet: 3.69 ± 1.14
SerAsn: 1.476 ± 0.145
SerPro: 1.476 ± 0.145
SerGln: 2.214 ± 0.283
SerArg: 2.214 ± 1.284
SerSer: 2.952 ± 1.291
SerThr: 2.952 ± 0.711
SerVal: 7.38 ± 1.278
SerTrp: 2.214 ± 0.283
SerTyr: 0.738 ± 0.573
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.952 ± 0.29
ThrCys: 1.476 ± 0.145
ThrAsp: 2.952 ± 0.29
ThrGlu: 0.738 ± 0.573
ThrPhe: 1.476 ± 0.145
ThrGly: 4.428 ± 0.434
ThrHis: 0.0 ± 0.0
ThrIle: 3.69 ± 1.14
ThrLys: 2.952 ± 1.712
ThrLeu: 5.166 ± 2.008
ThrMet: 3.69 ± 1.14
ThrAsn: 3.69 ± 1.863
ThrPro: 4.428 ± 0.567
ThrGln: 0.738 ± 0.428
ThrArg: 5.904 ± 0.579
ThrSer: 3.69 ± 0.139
ThrThr: 2.952 ± 0.29
ThrVal: 5.166 ± 1.996
ThrTrp: 2.214 ± 0.718
ThrTyr: 3.69 ± 1.863
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.476 ± 0.856
ValCys: 0.738 ± 0.428
ValAsp: 8.856 ± 0.132
ValGlu: 3.69 ± 0.139
ValPhe: 2.952 ± 0.29
ValGly: 2.952 ± 0.711
ValHis: 1.476 ± 0.856
ValIle: 5.904 ± 1.423
ValLys: 8.856 ± 0.132
ValLeu: 8.118 ± 1.297
ValMet: 2.952 ± 1.291
ValAsn: 2.214 ± 0.718
ValPro: 2.214 ± 0.718
ValGln: 0.738 ± 0.428
ValArg: 3.69 ± 0.139
ValSer: 3.69 ± 0.139
ValThr: 5.166 ± 0.006
ValVal: 2.952 ± 0.711
ValTrp: 0.0 ± 0.0
ValTyr: 3.69 ± 1.14
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.952 ± 0.711
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.738 ± 0.428
TrpPhe: 0.738 ± 0.428
TrpGly: 2.214 ± 1.284
TrpHis: 0.0 ± 0.0
TrpIle: 0.738 ± 0.573
TrpLys: 0.0 ± 0.0
TrpLeu: 0.738 ± 0.573
TrpMet: 1.476 ± 0.145
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.476 ± 0.856
TrpSer: 1.476 ± 1.146
TrpThr: 0.0 ± 0.0
TrpVal: 1.476 ± 0.145
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.476 ± 0.145
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.952 ± 1.291
TyrCys: 0.0 ± 0.0
TyrAsp: 4.428 ± 1.435
TyrGlu: 2.952 ± 0.29
TyrPhe: 0.738 ± 0.573
TyrGly: 2.952 ± 0.29
TyrHis: 2.214 ± 0.718
TyrIle: 3.69 ± 1.14
TyrLys: 0.738 ± 0.573
TyrLeu: 2.952 ± 0.711
TyrMet: 0.738 ± 0.701
TyrAsn: 1.476 ± 0.145
TyrPro: 0.738 ± 0.573
TyrGln: 1.476 ± 0.145
TyrArg: 2.214 ± 0.283
TyrSer: 1.476 ± 0.145
TyrThr: 0.738 ± 0.573
TyrVal: 5.166 ± 0.006
TyrTrp: 0.0 ± 0.0
TyrTyr: 3.69 ± 1.863
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1356 amino acids)
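With so few residues, every table entry corresponds to a small whole number of occurrences. The sketch below shows the arithmetic; the denominator of 1355 (1356 residues minus one) is an inference from the table values rather than a documented detail, and `permille_for_count` is a hypothetical helper name.

```python
# Assumption: 1355 adjacent pairs in total (1356 residues minus 1).
# This is inferred from the table values, not stated by the page.
TOTAL_PAIRS = 1355


def permille_for_count(n, total=TOTAL_PAIRS):
    """Per mille frequency of a dipeptide that occurs n times."""
    return 1000.0 * n / total


# Occurrence counts that would yield values seen in the table.
for n in (1, 2, 4, 14):
    print(n, round(permille_for_count(n), 3))
```

Under this assumption, one occurrence gives 0.738‰ (for example AlaCys) and fourteen give 10.332‰ (LeuAla, the largest value in the table).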

Note: The error has been estimated by bootstrapping (×100) at the protein level.
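A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is a minimal illustration under stated assumptions, not the database's implementation; the helper names, the fixed seed, and the use of the population standard deviation as the error measure are all assumptions.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency over protein sets resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is reported as a standard deviation.
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
print(bootstrap_error(["MKVLAAGK", "AAGLLLLL"], "AA"))
```

With only two proteins there are few distinct resamples, so the resulting error estimates are coarse; a dipeptide absent from every protein always yields 0.0 ± 0.0, as in the table.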

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski