Amino acid dipeptide frequency for Red clover powdery mildew-associated totivirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones in blue.
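As a minimal sketch of how such frequencies might be computed (this is not the Proteome-pI implementation; the function name `dipeptide_frequencies` and the toy sequences are hypothetical), the Python snippet below counts overlapping residue pairs, maps any non-standard letter to X (Xaa), and compares the result with the uniform expectation of 1/400. It assumes that pairs never span two proteins; the database may count them differently.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Pairs are overlapping and (by assumption) never span two proteins.
    Any letter outside STANDARD is treated as 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation with 20 residues: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (len(STANDARD) ** 2)

# Toy input, for illustration only (not the totivirus proteins).
freqs = dipeptide_frequencies(["MAAL", "ALLB"])
overrepresented = sorted(p for p, f in freqs.items() if f > EXPECTED_PER_MILLE)
```

With only a handful of pairs, every observed dipeptide exceeds 2.5‰, so the comparison becomes informative only for reasonably large proteomes.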

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
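To make the units concrete, the short sketch below converts a per mille value from the table into a fraction and a percentage. The helper names are hypothetical and used only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


ala_ala = 5.402                              # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)    # roughly 0.005402
percent = per_mille_to_percent(ala_ala)      # roughly 0.5402
uniform = 1000.0 / 400                       # 2.5 per mille, the same as 0.25%
```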

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.402 ± 1.499
AlaCys: 1.35 ± 0.822
AlaAsp: 4.727 ± 0.194
AlaGlu: 2.026 ± 0.339
AlaPhe: 2.701 ± 0.75
AlaGly: 5.402 ± 0.29
AlaHis: 2.701 ± 1.644
AlaIle: 2.701 ± 0.75
AlaLys: 2.701 ± 0.145
AlaLeu: 7.427 ± 3.53
AlaMet: 4.051 ± 2.467
AlaAsn: 1.35 ± 0.072
AlaPro: 2.701 ± 0.145
AlaGln: 4.051 ± 0.677
AlaArg: 6.752 ± 4.111
AlaSer: 6.077 ± 1.91
AlaThr: 8.103 ± 2.249
AlaVal: 8.103 ± 0.435
AlaTrp: 2.026 ± 0.556
AlaTyr: 2.026 ± 1.233
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.026 ± 0.339
CysCys: 0.675 ± 0.484
CysAsp: 0.675 ± 0.411
CysGlu: 1.35 ± 0.822
CysPhe: 0.0 ± 0.0
CysGly: 2.701 ± 1.934
CysHis: 0.675 ± 0.484
CysIle: 0.675 ± 0.484
CysLys: 0.0 ± 0.0
CysLeu: 2.026 ± 1.451
CysMet: 0.0 ± 0.0
CysAsn: 0.675 ± 0.411
CysPro: 1.35 ± 0.967
CysGln: 0.675 ± 0.484
CysArg: 0.675 ± 0.484
CysSer: 2.701 ± 0.75
CysThr: 0.675 ± 0.411
CysVal: 0.675 ± 0.411
CysTrp: 0.0 ± 0.0
CysTyr: 0.675 ± 0.411
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.376 ± 1.161
AspCys: 0.675 ± 0.484
AspAsp: 3.376 ± 1.161
AspGlu: 2.701 ± 0.145
AspPhe: 3.376 ± 0.266
AspGly: 3.376 ± 0.266
AspHis: 0.675 ± 0.411
AspIle: 4.051 ± 0.677
AspLys: 4.051 ± 0.217
AspLeu: 7.427 ± 0.846
AspMet: 2.026 ± 0.556
AspAsn: 1.35 ± 0.967
AspPro: 2.026 ± 0.339
AspGln: 0.675 ± 0.484
AspArg: 2.026 ± 0.339
AspSer: 1.35 ± 0.822
AspThr: 2.701 ± 1.04
AspVal: 6.752 ± 1.257
AspTrp: 1.35 ± 0.822
AspTyr: 1.35 ± 0.072
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.402 ± 1.499
GluCys: 1.35 ± 0.822
GluAsp: 5.402 ± 0.605
GluGlu: 4.051 ± 0.217
GluPhe: 2.701 ± 0.145
GluGly: 4.051 ± 1.112
GluHis: 2.701 ± 1.644
GluIle: 2.026 ± 0.339
GluLys: 2.701 ± 0.75
GluLeu: 4.051 ± 1.112
GluMet: 1.35 ± 0.638
GluAsn: 0.675 ± 0.411
GluPro: 1.35 ± 0.072
GluGln: 0.675 ± 0.484
GluArg: 3.376 ± 1.523
GluSer: 5.402 ± 1.185
GluThr: 3.376 ± 0.266
GluVal: 4.051 ± 0.217
GluTrp: 2.026 ± 0.339
GluTyr: 2.701 ± 1.934
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.701 ± 0.145
PheCys: 1.35 ± 0.072
PheAsp: 3.376 ± 1.523
PheGlu: 4.051 ± 0.677
PhePhe: 0.0 ± 0.0
PheGly: 0.675 ± 0.411
PheHis: 0.0 ± 0.0
PheIle: 4.051 ± 1.112
PheLys: 0.675 ± 0.411
PheLeu: 3.376 ± 1.161
PheMet: 0.0 ± 0.0
PheAsn: 1.35 ± 0.072
PhePro: 1.35 ± 0.072
PheGln: 0.0 ± 0.0
PheArg: 3.376 ± 0.266
PheSer: 4.051 ± 0.217
PheThr: 2.701 ± 0.75
PheVal: 2.026 ± 0.339
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.727 ± 0.701
GlyCys: 0.0 ± 0.0
GlyAsp: 4.727 ± 0.701
GlyGlu: 2.026 ± 1.233
GlyPhe: 2.026 ± 0.339
GlyGly: 3.376 ± 0.266
GlyHis: 0.675 ± 0.484
GlyIle: 3.376 ± 0.629
GlyLys: 4.051 ± 0.677
GlyLeu: 1.35 ± 0.072
GlyMet: 3.376 ± 1.523
GlyAsn: 7.427 ± 1.741
GlyPro: 0.0 ± 0.0
GlyGln: 2.701 ± 1.04
GlyArg: 2.026 ± 0.556
GlySer: 4.727 ± 1.596
GlyThr: 1.35 ± 0.822
GlyVal: 7.427 ± 0.049
GlyTrp: 0.675 ± 0.484
GlyTyr: 2.026 ± 0.339
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.026 ± 1.233
HisCys: 0.0 ± 0.0
HisAsp: 0.675 ± 0.411
HisGlu: 1.35 ± 0.072
HisPhe: 0.0 ± 0.0
HisGly: 0.675 ± 0.411
HisHis: 2.026 ± 0.339
HisIle: 1.35 ± 0.072
HisLys: 2.701 ± 0.75
HisLeu: 1.35 ± 0.072
HisMet: 0.0 ± 0.0
HisAsn: 1.35 ± 0.072
HisPro: 0.675 ± 0.411
HisGln: 0.0 ± 0.0
HisArg: 2.026 ± 1.233
HisSer: 4.051 ± 0.217
HisThr: 0.675 ± 0.411
HisVal: 5.402 ± 0.605
HisTrp: 0.0 ± 0.0
HisTyr: 1.35 ± 0.822
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.727 ± 1.088
IleCys: 0.0 ± 0.0
IleAsp: 4.051 ± 0.677
IleGlu: 4.727 ± 0.194
IlePhe: 1.35 ± 0.822
IleGly: 2.701 ± 1.04
IleHis: 1.35 ± 0.072
IleIle: 2.026 ± 0.556
IleLys: 4.727 ± 0.701
IleLeu: 4.727 ± 0.701
IleMet: 0.675 ± 0.411
IleAsn: 4.727 ± 1.596
IlePro: 3.376 ± 0.629
IleGln: 1.35 ± 0.072
IleArg: 1.35 ± 0.822
IleSer: 2.026 ± 0.339
IleThr: 0.675 ± 0.411
IleVal: 4.051 ± 0.217
IleTrp: 0.0 ± 0.0
IleTyr: 2.026 ± 1.233
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.35 ± 0.072
LysCys: 1.35 ± 0.822
LysAsp: 2.026 ± 1.233
LysGlu: 4.727 ± 1.088
LysPhe: 3.376 ± 0.266
LysGly: 2.026 ± 1.451
LysHis: 2.701 ± 0.145
LysIle: 2.026 ± 0.339
LysLys: 2.026 ± 0.339
LysLeu: 6.752 ± 1.257
LysMet: 2.701 ± 0.145
LysAsn: 2.026 ± 0.339
LysPro: 1.35 ± 0.822
LysGln: 2.701 ± 1.644
LysArg: 4.727 ± 0.701
LysSer: 3.376 ± 0.266
LysThr: 1.35 ± 0.072
LysVal: 2.026 ± 1.233
LysTrp: 0.675 ± 0.484
LysTyr: 2.701 ± 0.145
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.103 ± 1.354
LeuCys: 4.051 ± 2.007
LeuAsp: 1.35 ± 0.967
LeuGlu: 4.051 ± 1.112
LeuPhe: 0.0 ± 0.0
LeuGly: 3.376 ± 0.629
LeuHis: 1.35 ± 0.072
LeuIle: 1.35 ± 0.072
LeuLys: 6.077 ± 0.774
LeuLeu: 10.804 ± 1.475
LeuMet: 2.701 ± 0.596
LeuAsn: 6.077 ± 1.668
LeuPro: 7.427 ± 3.53
LeuGln: 1.35 ± 0.072
LeuArg: 7.427 ± 0.049
LeuSer: 4.727 ± 1.983
LeuThr: 8.778 ± 0.871
LeuVal: 5.402 ± 2.079
LeuTrp: 0.675 ± 0.411
LeuTyr: 4.051 ± 1.112
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.701 ± 1.04
MetCys: 1.35 ± 0.072
MetAsp: 2.026 ± 0.339
MetGlu: 0.675 ± 0.411
MetPhe: 0.675 ± 0.411
MetGly: 2.026 ± 0.556
MetHis: 0.675 ± 0.411
MetIle: 1.35 ± 0.822
MetLys: 1.35 ± 0.072
MetLeu: 1.35 ± 0.822
MetMet: 0.0 ± 0.0
MetAsn: 1.35 ± 0.822
MetPro: 2.701 ± 1.04
MetGln: 0.0 ± 0.0
MetArg: 2.026 ± 0.556
MetSer: 5.402 ± 0.29
MetThr: 2.026 ± 0.556
MetVal: 1.35 ± 0.822
MetTrp: 0.675 ± 0.411
MetTyr: 2.701 ± 0.75
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.376 ± 2.055
AsnCys: 0.675 ± 0.411
AsnAsp: 1.35 ± 0.072
AsnGlu: 2.701 ± 1.934
AsnPhe: 1.35 ± 0.072
AsnGly: 3.376 ± 0.266
AsnHis: 0.675 ± 0.484
AsnIle: 3.376 ± 1.523
AsnLys: 2.701 ± 0.75
AsnLeu: 4.051 ± 2.902
AsnMet: 0.0 ± 0.0
AsnAsn: 0.0 ± 0.0
AsnPro: 2.026 ± 1.451
AsnGln: 0.675 ± 0.411
AsnArg: 3.376 ± 0.266
AsnSer: 2.701 ± 1.04
AsnThr: 1.35 ± 0.072
AsnVal: 3.376 ± 0.266
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.701 ± 0.145
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.402 ± 0.29
ProCys: 0.0 ± 0.0
ProAsp: 2.701 ± 1.04
ProGlu: 4.727 ± 2.49
ProPhe: 2.701 ± 0.145
ProGly: 1.35 ± 0.822
ProHis: 0.675 ± 0.484
ProIle: 4.051 ± 1.112
ProLys: 3.376 ± 1.523
ProLeu: 4.051 ± 1.112
ProMet: 1.35 ± 0.072
ProAsn: 0.0 ± 0.0
ProPro: 3.376 ± 0.629
ProGln: 0.675 ± 0.484
ProArg: 0.0 ± 0.0
ProSer: 1.35 ± 0.822
ProThr: 0.675 ± 0.484
ProVal: 5.402 ± 0.29
ProTrp: 0.0 ± 0.0
ProTyr: 0.675 ± 0.411
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.402 ± 1.499
GlnCys: 0.0 ± 0.0
GlnAsp: 1.35 ± 0.072
GlnGlu: 0.675 ± 0.411
GlnPhe: 0.675 ± 0.484
GlnGly: 1.35 ± 0.072
GlnHis: 2.026 ± 0.339
GlnIle: 2.701 ± 0.75
GlnLys: 0.0 ± 0.0
GlnLeu: 2.701 ± 0.75
GlnMet: 1.35 ± 0.072
GlnAsn: 0.675 ± 0.411
GlnPro: 1.35 ± 0.967
GlnGln: 1.35 ± 0.967
GlnArg: 3.376 ± 0.266
GlnSer: 2.701 ± 1.934
GlnThr: 3.376 ± 0.629
GlnVal: 0.0 ± 0.0
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.026 ± 0.556
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.752 ± 3.216
ArgCys: 1.35 ± 0.967
ArgAsp: 2.026 ± 0.339
ArgGlu: 3.376 ± 1.161
ArgPhe: 5.402 ± 2.079
ArgGly: 3.376 ± 0.629
ArgHis: 3.376 ± 1.161
ArgIle: 2.701 ± 0.75
ArgLys: 0.0 ± 0.0
ArgLeu: 5.402 ± 1.499
ArgMet: 0.675 ± 0.411
ArgAsn: 2.701 ± 0.75
ArgPro: 0.0 ± 0.0
ArgGln: 6.077 ± 0.121
ArgArg: 3.376 ± 0.266
ArgSer: 4.727 ± 0.701
ArgThr: 5.402 ± 0.29
ArgVal: 4.727 ± 0.194
ArgTrp: 0.675 ± 0.411
ArgTyr: 2.701 ± 0.145
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.402 ± 1.499
SerCys: 1.35 ± 0.967
SerAsp: 5.402 ± 1.185
SerGlu: 4.051 ± 1.112
SerPhe: 0.675 ± 0.484
SerGly: 4.051 ± 2.007
SerHis: 0.675 ± 0.484
SerIle: 4.727 ± 1.088
SerLys: 3.376 ± 0.629
SerLeu: 6.752 ± 0.532
SerMet: 4.051 ± 1.572
SerAsn: 0.675 ± 0.411
SerPro: 1.35 ± 0.967
SerGln: 4.051 ± 0.677
SerArg: 4.727 ± 0.194
SerSer: 2.026 ± 0.556
SerThr: 6.752 ± 0.532
SerVal: 6.752 ± 0.532
SerTrp: 2.026 ± 1.451
SerTyr: 0.675 ± 0.411
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.077 ± 0.121
ThrCys: 1.35 ± 0.822
ThrAsp: 2.701 ± 0.145
ThrGlu: 4.051 ± 1.112
ThrPhe: 2.026 ± 1.233
ThrGly: 2.026 ± 0.339
ThrHis: 1.35 ± 0.822
ThrIle: 2.701 ± 0.75
ThrLys: 3.376 ± 0.266
ThrLeu: 4.727 ± 1.088
ThrMet: 3.376 ± 1.523
ThrAsn: 0.675 ± 0.484
ThrPro: 3.376 ± 1.161
ThrGln: 1.35 ± 0.967
ThrArg: 4.051 ± 1.572
ThrSer: 3.376 ± 0.266
ThrThr: 3.376 ± 0.266
ThrVal: 5.402 ± 0.605
ThrTrp: 2.026 ± 0.556
ThrTyr: 2.026 ± 0.556
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.376 ± 1.523
ValCys: 1.35 ± 0.967
ValAsp: 4.051 ± 1.572
ValGlu: 6.752 ± 0.532
ValPhe: 5.402 ± 0.605
ValGly: 6.077 ± 1.016
ValHis: 2.701 ± 1.644
ValIle: 3.376 ± 1.523
ValLys: 4.051 ± 2.467
ValLeu: 6.752 ± 0.532
ValMet: 2.701 ± 0.145
ValAsn: 3.376 ± 1.523
ValPro: 4.051 ± 0.217
ValGln: 3.376 ± 0.629
ValArg: 3.376 ± 0.629
ValSer: 4.051 ± 0.217
ValThr: 4.051 ± 0.217
ValVal: 5.402 ± 0.605
ValTrp: 1.35 ± 0.967
ValTyr: 3.376 ± 1.161
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.701 ± 0.145
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.35 ± 0.072
TrpPhe: 0.675 ± 0.484
TrpGly: 1.35 ± 0.072
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 1.35 ± 0.967
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 2.026 ± 0.339
TrpGln: 0.675 ± 0.411
TrpArg: 1.35 ± 0.072
TrpSer: 0.675 ± 0.484
TrpThr: 1.35 ± 0.072
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.675 ± 0.484
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.35 ± 0.822
TyrCys: 0.675 ± 0.484
TyrAsp: 2.701 ± 1.04
TyrGlu: 0.675 ± 0.411
TyrPhe: 0.0 ± 0.0
TyrGly: 4.727 ± 0.194
TyrHis: 0.675 ± 0.411
TyrIle: 2.026 ± 1.233
TyrLys: 4.051 ± 1.572
TyrLeu: 2.701 ± 1.04
TyrMet: 1.35 ± 0.072
TyrAsn: 3.376 ± 0.629
TyrPro: 0.675 ± 0.484
TyrGln: 0.675 ± 0.411
TyrArg: 5.402 ± 0.605
TyrSer: 4.051 ± 1.112
TyrThr: 0.675 ± 0.411
TyrVal: 0.675 ± 0.411
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.675 ± 0.484
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1482 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
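One way such an error estimate could be obtained is sketched below. This is an illustration rather than the database's actual procedure: it assumes that "×100" means 100 replicates, that each replicate resamples whole proteins with replacement, and that the reported error is the standard deviation across replicates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement
    (an assumed reading of "bootstrapping at the protein level").
    """
    rng = random.Random(seed)
    values = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_frequency(sample, pair))
    return statistics.pstdev(values)


# Toy sequences, for illustration only.
toy = ["MAALLA", "GGAGG"]
error = bootstrap_error(toy, "AA")
absent = bootstrap_error(toy, "WW")  # never observed, so every replicate gives 0
```

Under these assumptions, a dipeptide that never occurs yields zero in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table. With only two proteins, each replicate can take just a few distinct values, so the estimate is coarse.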

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski