Amino acid dipepetide frequency for Cacao swollen shoot CE virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.527AlaAla: 3.527 ± 2.137
0.0AlaCys: 0.0 ± 0.0
2.205AlaAsp: 2.205 ± 0.971
4.409AlaGlu: 4.409 ± 3.133
3.968AlaPhe: 3.968 ± 2.08
0.882AlaGly: 0.882 ± 0.462
0.882AlaHis: 0.882 ± 1.079
4.85AlaIle: 4.85 ± 0.988
2.205AlaLys: 2.205 ± 0.971
5.291AlaLeu: 5.291 ± 2.278
2.646AlaMet: 2.646 ± 1.387
2.205AlaAsn: 2.205 ± 1.478
0.441AlaPro: 0.441 ± 0.231
4.409AlaGln: 4.409 ± 2.697
3.527AlaArg: 3.527 ± 1.077
3.968AlaSer: 3.968 ± 3.898
3.527AlaThr: 3.527 ± 2.507
3.968AlaVal: 3.968 ± 1.001
0.0AlaTrp: 0.0 ± 0.0
4.409AlaTyr: 4.409 ± 2.181
0.0AlaXaa: 0.0 ± 0.0
Cys
0.882CysAla: 0.882 ± 0.462
0.882CysCys: 0.882 ± 0.462
0.441CysAsp: 0.441 ± 0.231
0.441CysGlu: 0.441 ± 0.231
0.882CysPhe: 0.882 ± 0.462
0.441CysGly: 0.441 ± 0.231
0.882CysHis: 0.882 ± 0.462
1.323CysIle: 1.323 ± 0.693
2.205CysLys: 2.205 ± 1.156
0.441CysLeu: 0.441 ± 0.231
0.441CysMet: 0.441 ± 0.231
1.323CysAsn: 1.323 ± 0.693
0.882CysPro: 0.882 ± 0.462
1.764CysGln: 1.764 ± 0.924
0.0CysArg: 0.0 ± 0.0
0.441CysSer: 0.441 ± 0.231
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.882CysTyr: 0.882 ± 0.462
0.0CysXaa: 0.0 ± 0.0
Asp
2.205AspAla: 2.205 ± 1.156
0.441AspCys: 0.441 ± 0.231
3.086AspAsp: 3.086 ± 1.618
4.409AspGlu: 4.409 ± 1.51
0.441AspPhe: 0.441 ± 0.231
3.527AspGly: 3.527 ± 1.163
1.764AspHis: 1.764 ± 0.924
3.086AspIle: 3.086 ± 1.618
3.968AspLys: 3.968 ± 1.341
5.291AspLeu: 5.291 ± 0.758
0.882AspMet: 0.882 ± 0.462
3.968AspAsn: 3.968 ± 0.985
3.527AspPro: 3.527 ± 1.996
2.646AspGln: 2.646 ± 0.998
2.646AspArg: 2.646 ± 2.143
1.764AspSer: 1.764 ± 0.998
3.086AspThr: 3.086 ± 1.618
0.441AspVal: 0.441 ± 0.231
0.882AspTrp: 0.882 ± 0.462
4.409AspTyr: 4.409 ± 1.449
0.0AspXaa: 0.0 ± 0.0
Glu
3.968GluAla: 3.968 ± 2.203
0.882GluCys: 0.882 ± 0.462
4.85GluAsp: 4.85 ± 1.693
11.023GluGlu: 11.023 ± 2.25
1.764GluPhe: 1.764 ± 0.924
4.85GluGly: 4.85 ± 1.693
1.764GluHis: 1.764 ± 0.924
4.409GluIle: 4.409 ± 1.219
8.377GluLys: 8.377 ± 4.271
6.173GluLeu: 6.173 ± 2.606
0.0GluMet: 0.0 ± 0.0
3.527GluAsn: 3.527 ± 2.214
0.882GluPro: 0.882 ± 1.295
3.527GluGln: 3.527 ± 1.077
3.968GluArg: 3.968 ± 2.347
5.291GluSer: 5.291 ± 1.113
3.968GluThr: 3.968 ± 1.001
4.409GluVal: 4.409 ± 1.51
1.764GluTrp: 1.764 ± 0.886
1.764GluTyr: 1.764 ± 0.886
0.0GluXaa: 0.0 ± 0.0
Phe
2.205PheAla: 2.205 ± 1.156
0.882PheCys: 0.882 ± 0.462
1.764PheAsp: 1.764 ± 0.886
1.323PheGlu: 1.323 ± 0.693
0.882PhePhe: 0.882 ± 0.462
0.441PheGly: 0.441 ± 0.231
1.323PheHis: 1.323 ± 0.693
3.086PheIle: 3.086 ± 1.01
1.764PheLys: 1.764 ± 0.998
2.646PheLeu: 2.646 ± 1.387
0.441PheMet: 0.441 ± 0.231
0.882PheAsn: 0.882 ± 0.462
0.882PhePro: 0.882 ± 0.462
1.323PheGln: 1.323 ± 0.693
3.527PheArg: 3.527 ± 1.163
3.086PheSer: 3.086 ± 1.618
0.882PheThr: 0.882 ± 1.295
0.882PheVal: 0.882 ± 0.462
0.441PheTrp: 0.441 ± 0.231
1.323PheTyr: 1.323 ± 0.693
0.0PheXaa: 0.0 ± 0.0
Gly
1.764GlyAla: 1.764 ± 0.924
0.882GlyCys: 0.882 ± 0.462
3.527GlyAsp: 3.527 ± 1.849
2.646GlyGlu: 2.646 ± 1.387
1.764GlyPhe: 1.764 ± 0.998
1.764GlyGly: 1.764 ± 1.068
0.882GlyHis: 0.882 ± 0.462
3.086GlyIle: 3.086 ± 2.061
4.409GlyLys: 4.409 ± 1.942
6.173GlyLeu: 6.173 ± 1.611
0.882GlyMet: 0.882 ± 0.462
2.205GlyAsn: 2.205 ± 1.156
0.441GlyPro: 0.441 ± 0.231
0.882GlyGln: 0.882 ± 1.079
4.85GlyArg: 4.85 ± 2.542
3.086GlySer: 3.086 ± 4.252
3.527GlyThr: 3.527 ± 1.163
3.527GlyVal: 3.527 ± 1.096
0.882GlyTrp: 0.882 ± 0.462
2.646GlyTyr: 2.646 ± 1.387
0.0GlyXaa: 0.0 ± 0.0
His
0.882HisAla: 0.882 ± 0.462
0.441HisCys: 0.441 ± 0.231
0.882HisAsp: 0.882 ± 2.901
0.441HisGlu: 0.441 ± 0.231
0.441HisPhe: 0.441 ± 0.231
1.323HisGly: 1.323 ± 1.164
1.323HisHis: 1.323 ± 1.164
2.646HisIle: 2.646 ± 1.387
0.882HisLys: 0.882 ± 1.295
2.205HisLeu: 2.205 ± 1.156
0.441HisMet: 0.441 ± 0.231
3.086HisAsn: 3.086 ± 1.01
0.0HisPro: 0.0 ± 0.0
2.646HisGln: 2.646 ± 1.919
1.764HisArg: 1.764 ± 0.924
2.205HisSer: 2.205 ± 1.156
1.764HisThr: 1.764 ± 0.886
0.882HisVal: 0.882 ± 0.462
0.441HisTrp: 0.441 ± 0.231
0.441HisTyr: 0.441 ± 0.231
0.0HisXaa: 0.0 ± 0.0
Ile
3.086IleAla: 3.086 ± 1.912
1.764IleCys: 1.764 ± 0.924
4.409IleAsp: 4.409 ± 1.942
4.85IleGlu: 4.85 ± 0.955
2.646IlePhe: 2.646 ± 1.387
3.086IleGly: 3.086 ± 1.01
0.882IleHis: 0.882 ± 0.462
3.086IleIle: 3.086 ± 1.068
6.173IleLys: 6.173 ± 1.389
8.377IleLeu: 8.377 ± 3.056
0.882IleMet: 0.882 ± 0.462
1.323IleAsn: 1.323 ± 0.693
3.527IlePro: 3.527 ± 1.163
4.85IleGln: 4.85 ± 3.039
2.205IleArg: 2.205 ± 1.156
3.086IleSer: 3.086 ± 1.833
5.291IleThr: 5.291 ± 3.415
3.527IleVal: 3.527 ± 1.193
0.441IleTrp: 0.441 ± 0.231
2.205IleTyr: 2.205 ± 1.156
0.0IleXaa: 0.0 ± 0.0
Lys
4.85LysAla: 4.85 ± 0.938
0.882LysCys: 0.882 ± 0.462
5.732LysAsp: 5.732 ± 2.062
6.614LysGlu: 6.614 ± 2.797
2.646LysPhe: 2.646 ± 1.387
3.527LysGly: 3.527 ± 1.996
3.086LysHis: 3.086 ± 1.01
3.527LysIle: 3.527 ± 1.066
4.85LysLys: 4.85 ± 0.938
5.732LysLeu: 5.732 ± 5.961
2.205LysMet: 2.205 ± 1.156
2.205LysAsn: 2.205 ± 0.87
3.086LysPro: 3.086 ± 1.618
3.086LysGln: 3.086 ± 2.508
5.291LysArg: 5.291 ± 1.883
4.409LysSer: 4.409 ± 1.9
4.409LysThr: 4.409 ± 2.052
3.527LysVal: 3.527 ± 2.896
0.441LysTrp: 0.441 ± 0.231
1.764LysTyr: 1.764 ± 0.924
0.0LysXaa: 0.0 ± 0.0
Leu
6.173LeuAla: 6.173 ± 6.237
1.764LeuCys: 1.764 ± 0.924
3.527LeuAsp: 3.527 ± 1.077
8.377LeuGlu: 8.377 ± 5.238
2.646LeuPhe: 2.646 ± 1.017
5.732LeuGly: 5.732 ± 0.528
2.205LeuHis: 2.205 ± 3.042
3.527LeuIle: 3.527 ± 1.066
5.732LeuLys: 5.732 ± 1.912
6.614LeuLeu: 6.614 ± 2.511
1.764LeuMet: 1.764 ± 0.877
3.086LeuAsn: 3.086 ± 1.076
6.173LeuPro: 6.173 ± 1.233
6.614LeuGln: 6.614 ± 1.699
4.85LeuArg: 4.85 ± 0.955
7.055LeuSer: 7.055 ± 2.386
6.614LeuThr: 6.614 ± 2.232
6.173LeuVal: 6.173 ± 1.588
0.882LeuTrp: 0.882 ± 0.462
4.85LeuTyr: 4.85 ± 1.026
0.0LeuXaa: 0.0 ± 0.0
Met
0.882MetAla: 0.882 ± 0.462
0.882MetCys: 0.882 ± 0.462
0.882MetAsp: 0.882 ± 0.462
2.646MetGlu: 2.646 ± 1.017
1.323MetPhe: 1.323 ± 0.693
0.882MetGly: 0.882 ± 1.079
0.441MetHis: 0.441 ± 0.231
1.323MetIle: 1.323 ± 0.693
1.764MetLys: 1.764 ± 0.924
1.764MetLeu: 1.764 ± 0.924
0.882MetMet: 0.882 ± 0.462
0.882MetAsn: 0.882 ± 0.462
2.646MetPro: 2.646 ± 1.387
1.323MetGln: 1.323 ± 0.693
0.441MetArg: 0.441 ± 0.231
1.323MetSer: 1.323 ± 0.96
3.086MetThr: 3.086 ± 1.076
2.205MetVal: 2.205 ± 1.156
0.0MetTrp: 0.0 ± 0.0
0.441MetTyr: 0.441 ± 0.231
0.0MetXaa: 0.0 ± 0.0
Asn
1.323AsnAla: 1.323 ± 0.693
0.0AsnCys: 0.0 ± 0.0
1.323AsnAsp: 1.323 ± 0.693
2.646AsnGlu: 2.646 ± 1.387
0.441AsnPhe: 0.441 ± 0.231
2.205AsnGly: 2.205 ± 0.971
0.882AsnHis: 0.882 ± 0.462
1.323AsnIle: 1.323 ± 1.164
4.409AsnLys: 4.409 ± 1.51
6.614AsnLeu: 6.614 ± 2.524
1.764AsnMet: 1.764 ± 0.924
2.205AsnAsn: 2.205 ± 1.478
0.882AsnPro: 0.882 ± 0.462
2.205AsnGln: 2.205 ± 2.029
1.764AsnArg: 1.764 ± 1.657
3.086AsnSer: 3.086 ± 1.174
2.205AsnThr: 2.205 ± 1.017
2.646AsnVal: 2.646 ± 1.315
0.882AsnTrp: 0.882 ± 1.295
1.764AsnTyr: 1.764 ± 0.924
0.0AsnXaa: 0.0 ± 0.0
Pro
3.527ProAla: 3.527 ± 1.193
0.0ProCys: 0.0 ± 0.0
2.205ProAsp: 2.205 ± 1.156
2.646ProGlu: 2.646 ± 0.998
0.882ProPhe: 0.882 ± 0.462
1.323ProGly: 1.323 ± 0.693
1.323ProHis: 1.323 ± 0.693
1.323ProIle: 1.323 ± 0.693
4.409ProLys: 4.409 ± 1.489
4.409ProLeu: 4.409 ± 1.489
1.323ProMet: 1.323 ± 0.693
2.646ProAsn: 2.646 ± 0.998
3.968ProPro: 3.968 ± 1.955
4.409ProGln: 4.409 ± 2.311
1.323ProArg: 1.323 ± 1.967
2.646ProSer: 2.646 ± 1.017
1.764ProThr: 1.764 ± 1.068
3.086ProVal: 3.086 ± 1.068
0.441ProTrp: 0.441 ± 0.231
1.323ProTyr: 1.323 ± 1.967
0.0ProXaa: 0.0 ± 0.0
Gln
3.968GlnAla: 3.968 ± 1.45
0.441GlnCys: 0.441 ± 0.231
3.086GlnAsp: 3.086 ± 1.233
5.291GlnGlu: 5.291 ± 2.897
1.764GlnPhe: 1.764 ± 0.924
2.646GlnGly: 2.646 ± 2.328
0.882GlnHis: 0.882 ± 0.462
4.85GlnIle: 4.85 ± 1.026
3.968GlnLys: 3.968 ± 3.131
6.173GlnLeu: 6.173 ± 2.413
1.764GlnMet: 1.764 ± 0.773
3.527GlnAsn: 3.527 ± 1.096
5.291GlnPro: 5.291 ± 1.688
7.496GlnGln: 7.496 ± 0.405
3.968GlnArg: 3.968 ± 1.45
2.646GlnSer: 2.646 ± 0.914
3.527GlnThr: 3.527 ± 1.163
1.764GlnVal: 1.764 ± 0.998
1.323GlnTrp: 1.323 ± 0.693
0.882GlnTyr: 0.882 ± 0.462
0.0GlnXaa: 0.0 ± 0.0
Arg
3.086ArgAla: 3.086 ± 1.233
0.441ArgCys: 0.441 ± 0.231
3.527ArgAsp: 3.527 ± 1.145
1.323ArgGlu: 1.323 ± 0.96
0.882ArgPhe: 0.882 ± 0.462
4.409ArgGly: 4.409 ± 1.449
0.441ArgHis: 0.441 ± 0.231
4.409ArgIle: 4.409 ± 2.311
3.527ArgLys: 3.527 ± 3.007
7.496ArgLeu: 7.496 ± 2.174
3.527ArgMet: 3.527 ± 1.849
2.646ArgAsn: 2.646 ± 1.017
1.764ArgPro: 1.764 ± 0.924
3.527ArgGln: 3.527 ± 2.137
3.527ArgArg: 3.527 ± 1.145
2.646ArgSer: 2.646 ± 1.387
2.646ArgThr: 2.646 ± 1.396
3.968ArgVal: 3.968 ± 2.05
1.764ArgTrp: 1.764 ± 0.886
0.882ArgTyr: 0.882 ± 0.462
0.0ArgXaa: 0.0 ± 0.0
Ser
3.086SerAla: 3.086 ± 4.903
1.323SerCys: 1.323 ± 0.693
3.968SerAsp: 3.968 ± 2.08
3.968SerGlu: 3.968 ± 1.45
1.323SerPhe: 1.323 ± 0.693
3.527SerGly: 3.527 ± 1.996
1.323SerHis: 1.323 ± 0.96
5.291SerIle: 5.291 ± 1.688
3.527SerLys: 3.527 ± 3.129
6.614SerLeu: 6.614 ± 5.154
0.882SerMet: 0.882 ± 0.462
1.764SerAsn: 1.764 ± 0.886
2.205SerPro: 2.205 ± 1.156
4.409SerGln: 4.409 ± 0.938
3.968SerArg: 3.968 ± 2.08
3.968SerSer: 3.968 ± 1.955
4.85SerThr: 4.85 ± 3.569
3.086SerVal: 3.086 ± 1.01
0.441SerTrp: 0.441 ± 0.231
1.323SerTyr: 1.323 ± 0.693
0.0SerXaa: 0.0 ± 0.0
Thr
3.968ThrAla: 3.968 ± 2.79
0.882ThrCys: 0.882 ± 0.462
2.646ThrAsp: 2.646 ± 0.998
4.85ThrGlu: 4.85 ± 0.955
2.646ThrPhe: 2.646 ± 1.363
4.85ThrGly: 4.85 ± 2.542
2.646ThrHis: 2.646 ± 2.328
8.818ThrIle: 8.818 ± 3.129
2.205ThrLys: 2.205 ± 1.156
3.968ThrLeu: 3.968 ± 2.335
1.764ThrMet: 1.764 ± 0.846
0.882ThrAsn: 0.882 ± 1.079
3.086ThrPro: 3.086 ± 1.076
3.527ThrGln: 3.527 ± 1.077
2.646ThrArg: 2.646 ± 2.328
6.173ThrSer: 6.173 ± 2.466
7.496ThrThr: 7.496 ± 1.256
2.205ThrVal: 2.205 ± 0.87
0.441ThrTrp: 0.441 ± 0.231
0.882ThrTyr: 0.882 ± 1.295
0.0ThrXaa: 0.0 ± 0.0
Val
3.527ValAla: 3.527 ± 1.849
0.441ValCys: 0.441 ± 0.231
1.764ValAsp: 1.764 ± 0.924
4.409ValGlu: 4.409 ± 1.899
2.646ValPhe: 2.646 ± 0.914
1.764ValGly: 1.764 ± 0.924
1.323ValHis: 1.323 ± 0.693
2.205ValIle: 2.205 ± 1.156
4.409ValLys: 4.409 ± 3.485
3.086ValLeu: 3.086 ± 1.068
2.205ValMet: 2.205 ± 0.87
0.441ValAsn: 0.441 ± 1.34
3.086ValPro: 3.086 ± 1.068
3.968ValGln: 3.968 ± 1.45
3.968ValArg: 3.968 ± 1.001
1.764ValSer: 1.764 ± 1.657
3.968ValThr: 3.968 ± 2.05
1.764ValVal: 1.764 ± 1.728
1.323ValTrp: 1.323 ± 0.693
1.764ValTyr: 1.764 ± 0.924
0.0ValXaa: 0.0 ± 0.0
Trp
1.323TrpAla: 1.323 ± 1.164
0.0TrpCys: 0.0 ± 0.0
0.882TrpAsp: 0.882 ± 0.462
1.764TrpGlu: 1.764 ± 0.886
0.0TrpPhe: 0.0 ± 0.0
0.441TrpGly: 0.441 ± 0.231
0.441TrpHis: 0.441 ± 0.231
1.323TrpIle: 1.323 ± 0.693
1.323TrpLys: 1.323 ± 0.693
1.764TrpLeu: 1.764 ± 0.924
0.0TrpMet: 0.0 ± 0.0
0.441TrpAsn: 0.441 ± 0.231
0.0TrpPro: 0.0 ± 0.0
0.882TrpGln: 0.882 ± 0.462
0.441TrpArg: 0.441 ± 0.231
0.0TrpSer: 0.0 ± 0.0
1.764TrpThr: 1.764 ± 0.924
0.882TrpVal: 0.882 ± 0.462
0.441TrpTrp: 0.441 ± 0.231
0.441TrpTyr: 0.441 ± 1.23
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.086TyrAla: 3.086 ± 1.068
1.323TyrCys: 1.323 ± 0.693
1.764TyrAsp: 1.764 ± 1.068
3.086TyrGlu: 3.086 ± 1.01
0.0TyrPhe: 0.0 ± 0.0
2.205TyrGly: 2.205 ± 0.971
0.441TyrHis: 0.441 ± 0.231
2.205TyrIle: 2.205 ± 1.156
1.764TyrLys: 1.764 ± 0.924
3.527TyrLeu: 3.527 ± 1.077
0.882TyrMet: 0.882 ± 0.462
1.323TyrAsn: 1.323 ± 0.693
2.205TyrPro: 2.205 ± 1.017
2.205TyrGln: 2.205 ± 1.478
1.764TyrArg: 1.764 ± 0.924
2.205TyrSer: 2.205 ± 1.017
2.205TyrThr: 2.205 ± 0.971
0.882TyrVal: 0.882 ± 0.462
1.323TyrTrp: 1.323 ± 0.693
2.205TyrTyr: 2.205 ± 1.017
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2269 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski