Amino acid dipepetide frequency for Pagoda yellow mosaic associated virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.418AlaAla: 6.418 ± 1.861
1.604AlaCys: 1.604 ± 0.823
2.808AlaAsp: 2.808 ± 0.985
5.215AlaGlu: 5.215 ± 0.983
2.407AlaPhe: 2.407 ± 0.92
4.412AlaGly: 4.412 ± 2.497
0.802AlaHis: 0.802 ± 0.412
5.616AlaIle: 5.616 ± 2.703
2.407AlaLys: 2.407 ± 2.017
5.215AlaLeu: 5.215 ± 1.612
2.006AlaMet: 2.006 ± 1.029
0.401AlaAsn: 0.401 ± 0.206
4.412AlaPro: 4.412 ± 1.625
5.215AlaGln: 5.215 ± 2.017
3.61AlaArg: 3.61 ± 0.802
4.412AlaSer: 4.412 ± 1.159
5.616AlaThr: 5.616 ± 1.97
2.006AlaVal: 2.006 ± 1.185
0.802AlaTrp: 0.802 ± 0.412
2.808AlaTyr: 2.808 ± 0.985
0.0AlaXaa: 0.0 ± 0.0
Cys
2.407CysAla: 2.407 ± 0.866
0.802CysCys: 0.802 ± 0.412
0.802CysAsp: 0.802 ± 0.847
0.0CysGlu: 0.0 ± 0.0
1.203CysPhe: 1.203 ± 1.016
1.604CysGly: 1.604 ± 0.781
0.401CysHis: 0.401 ± 0.206
0.802CysIle: 0.802 ± 0.412
1.604CysLys: 1.604 ± 0.823
0.401CysLeu: 0.401 ± 0.206
0.401CysMet: 0.401 ± 0.206
0.802CysAsn: 0.802 ± 0.412
3.209CysPro: 3.209 ± 2.562
0.401CysGln: 0.401 ± 0.206
1.604CysArg: 1.604 ± 1.281
0.802CysSer: 0.802 ± 0.412
0.0CysThr: 0.0 ± 0.0
0.401CysVal: 0.401 ± 1.132
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.209AspAla: 3.209 ± 1.502
0.802AspCys: 0.802 ± 0.847
3.209AspAsp: 3.209 ± 1.124
4.412AspGlu: 4.412 ± 2.264
2.407AspPhe: 2.407 ± 1.235
2.808AspGly: 2.808 ± 1.511
1.203AspHis: 1.203 ± 0.923
4.011AspIle: 4.011 ± 1.421
2.407AspLys: 2.407 ± 0.92
4.412AspLeu: 4.412 ± 0.901
0.401AspMet: 0.401 ± 0.206
2.006AspAsn: 2.006 ± 0.784
3.209AspPro: 3.209 ± 2.111
2.407AspGln: 2.407 ± 0.866
1.604AspArg: 1.604 ± 2.249
2.407AspSer: 2.407 ± 2.032
3.209AspThr: 3.209 ± 1.036
2.407AspVal: 2.407 ± 0.919
1.604AspTrp: 1.604 ± 0.823
3.61AspTyr: 3.61 ± 0.802
0.0AspXaa: 0.0 ± 0.0
Glu
4.011GluAla: 4.011 ± 1.462
0.401GluCys: 0.401 ± 0.206
8.424GluAsp: 8.424 ± 1.346
11.231GluGlu: 11.231 ± 3.327
0.802GluPhe: 0.802 ± 1.012
4.813GluGly: 4.813 ± 0.658
2.407GluHis: 2.407 ± 1.547
2.808GluIle: 2.808 ± 1.441
3.209GluLys: 3.209 ± 0.931
7.22GluLeu: 7.22 ± 1.83
1.203GluMet: 1.203 ± 0.812
3.61GluAsn: 3.61 ± 1.571
2.407GluPro: 2.407 ± 1.235
2.006GluGln: 2.006 ± 1.029
4.011GluArg: 4.011 ± 3.61
3.61GluSer: 3.61 ± 1.192
7.22GluThr: 7.22 ± 4.41
6.017GluVal: 6.017 ± 1.7
1.203GluTrp: 1.203 ± 1.016
1.203GluTyr: 1.203 ± 1.496
0.0GluXaa: 0.0 ± 0.0
Phe
2.407PheAla: 2.407 ± 0.919
1.203PheCys: 1.203 ± 0.617
0.802PheAsp: 0.802 ± 0.412
1.604PheGlu: 1.604 ± 0.823
0.401PhePhe: 0.401 ± 0.206
1.604PheGly: 1.604 ± 0.823
1.604PheHis: 1.604 ± 3.632
2.006PheIle: 2.006 ± 0.784
1.604PheLys: 1.604 ± 0.875
5.616PheLeu: 5.616 ± 1.492
1.604PheMet: 1.604 ± 1.05
1.203PheAsn: 1.203 ± 0.617
0.802PhePro: 0.802 ± 0.412
3.209PheGln: 3.209 ± 1.295
2.407PheArg: 2.407 ± 1.547
2.006PheSer: 2.006 ± 0.907
1.604PheThr: 1.604 ± 0.823
1.604PheVal: 1.604 ± 0.875
0.0PheTrp: 0.0 ± 0.0
1.604PheTyr: 1.604 ± 0.823
0.0PheXaa: 0.0 ± 0.0
Gly
4.813GlyAla: 4.813 ± 1.828
0.802GlyCys: 0.802 ± 0.412
3.209GlyAsp: 3.209 ± 1.124
4.813GlyGlu: 4.813 ± 1.765
2.006GlyPhe: 2.006 ± 0.873
3.61GlyGly: 3.61 ± 2.701
0.401GlyHis: 0.401 ± 1.006
3.209GlyIle: 3.209 ± 0.931
4.813GlyLys: 4.813 ± 1.661
5.616GlyLeu: 5.616 ± 2.881
0.802GlyMet: 0.802 ± 0.412
2.006GlyAsn: 2.006 ± 1.16
2.407GlyPro: 2.407 ± 1.235
2.006GlyGln: 2.006 ± 1.87
2.808GlyArg: 2.808 ± 1.771
3.61GlySer: 3.61 ± 3.939
3.209GlyThr: 3.209 ± 1.124
4.011GlyVal: 4.011 ± 1.462
0.401GlyTrp: 0.401 ± 0.206
2.407GlyTyr: 2.407 ± 1.235
0.0GlyXaa: 0.0 ± 0.0
His
1.203HisAla: 1.203 ± 1.016
0.401HisCys: 0.401 ± 1.132
0.401HisAsp: 0.401 ± 0.206
1.604HisGlu: 1.604 ± 1.378
0.802HisPhe: 0.802 ± 0.412
0.802HisGly: 0.802 ± 0.412
0.802HisHis: 0.802 ± 0.412
2.407HisIle: 2.407 ± 1.029
1.203HisLys: 1.203 ± 0.774
1.203HisLeu: 1.203 ± 1.639
0.401HisMet: 0.401 ± 0.206
0.802HisAsn: 0.802 ± 1.124
0.401HisPro: 0.401 ± 0.206
1.604HisGln: 1.604 ± 2.249
1.203HisArg: 1.203 ± 0.774
1.203HisSer: 1.203 ± 1.722
0.802HisThr: 0.802 ± 1.679
2.407HisVal: 2.407 ± 1.235
0.401HisTrp: 0.401 ± 0.206
0.802HisTyr: 0.802 ± 0.412
0.0HisXaa: 0.0 ± 0.0
Ile
3.209IleAla: 3.209 ± 1.647
1.203IleCys: 1.203 ± 0.617
2.808IleAsp: 2.808 ± 1.441
4.011IleGlu: 4.011 ± 2.137
3.209IlePhe: 3.209 ± 0.834
3.61IleGly: 3.61 ± 1.571
1.604IleHis: 1.604 ± 1.378
4.011IleIle: 4.011 ± 1.446
3.61IleLys: 3.61 ± 1.29
5.215IleLeu: 5.215 ± 1.718
0.802IleMet: 0.802 ± 0.412
3.61IleAsn: 3.61 ± 1.852
5.616IlePro: 5.616 ± 2.656
4.813IleGln: 4.813 ± 0.979
3.209IleArg: 3.209 ± 1.07
2.006IleSer: 2.006 ± 2.17
4.412IleThr: 4.412 ± 2.782
1.203IleVal: 1.203 ± 0.617
0.0IleTrp: 0.0 ± 0.0
1.604IleTyr: 1.604 ± 0.875
0.0IleXaa: 0.0 ± 0.0
Lys
4.813LysAla: 4.813 ± 3.86
0.401LysCys: 0.401 ± 0.206
3.209LysAsp: 3.209 ± 1.129
6.418LysGlu: 6.418 ± 2.603
3.61LysPhe: 3.61 ± 1.039
4.011LysGly: 4.011 ± 1.421
0.802LysHis: 0.802 ± 0.412
2.808LysIle: 2.808 ± 0.985
5.215LysLys: 5.215 ± 3.207
4.011LysLeu: 4.011 ± 1.081
1.604LysMet: 1.604 ± 0.751
3.61LysAsn: 3.61 ± 1.571
2.006LysPro: 2.006 ± 0.873
3.61LysGln: 3.61 ± 1.824
2.808LysArg: 2.808 ± 1.441
5.616LysSer: 5.616 ± 1.227
3.61LysThr: 3.61 ± 2.504
2.808LysVal: 2.808 ± 1.006
0.401LysTrp: 0.401 ± 0.206
0.802LysTyr: 0.802 ± 0.412
0.0LysXaa: 0.0 ± 0.0
Leu
4.412LeuAla: 4.412 ± 1.781
1.203LeuCys: 1.203 ± 0.617
4.412LeuAsp: 4.412 ± 0.901
5.616LeuGlu: 5.616 ± 5.453
2.006LeuPhe: 2.006 ± 1.937
6.017LeuGly: 6.017 ± 1.544
1.203LeuHis: 1.203 ± 0.617
3.61LeuIle: 3.61 ± 0.925
7.621LeuLys: 7.621 ± 2.164
5.215LeuLeu: 5.215 ± 3.362
2.006LeuMet: 2.006 ± 1.199
3.209LeuAsn: 3.209 ± 2.499
6.819LeuPro: 6.819 ± 1.915
3.209LeuGln: 3.209 ± 1.122
2.407LeuArg: 2.407 ± 1.235
6.418LeuSer: 6.418 ± 2.574
7.22LeuThr: 7.22 ± 1.97
9.226LeuVal: 9.226 ± 2.11
0.0LeuTrp: 0.0 ± 0.0
2.407LeuTyr: 2.407 ± 0.866
0.0LeuXaa: 0.0 ± 0.0
Met
2.808MetAla: 2.808 ± 1.862
0.0MetCys: 0.0 ± 0.0
1.604MetAsp: 1.604 ± 0.823
4.412MetGlu: 4.412 ± 1.642
0.401MetPhe: 0.401 ± 0.206
0.802MetGly: 0.802 ± 0.891
0.802MetHis: 0.802 ± 0.412
0.0MetIle: 0.0 ± 0.0
1.203MetLys: 1.203 ± 0.617
1.203MetLeu: 1.203 ± 0.617
1.203MetMet: 1.203 ± 0.617
1.203MetAsn: 1.203 ± 1.016
1.203MetPro: 1.203 ± 0.617
2.006MetGln: 2.006 ± 1.029
1.203MetArg: 1.203 ± 0.812
2.006MetSer: 2.006 ± 0.873
0.401MetThr: 0.401 ± 0.96
0.802MetVal: 0.802 ± 0.412
0.802MetTrp: 0.802 ± 0.412
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.209AsnAla: 3.209 ± 0.983
0.802AsnCys: 0.802 ± 0.412
0.401AsnAsp: 0.401 ± 0.206
0.802AsnGlu: 0.802 ± 0.847
3.209AsnPhe: 3.209 ± 0.938
1.203AsnGly: 1.203 ± 0.617
0.401AsnHis: 0.401 ± 1.257
2.407AsnIle: 2.407 ± 1.235
2.407AsnLys: 2.407 ± 0.919
4.011AsnLeu: 4.011 ± 0.82
1.203AsnMet: 1.203 ± 0.617
2.407AsnAsn: 2.407 ± 0.919
2.006AsnPro: 2.006 ± 2.133
2.808AsnGln: 2.808 ± 2.164
3.61AsnArg: 3.61 ± 2.104
3.61AsnSer: 3.61 ± 1.852
2.006AsnThr: 2.006 ± 0.907
2.407AsnVal: 2.407 ± 0.875
1.604AsnTrp: 1.604 ± 0.823
1.203AsnTyr: 1.203 ± 0.617
0.0AsnXaa: 0.0 ± 0.0
Pro
4.011ProAla: 4.011 ± 2.058
0.401ProCys: 0.401 ± 1.257
3.209ProAsp: 3.209 ± 0.938
5.616ProGlu: 5.616 ± 2.845
1.203ProPhe: 1.203 ± 0.923
2.808ProGly: 2.808 ± 1.441
0.802ProHis: 0.802 ± 0.847
2.808ProIle: 2.808 ± 0.985
3.209ProLys: 3.209 ± 1.462
2.407ProLeu: 2.407 ± 0.875
0.401ProMet: 0.401 ± 0.206
2.808ProAsn: 2.808 ± 2.159
4.412ProPro: 4.412 ± 1.181
1.604ProGln: 1.604 ± 0.823
2.407ProArg: 2.407 ± 1.029
3.209ProSer: 3.209 ± 2.324
4.813ProThr: 4.813 ± 1.806
4.813ProVal: 4.813 ± 0.976
0.401ProTrp: 0.401 ± 0.206
2.407ProTyr: 2.407 ± 0.919
0.0ProXaa: 0.0 ± 0.0
Gln
4.412GlnAla: 4.412 ± 1.686
0.401GlnCys: 0.401 ± 1.006
1.604GlnAsp: 1.604 ± 0.823
3.209GlnGlu: 3.209 ± 1.036
0.802GlnPhe: 0.802 ± 0.412
3.61GlnGly: 3.61 ± 1.824
0.0GlnHis: 0.0 ± 0.0
2.407GlnIle: 2.407 ± 1.161
4.011GlnLys: 4.011 ± 3.992
7.22GlnLeu: 7.22 ± 1.83
0.802GlnMet: 0.802 ± 0.412
1.604GlnAsn: 1.604 ± 0.875
2.407GlnPro: 2.407 ± 1.319
1.604GlnGln: 1.604 ± 0.823
6.017GlnArg: 6.017 ± 1.324
1.604GlnSer: 1.604 ± 0.941
3.61GlnThr: 3.61 ± 2.544
2.407GlnVal: 2.407 ± 1.235
0.401GlnTrp: 0.401 ± 0.206
1.604GlnTyr: 1.604 ± 0.941
0.0GlnXaa: 0.0 ± 0.0
Arg
2.006ArgAla: 2.006 ± 1.029
1.604ArgCys: 1.604 ± 1.694
2.808ArgAsp: 2.808 ± 2.802
3.61ArgGlu: 3.61 ± 1.217
2.006ArgPhe: 2.006 ± 0.907
3.61ArgGly: 3.61 ± 1.005
2.407ArgHis: 2.407 ± 0.866
5.215ArgIle: 5.215 ± 1.377
4.412ArgLys: 4.412 ± 1.163
4.412ArgLeu: 4.412 ± 5.398
2.407ArgMet: 2.407 ± 0.866
4.011ArgAsn: 4.011 ± 1.462
3.209ArgPro: 3.209 ± 1.07
2.808ArgGln: 2.808 ± 2.397
6.017ArgArg: 6.017 ± 1.692
2.407ArgSer: 2.407 ± 0.92
2.407ArgThr: 2.407 ± 0.875
4.412ArgVal: 4.412 ± 2.264
1.604ArgTrp: 1.604 ± 0.823
0.802ArgTyr: 0.802 ± 0.412
0.0ArgXaa: 0.0 ± 0.0
Ser
3.61SerAla: 3.61 ± 1.657
0.802SerCys: 0.802 ± 0.891
2.006SerAsp: 2.006 ± 1.029
2.407SerGlu: 2.407 ± 0.866
2.808SerPhe: 2.808 ± 1.771
4.813SerGly: 4.813 ± 1.828
0.802SerHis: 0.802 ± 1.124
5.215SerIle: 5.215 ± 3.24
3.61SerLys: 3.61 ± 0.925
6.819SerLeu: 6.819 ± 2.757
2.407SerMet: 2.407 ± 0.876
1.203SerAsn: 1.203 ± 0.617
1.604SerPro: 1.604 ± 0.781
2.808SerGln: 2.808 ± 3.802
3.209SerArg: 3.209 ± 1.647
7.22SerSer: 7.22 ± 2.14
7.621SerThr: 7.621 ± 2.298
3.209SerVal: 3.209 ± 1.124
2.006SerTrp: 2.006 ± 0.873
1.203SerTyr: 1.203 ± 0.617
0.0SerXaa: 0.0 ± 0.0
Thr
6.418ThrAla: 6.418 ± 1.876
1.203ThrCys: 1.203 ± 0.812
3.61ThrAsp: 3.61 ± 3.317
4.412ThrGlu: 4.412 ± 1.642
2.808ThrPhe: 2.808 ± 0.985
3.209ThrGly: 3.209 ± 1.124
2.006ThrHis: 2.006 ± 1.927
4.412ThrIle: 4.412 ± 1.624
1.604ThrLys: 1.604 ± 0.823
6.017ThrLeu: 6.017 ± 3.204
1.203ThrMet: 1.203 ± 0.617
2.006ThrAsn: 2.006 ± 0.907
2.006ThrPro: 2.006 ± 1.029
3.209ThrGln: 3.209 ± 0.931
7.621ThrArg: 7.621 ± 2.411
6.017ThrSer: 6.017 ± 4.049
5.616ThrThr: 5.616 ± 3.78
4.412ThrVal: 4.412 ± 1.048
1.203ThrTrp: 1.203 ± 0.617
1.203ThrTyr: 1.203 ± 0.617
0.0ThrXaa: 0.0 ± 0.0
Val
2.808ValAla: 2.808 ± 1.006
2.006ValCys: 2.006 ± 1.16
3.209ValAsp: 3.209 ± 1.502
4.011ValGlu: 4.011 ± 1.421
1.203ValPhe: 1.203 ± 0.617
2.006ValGly: 2.006 ± 1.029
1.203ValHis: 1.203 ± 2.376
3.209ValIle: 3.209 ± 1.647
4.412ValLys: 4.412 ± 2.658
5.215ValLeu: 5.215 ± 1.473
2.407ValMet: 2.407 ± 0.866
2.006ValAsn: 2.006 ± 1.029
4.412ValPro: 4.412 ± 1.044
2.006ValGln: 2.006 ± 1.251
2.808ValArg: 2.808 ± 1.441
4.412ValSer: 4.412 ± 1.163
5.616ValThr: 5.616 ± 1.282
0.802ValVal: 0.802 ± 0.412
0.401ValTrp: 0.401 ± 0.206
2.407ValTyr: 2.407 ± 0.919
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.802TrpCys: 0.802 ± 0.412
0.802TrpAsp: 0.802 ± 0.412
2.006TrpGlu: 2.006 ± 1.029
0.0TrpPhe: 0.0 ± 0.0
0.401TrpGly: 0.401 ± 0.206
0.401TrpHis: 0.401 ± 0.206
0.802TrpIle: 0.802 ± 1.012
0.802TrpLys: 0.802 ± 0.412
1.604TrpLeu: 1.604 ± 0.823
0.401TrpMet: 0.401 ± 0.206
0.802TrpAsn: 0.802 ± 1.124
0.802TrpPro: 0.802 ± 0.412
0.401TrpGln: 0.401 ± 0.206
1.604TrpArg: 1.604 ± 0.823
0.0TrpSer: 0.0 ± 0.0
0.401TrpThr: 0.401 ± 0.206
0.802TrpVal: 0.802 ± 0.412
0.0TrpTrp: 0.0 ± 0.0
0.401TrpTyr: 0.401 ± 0.206
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.006TyrAla: 2.006 ± 1.029
0.802TyrCys: 0.802 ± 0.847
2.407TyrAsp: 2.407 ± 0.92
2.407TyrGlu: 2.407 ± 0.919
1.604TyrPhe: 1.604 ± 0.823
1.203TyrGly: 1.203 ± 0.617
0.802TyrHis: 0.802 ± 0.412
2.006TyrIle: 2.006 ± 1.029
3.209TyrLys: 3.209 ± 1.07
1.203TyrLeu: 1.203 ± 1.016
0.0TyrMet: 0.0 ± 0.0
2.407TyrAsn: 2.407 ± 1.235
0.401TyrPro: 0.401 ± 0.206
2.006TyrGln: 2.006 ± 2.4
2.006TyrArg: 2.006 ± 0.784
2.808TyrSer: 2.808 ± 0.985
0.802TyrThr: 0.802 ± 0.412
0.401TyrVal: 0.401 ± 0.206
0.0TyrTrp: 0.0 ± 0.0
0.401TyrTyr: 0.401 ± 0.206
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2494 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski