Amino acid dipeptide frequency for Pepino mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
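As a rough illustration of what such a calculation involves, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choices to count overlapping pairs within each protein and to map any non-standard letter to X (Xaa) are assumptions made for illustration; this page does not state how Proteome-pI computes its values.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_per_mille(["MKLLA", "AKL"])
```

Here the two toy sequences contain six dipeptides in total, two of which are KL, so `freqs["KL"]` is one third of 1000.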

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
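The arithmetic behind the expected value, and the red/blue marking it leads to, can be sketched as follows. The function name `classify` and the use of the uniform expectation as the colour threshold are assumptions based on the description above, not a statement of how the page itself assigns colours; the two example values are taken from the table (LysLeu and AlaCys).

```python
N_STANDARD = 20  # standard amino acids; 21 if Xaa is included

n_dipeptides = N_STANDARD ** 2            # 400 possible dipeptides
expected_percent = 100 / n_dipeptides     # 0.25 (percent)
expected_per_mille = 1000 / n_dipeptides  # 2.5 (per mille, the table's unit)


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "red"    # more frequent than expected
    if observed_per_mille < expected:
        return "blue"   # underrepresented
    return "neutral"


lys_leu = classify(10.397)  # LysLeu value from the table
ala_cys = classify(0.473)   # AlaCys value from the table
```

With Xaa included the same calculation gives 1000 / 441, or roughly 2.27 per mille.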

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptides are grouped by first residue; each entry gives frequency ± error (‰).
Ala
AlaAla: 8.979 ± 2.737
AlaCys: 0.473 ± 0.806
AlaAsp: 4.726 ± 1.007
AlaGlu: 4.726 ± 1.559
AlaPhe: 2.836 ± 0.651
AlaGly: 2.836 ± 1.624
AlaHis: 0.945 ± 0.675
AlaIle: 8.034 ± 3.795
AlaLys: 5.671 ± 3.226
AlaLeu: 6.144 ± 2.543
AlaMet: 3.781 ± 1.137
AlaAsn: 5.198 ± 1.679
AlaPro: 3.781 ± 2.7
AlaGln: 1.89 ± 1.35
AlaArg: 3.308 ± 1.127
AlaSer: 4.726 ± 1.047
AlaThr: 5.671 ± 1.55
AlaVal: 4.253 ± 1.639
AlaTrp: 0.473 ± 0.241
AlaTyr: 4.726 ± 1.047
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.473 ± 0.241
CysCys: 0.473 ± 0.241
CysAsp: 0.473 ± 0.716
CysGlu: 1.418 ± 0.955
CysPhe: 0.473 ± 0.716
CysGly: 1.418 ± 0.724
CysHis: 0.0 ± 0.0
CysIle: 0.473 ± 0.806
CysLys: 0.945 ± 0.483
CysLeu: 1.418 ± 0.598
CysMet: 0.945 ± 0.675
CysAsn: 0.473 ± 0.241
CysPro: 0.473 ± 0.241
CysGln: 1.89 ± 0.966
CysArg: 0.0 ± 0.0
CysSer: 1.89 ± 0.965
CysThr: 1.418 ± 1.311
CysVal: 0.473 ± 0.241
CysTrp: 0.0 ± 0.0
CysTyr: 0.945 ± 1.163
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.89 ± 1.35
AspCys: 0.945 ± 0.483
AspAsp: 2.836 ± 0.999
AspGlu: 4.726 ± 1.866
AspPhe: 4.726 ± 0.903
AspGly: 2.836 ± 1.408
AspHis: 0.0 ± 0.0
AspIle: 4.253 ± 1.479
AspLys: 1.418 ± 0.598
AspLeu: 2.836 ± 1.743
AspMet: 0.945 ± 0.483
AspAsn: 2.363 ± 0.822
AspPro: 2.836 ± 1.195
AspGln: 1.418 ± 0.598
AspArg: 1.418 ± 0.576
AspSer: 3.781 ± 1.046
AspThr: 3.781 ± 1.125
AspVal: 1.89 ± 0.569
AspTrp: 0.945 ± 0.483
AspTyr: 2.363 ± 1.206
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.616 ± 2.578
GluCys: 0.473 ± 0.241
GluAsp: 1.418 ± 0.937
GluGlu: 2.836 ± 1.841
GluPhe: 3.308 ± 1.251
GluGly: 2.363 ± 0.818
GluHis: 0.945 ± 0.614
GluIle: 4.253 ± 0.846
GluLys: 5.198 ± 1.878
GluLeu: 6.144 ± 1.046
GluMet: 0.473 ± 0.241
GluAsn: 3.781 ± 1.716
GluPro: 4.253 ± 1.479
GluGln: 1.418 ± 0.724
GluArg: 1.418 ± 1.311
GluSer: 3.308 ± 1.202
GluThr: 3.308 ± 0.551
GluVal: 2.836 ± 1.682
GluTrp: 0.945 ± 0.483
GluTyr: 1.418 ± 0.598
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.308 ± 1.127
PheCys: 0.945 ± 0.675
PheAsp: 4.726 ± 1.626
PheGlu: 4.726 ± 1.12
PhePhe: 3.308 ± 0.551
PheGly: 1.418 ± 1.755
PheHis: 1.89 ± 0.703
PheIle: 3.781 ± 0.962
PheLys: 1.89 ± 0.965
PheLeu: 3.781 ± 0.962
PheMet: 0.945 ± 0.483
PheAsn: 2.363 ± 0.818
PhePro: 3.781 ± 1.09
PheGln: 2.363 ± 0.822
PheArg: 1.418 ± 0.598
PheSer: 4.253 ± 3.95
PheThr: 1.89 ± 0.569
PheVal: 2.363 ± 0.818
PheTrp: 0.945 ± 0.483
PheTyr: 0.945 ± 0.614
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.781 ± 2.7
GlyCys: 0.945 ± 0.483
GlyAsp: 4.726 ± 1.31
GlyGlu: 2.363 ± 0.598
GlyPhe: 1.89 ± 0.965
GlyGly: 2.836 ± 0.651
GlyHis: 2.836 ± 0.651
GlyIle: 1.418 ± 0.995
GlyLys: 3.308 ± 1.066
GlyLeu: 3.781 ± 2.108
GlyMet: 0.0 ± 0.0
GlyAsn: 2.363 ± 0.737
GlyPro: 1.418 ± 0.576
GlyGln: 1.418 ± 0.656
GlyArg: 0.0 ± 0.0
GlySer: 1.418 ± 0.598
GlyThr: 2.836 ± 1.327
GlyVal: 2.363 ± 0.657
GlyTrp: 0.945 ± 0.483
GlyTyr: 1.89 ± 0.569
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.363 ± 1.206
HisCys: 1.418 ± 1.111
HisAsp: 1.418 ± 0.724
HisGlu: 1.418 ± 0.724
HisPhe: 2.363 ± 0.818
HisGly: 2.363 ± 0.818
HisHis: 2.363 ± 0.785
HisIle: 2.363 ± 0.865
HisLys: 2.836 ± 0.999
HisLeu: 2.836 ± 0.999
HisMet: 0.945 ± 1.044
HisAsn: 1.89 ± 0.96
HisPro: 0.473 ± 0.241
HisGln: 1.418 ± 0.656
HisArg: 2.363 ± 0.785
HisSer: 2.363 ± 2.525
HisThr: 2.363 ± 0.818
HisVal: 1.418 ± 0.724
HisTrp: 0.0 ± 0.0
HisTyr: 0.945 ± 1.163
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.253 ± 1.307
IleCys: 0.945 ± 0.483
IleAsp: 1.89 ± 1.246
IleGlu: 4.253 ± 0.846
IlePhe: 4.253 ± 0.426
IleGly: 3.308 ± 1.066
IleHis: 2.363 ± 1.023
IleIle: 2.836 ± 1.242
IleLys: 2.836 ± 1.448
IleLeu: 4.253 ± 1.999
IleMet: 0.945 ± 0.483
IleAsn: 6.144 ± 1.541
IlePro: 3.308 ± 0.999
IleGln: 3.781 ± 1.457
IleArg: 0.945 ± 0.675
IleSer: 6.144 ± 1.399
IleThr: 4.253 ± 2.361
IleVal: 3.781 ± 2.975
IleTrp: 0.0 ± 0.0
IleTyr: 0.945 ± 0.694
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.781 ± 1.046
LysCys: 0.473 ± 0.241
LysAsp: 3.308 ± 0.999
LysGlu: 3.308 ± 0.551
LysPhe: 2.363 ± 1.658
LysGly: 1.418 ± 0.724
LysHis: 1.418 ± 0.724
LysIle: 4.253 ± 1.853
LysLys: 4.253 ± 0.426
LysLeu: 10.397 ± 2.249
LysMet: 1.418 ± 0.724
LysAsn: 2.836 ± 1.448
LysPro: 2.363 ± 1.206
LysGln: 1.89 ± 0.674
LysArg: 0.945 ± 0.483
LysSer: 5.198 ± 1.262
LysThr: 4.253 ± 1.479
LysVal: 4.253 ± 1.205
LysTrp: 0.0 ± 0.0
LysTyr: 2.363 ± 1.231
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.561 ± 4.43
LeuCys: 0.945 ± 0.483
LeuAsp: 3.781 ± 0.987
LeuGlu: 5.198 ± 1.919
LeuPhe: 6.144 ± 2.198
LeuGly: 5.671 ± 1.55
LeuHis: 3.308 ± 0.84
LeuIle: 3.781 ± 2.067
LeuLys: 6.616 ± 1.975
LeuLeu: 5.198 ± 1.957
LeuMet: 0.945 ± 0.546
LeuAsn: 4.726 ± 1.754
LeuPro: 4.726 ± 1.824
LeuGln: 4.253 ± 0.846
LeuArg: 3.781 ± 1.349
LeuSer: 7.561 ± 3.847
LeuThr: 7.561 ± 2.925
LeuVal: 3.781 ± 1.716
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.308 ± 1.689
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.89 ± 0.965
MetCys: 0.473 ± 0.241
MetAsp: 0.0 ± 0.0
MetGlu: 1.418 ± 0.598
MetPhe: 0.0 ± 0.0
MetGly: 1.418 ± 1.499
MetHis: 0.473 ± 0.241
MetIle: 0.945 ± 0.483
MetLys: 1.89 ± 0.965
MetLeu: 2.836 ± 1.448
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.418 ± 1.062
MetGln: 1.89 ± 0.965
MetArg: 1.89 ± 0.965
MetSer: 0.945 ± 1.009
MetThr: 0.945 ± 0.483
MetVal: 0.473 ± 0.241
MetTrp: 0.473 ± 0.835
MetTyr: 0.945 ± 1.669
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.726 ± 1.866
AsnCys: 1.418 ± 0.955
AsnAsp: 2.363 ± 1.206
AsnGlu: 2.363 ± 1.231
AsnPhe: 1.89 ± 0.96
AsnGly: 0.945 ± 1.009
AsnHis: 1.89 ± 0.96
AsnIle: 4.726 ± 1.626
AsnLys: 4.726 ± 1.569
AsnLeu: 5.198 ± 1.163
AsnMet: 0.945 ± 0.959
AsnAsn: 3.781 ± 1.384
AsnPro: 3.781 ± 2.368
AsnGln: 0.945 ± 0.483
AsnArg: 1.89 ± 0.965
AsnSer: 2.363 ± 0.785
AsnThr: 6.144 ± 1.838
AsnVal: 1.89 ± 0.703
AsnTrp: 0.473 ± 0.835
AsnTyr: 3.308 ± 0.999
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.089 ± 3.746
ProCys: 1.418 ± 1.311
ProAsp: 2.363 ± 0.657
ProGlu: 3.308 ± 1.017
ProPhe: 0.945 ± 1.163
ProGly: 2.836 ± 0.986
ProHis: 1.418 ± 0.598
ProIle: 2.363 ± 1.023
ProLys: 3.781 ± 1.093
ProLeu: 2.836 ± 0.986
ProMet: 1.89 ± 0.965
ProAsn: 2.363 ± 1.468
ProPro: 2.363 ± 2.17
ProGln: 2.363 ± 0.822
ProArg: 3.781 ± 0.51
ProSer: 3.308 ± 2.084
ProThr: 2.363 ± 1.296
ProVal: 3.781 ± 0.733
ProTrp: 0.945 ± 1.009
ProTyr: 0.945 ± 0.483
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.89 ± 0.96
GlnCys: 0.945 ± 0.614
GlnAsp: 1.89 ± 0.703
GlnGlu: 1.418 ± 0.576
GlnPhe: 1.89 ± 1.35
GlnGly: 1.418 ± 0.724
GlnHis: 3.308 ± 1.177
GlnIle: 0.473 ± 0.241
GlnLys: 1.418 ± 0.576
GlnLeu: 5.198 ± 1.262
GlnMet: 1.418 ± 0.724
GlnAsn: 2.363 ± 0.785
GlnPro: 4.253 ± 2.411
GlnGln: 2.836 ± 1.914
GlnArg: 1.89 ± 0.965
GlnSer: 3.781 ± 0.733
GlnThr: 3.308 ± 1.689
GlnVal: 2.836 ± 1.195
GlnTrp: 0.945 ± 0.483
GlnTyr: 0.945 ± 0.694
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.781 ± 1.125
ArgCys: 0.945 ± 0.483
ArgAsp: 1.418 ± 0.724
ArgGlu: 2.363 ± 1.206
ArgPhe: 1.418 ± 0.598
ArgGly: 1.418 ± 0.576
ArgHis: 0.945 ± 0.614
ArgIle: 0.945 ± 0.675
ArgLys: 1.418 ± 0.724
ArgLeu: 2.363 ± 0.785
ArgMet: 0.473 ± 0.241
ArgAsn: 1.89 ± 0.965
ArgPro: 1.418 ± 0.598
ArgGln: 4.253 ± 1.727
ArgArg: 1.89 ± 0.569
ArgSer: 3.308 ± 1.251
ArgThr: 3.781 ± 1.21
ArgVal: 1.89 ± 0.674
ArgTrp: 0.473 ± 1.114
ArgTyr: 1.89 ± 0.965
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.198 ± 2.699
SerCys: 0.945 ± 1.163
SerAsp: 3.781 ± 1.001
SerGlu: 2.836 ± 1.195
SerPhe: 5.198 ± 2.722
SerGly: 2.836 ± 1.311
SerHis: 1.89 ± 0.703
SerIle: 4.726 ± 1.85
SerLys: 2.836 ± 0.811
SerLeu: 6.616 ± 1.102
SerMet: 0.945 ± 0.521
SerAsn: 4.726 ± 1.626
SerPro: 3.308 ± 1.783
SerGln: 3.308 ± 1.202
SerArg: 4.726 ± 1.155
SerSer: 8.034 ± 2.584
SerThr: 3.781 ± 1.496
SerVal: 3.308 ± 1.678
SerTrp: 0.0 ± 0.0
SerTyr: 3.308 ± 1.827
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.144 ± 1.411
ThrCys: 0.473 ± 0.241
ThrAsp: 1.418 ± 0.656
ThrGlu: 3.308 ± 1.202
ThrPhe: 4.253 ± 1.479
ThrGly: 3.308 ± 1.127
ThrHis: 7.089 ± 1.913
ThrIle: 5.198 ± 1.207
ThrLys: 1.89 ± 0.881
ThrLeu: 7.089 ± 1.993
ThrMet: 0.945 ± 0.675
ThrAsn: 2.836 ± 2.291
ThrPro: 5.671 ± 2.971
ThrGln: 1.89 ± 0.703
ThrArg: 2.363 ± 0.598
ThrSer: 4.726 ± 1.579
ThrThr: 6.144 ± 2.493
ThrVal: 2.836 ± 0.811
ThrTrp: 0.945 ± 0.483
ThrTyr: 3.781 ± 0.962
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.308 ± 2.505
ValCys: 0.473 ± 0.716
ValAsp: 2.836 ± 1.195
ValGlu: 3.308 ± 1.251
ValPhe: 0.945 ± 0.483
ValGly: 0.945 ± 0.483
ValHis: 0.945 ± 0.614
ValIle: 2.836 ± 0.825
ValLys: 5.671 ± 1.485
ValLeu: 5.198 ± 4.775
ValMet: 1.418 ± 0.724
ValAsn: 2.836 ± 1.448
ValPro: 1.89 ± 0.674
ValGln: 2.836 ± 0.811
ValArg: 2.363 ± 1.206
ValSer: 2.363 ± 1.346
ValThr: 4.726 ± 1.626
ValVal: 6.144 ± 0.732
ValTrp: 0.473 ± 0.835
ValTyr: 1.89 ± 0.965
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.418 ± 0.576
TrpCys: 0.0 ± 0.0
TrpAsp: 0.945 ± 0.675
TrpGlu: 0.473 ± 0.241
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.473 ± 0.241
TrpLys: 0.945 ± 0.483
TrpLeu: 0.473 ± 0.241
TrpMet: 0.0 ± 0.0
TrpAsn: 1.418 ± 1.4
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.945 ± 0.483
TrpSer: 0.0 ± 0.0
TrpThr: 0.473 ± 0.241
TrpVal: 1.418 ± 0.955
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.726 ± 1.314
TyrCys: 0.945 ± 0.614
TyrAsp: 1.89 ± 0.965
TyrGlu: 0.945 ± 0.483
TyrPhe: 2.836 ± 2.36
TyrGly: 0.945 ± 0.675
TyrHis: 2.363 ± 0.818
TyrIle: 3.308 ± 0.551
TyrLys: 0.945 ± 0.694
TyrLeu: 4.253 ± 1.085
TyrMet: 0.473 ± 0.241
TyrAsn: 1.418 ± 0.656
TyrPro: 0.945 ± 0.614
TyrGln: 2.363 ± 1.588
TyrArg: 0.945 ± 0.483
TyrSer: 2.836 ± 1.448
TyrThr: 3.308 ± 1.202
TyrVal: 1.418 ± 0.576
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.473 ± 0.241
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2117 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
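One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement, the dipeptide frequencies are recomputed for each resample, and the spread across resamples is reported as the error. The sketch below follows that reading. The function names `per_mille` and `bootstrap_error`, the fixed seed, the toy sequences, and the use of the sample standard deviation as the error measure are all assumptions; the exact procedure used by Proteome-pI is not described on this page.

```python
import random
from collections import Counter
from statistics import stdev


def per_mille(proteins):
    """Dipeptide frequencies (per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Standard deviation of each dipeptide frequency across resamples."""
    rng = random.Random(seed)
    rounds = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        resampled = rng.choices(proteins, k=len(proteins))
        rounds.append(per_mille(resampled))
    # A dipeptide missing from a resample counts as frequency 0.
    pairs = set().union(*rounds)
    return {p: stdev([r.get(p, 0.0) for r in rounds]) for p in pairs}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKLLA", "AKLT", "GGSAKL"]
errors = bootstrap_error(toy)
```

With only a handful of proteins, as in this proteome, individual resamples can differ a lot from one another, which is consistent with errors in the table that are sometimes as large as the frequencies themselves.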

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski