Amino acid dipeptide frequency for Honeysuckle yellow vein virus-[Japan:Masuda:2003]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
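As a rough illustration of how such frequencies could be computed, the sketch below counts overlapping dipeptides in a set of protein sequences and reports each as per mille of all dipeptides counted. The function name `dipeptide_frequencies`, the one-letter input alphabet, and the choice to count within each protein rather than across protein boundaries are assumptions made for this example, not a description of the Proteome-pI pipeline.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code, plus X for non-standard/unknown (Xaa).
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping windows of length 2 counted
    inside each protein; pairs spanning two proteins are not counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map any letter outside the alphabet to X (Xaa).
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    freqs = {}
    for a, b in product(ALPHABET, repeat=2):
        pair = a + b
        freqs[pair] = 1000.0 * counts[pair] / total if total else 0.0
    return freqs


if __name__ == "__main__":
    toy = ["MSSAL", "ASSK"]  # made-up sequences, not from this proteome
    freqs = dipeptide_frequencies(toy)
    print(len(freqs))        # number of possible dipeptides, including Xaa
    print(freqs["SS"])       # per mille frequency of Ser-Ser in the toy set
```

Returning every possible pair, including those with a count of zero, mirrors the layout of a full 21 x 21 table.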

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
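The uniform expectation and the color rule can be written out as a small sketch. The helper names `uniform_expectation` and `classify` are hypothetical, and using the uniform 1/400 baseline as the threshold simply follows the paragraph above; the exact rule the site uses for coloring (for instance, whether it takes the error estimate into account) is not stated here.

```python
def uniform_expectation(n_residues=20):
    """Expected frequency of each dipeptide, in per mille, if all were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(freq_per_mille, expected=None):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if expected is None:
        expected = uniform_expectation()
    if freq_per_mille > expected:
        return "over"    # marked in red in the table
    if freq_per_mille < expected:
        return "under"   # marked in blue in the table
    return "expected"


print(uniform_expectation())   # 2.5 per mille, i.e. 0.25%
print(classify(11.807))        # SerSer value from the table: "over"
print(classify(0.908))         # AlaCys value from the table: "under"
```

Passing `n_residues=21` gives the corresponding baseline when Xaa is counted as a 21st residue.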

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.633 ± 1.847
AlaCys: 0.908 ± 0.769
AlaAsp: 0.908 ± 0.769
AlaGlu: 1.817 ± 1.286
AlaPhe: 1.817 ± 1.093
AlaGly: 0.0 ± 0.0
AlaHis: 3.633 ± 1.402
AlaIle: 2.725 ± 1.375
AlaLys: 2.725 ± 0.858
AlaLeu: 7.266 ± 1.745
AlaMet: 0.0 ± 0.0
AlaAsn: 2.725 ± 0.881
AlaPro: 4.541 ± 1.778
AlaGln: 6.358 ± 1.71
AlaArg: 3.633 ± 1.847
AlaSer: 4.541 ± 2.369
AlaThr: 5.45 ± 2.727
AlaVal: 2.725 ± 1.359
AlaTrp: 1.817 ± 1.286
AlaTyr: 2.725 ± 1.289
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.908 ± 0.992
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.817 ± 1.116
CysPhe: 0.908 ± 0.806
CysGly: 1.817 ± 1.012
CysHis: 0.0 ± 0.0
CysIle: 0.908 ± 0.769
CysLys: 0.908 ± 0.769
CysLeu: 0.908 ± 0.93
CysMet: 0.908 ± 0.974
CysAsn: 0.908 ± 0.643
CysPro: 1.817 ± 1.948
CysGln: 1.817 ± 0.947
CysArg: 1.817 ± 0.947
CysSer: 2.725 ± 1.898
CysThr: 1.817 ± 0.738
CysVal: 0.908 ± 0.769
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.908 ± 0.643
AspCys: 0.0 ± 0.0
AspAsp: 0.908 ± 0.643
AspGlu: 2.725 ± 1.152
AspPhe: 3.633 ± 1.39
AspGly: 2.725 ± 1.929
AspHis: 0.908 ± 0.806
AspIle: 4.541 ± 2.596
AspLys: 0.0 ± 0.0
AspLeu: 6.358 ± 2.475
AspMet: 0.0 ± 0.0
AspAsn: 0.908 ± 0.806
AspPro: 1.817 ± 0.947
AspGln: 2.725 ± 1.188
AspArg: 2.725 ± 1.363
AspSer: 7.266 ± 1.417
AspThr: 3.633 ± 2.297
AspVal: 6.358 ± 1.301
AspTrp: 0.908 ± 0.643
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.45 ± 1.832
GluCys: 0.908 ± 0.974
GluAsp: 1.817 ± 0.957
GluGlu: 7.266 ± 4.312
GluPhe: 2.725 ± 1.339
GluGly: 4.541 ± 0.836
GluHis: 0.0 ± 0.0
GluIle: 0.908 ± 0.974
GluLys: 4.541 ± 1.556
GluLeu: 4.541 ± 0.972
GluMet: 0.0 ± 0.0
GluAsn: 6.358 ± 2.606
GluPro: 0.908 ± 0.769
GluGln: 1.817 ± 1.537
GluArg: 0.0 ± 0.0
GluSer: 2.725 ± 1.237
GluThr: 2.725 ± 1.162
GluVal: 0.908 ± 0.93
GluTrp: 1.817 ± 1.012
GluTyr: 1.817 ± 0.861
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.817 ± 1.012
PheCys: 0.908 ± 0.769
PheAsp: 2.725 ± 1.152
PheGlu: 1.817 ± 0.738
PhePhe: 0.908 ± 0.643
PheGly: 0.908 ± 0.643
PheHis: 2.725 ± 1.294
PheIle: 0.908 ± 0.643
PheLys: 4.541 ± 2.36
PheLeu: 4.541 ± 1.646
PheMet: 0.908 ± 0.643
PheAsn: 2.725 ± 1.643
PhePro: 0.908 ± 0.974
PheGln: 3.633 ± 1.724
PheArg: 2.725 ± 1.666
PheSer: 2.725 ± 0.923
PheThr: 3.633 ± 1.917
PheVal: 2.725 ± 1.929
PheTrp: 1.817 ± 1.537
PheTyr: 0.908 ± 0.769
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.541 ± 1.831
GlyCys: 3.633 ± 1.724
GlyAsp: 2.725 ± 1.289
GlyGlu: 0.0 ± 0.0
GlyPhe: 2.725 ± 1.368
GlyGly: 2.725 ± 1.152
GlyHis: 3.633 ± 1.724
GlyIle: 2.725 ± 0.881
GlyLys: 5.45 ± 1.561
GlyLeu: 0.908 ± 0.769
GlyMet: 0.0 ± 0.0
GlyAsn: 0.908 ± 0.769
GlyPro: 2.725 ± 1.152
GlyGln: 1.817 ± 1.123
GlyArg: 0.908 ± 0.643
GlySer: 4.541 ± 0.965
GlyThr: 3.633 ± 1.008
GlyVal: 2.725 ± 1.192
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.817 ± 1.398
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.817 ± 0.998
HisCys: 2.725 ± 1.237
HisAsp: 0.908 ± 0.992
HisGlu: 0.908 ± 0.643
HisPhe: 2.725 ± 1.289
HisGly: 2.725 ± 2.22
HisHis: 2.725 ± 1.941
HisIle: 0.908 ± 0.93
HisLys: 1.817 ± 1.339
HisLeu: 2.725 ± 1.339
HisMet: 0.908 ± 0.93
HisAsn: 2.725 ± 1.289
HisPro: 1.817 ± 0.861
HisGln: 2.725 ± 0.923
HisArg: 4.541 ± 2.285
HisSer: 1.817 ± 1.093
HisThr: 3.633 ± 1.724
HisVal: 3.633 ± 1.008
HisTrp: 0.0 ± 0.0
HisTyr: 0.908 ± 0.643
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.908 ± 0.769
IleCys: 0.908 ± 0.643
IleAsp: 2.725 ± 1.375
IleGlu: 0.908 ± 0.643
IlePhe: 3.633 ± 1.088
IleGly: 2.725 ± 1.651
IleHis: 0.908 ± 0.806
IleIle: 1.817 ± 0.998
IleLys: 6.358 ± 1.53
IleLeu: 1.817 ± 0.957
IleMet: 1.817 ± 1.316
IleAsn: 3.633 ± 1.403
IlePro: 0.908 ± 0.643
IleGln: 8.174 ± 2.695
IleArg: 4.541 ± 1.006
IleSer: 3.633 ± 2.99
IleThr: 3.633 ± 2.387
IleVal: 0.908 ± 0.643
IleTrp: 1.817 ± 0.998
IleTyr: 0.908 ± 0.992
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.633 ± 1.81
LysCys: 0.908 ± 0.806
LysAsp: 1.817 ± 1.286
LysGlu: 5.45 ± 0.899
LysPhe: 0.908 ± 0.806
LysGly: 0.908 ± 0.643
LysHis: 0.908 ± 0.643
LysIle: 4.541 ± 1.684
LysLys: 1.817 ± 1.537
LysLeu: 2.725 ± 0.793
LysMet: 0.0 ± 0.0
LysAsn: 5.45 ± 1.516
LysPro: 2.725 ± 0.858
LysGln: 1.817 ± 1.116
LysArg: 5.45 ± 2.668
LysSer: 3.633 ± 1.127
LysThr: 2.725 ± 0.923
LysVal: 3.633 ± 1.307
LysTrp: 0.0 ± 0.0
LysTyr: 4.541 ± 1.006
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.817 ± 1.116
LeuCys: 2.725 ± 1.152
LeuAsp: 6.358 ± 2.257
LeuGlu: 6.358 ± 2.148
LeuPhe: 2.725 ± 0.881
LeuGly: 2.725 ± 1.55
LeuHis: 4.541 ± 1.467
LeuIle: 4.541 ± 2.411
LeuLys: 3.633 ± 1.044
LeuLeu: 4.541 ± 2.711
LeuMet: 0.0 ± 0.0
LeuAsn: 3.633 ± 1.307
LeuPro: 1.817 ± 1.354
LeuGln: 3.633 ± 1.008
LeuArg: 5.45 ± 1.784
LeuSer: 5.45 ± 2.359
LeuThr: 3.633 ± 1.088
LeuVal: 3.633 ± 1.68
LeuTrp: 0.0 ± 0.0
LeuTyr: 6.358 ± 2.227
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.817 ± 0.738
MetCys: 0.0 ± 0.0
MetAsp: 2.725 ± 1.643
MetGlu: 1.817 ± 0.957
MetPhe: 0.908 ± 0.769
MetGly: 2.725 ± 1.162
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 0.908 ± 0.974
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.908 ± 0.93
MetGln: 0.908 ± 0.992
MetArg: 0.0 ± 0.0
MetSer: 2.725 ± 0.923
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 1.817 ± 0.947
MetTyr: 2.725 ± 2.306
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.633 ± 1.127
AsnCys: 0.908 ± 0.992
AsnAsp: 3.633 ± 0.996
AsnGlu: 1.817 ± 0.738
AsnPhe: 2.725 ± 1.368
AsnGly: 1.817 ± 0.998
AsnHis: 6.358 ± 3.086
AsnIle: 1.817 ± 0.738
AsnLys: 1.817 ± 0.738
AsnLeu: 3.633 ± 1.723
AsnMet: 3.633 ± 1.236
AsnAsn: 2.725 ± 1.162
AsnPro: 2.725 ± 0.793
AsnGln: 1.817 ± 0.861
AsnArg: 0.908 ± 0.769
AsnSer: 3.633 ± 1.871
AsnThr: 1.817 ± 0.957
AsnVal: 4.541 ± 1.006
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.725 ± 1.294
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.908 ± 0.769
ProCys: 2.725 ± 1.366
ProAsp: 2.725 ± 0.858
ProGlu: 1.817 ± 0.947
ProPhe: 1.817 ± 0.861
ProGly: 1.817 ± 0.738
ProHis: 4.541 ± 1.733
ProIle: 3.633 ± 1.724
ProLys: 2.725 ± 1.929
ProLeu: 4.541 ± 1.374
ProMet: 2.725 ± 0.8
ProAsn: 1.817 ± 0.861
ProPro: 1.817 ± 1.286
ProGln: 2.725 ± 2.022
ProArg: 5.45 ± 1.293
ProSer: 4.541 ± 1.791
ProThr: 4.541 ± 2.238
ProVal: 2.725 ± 1.651
ProTrp: 0.908 ± 0.643
ProTyr: 0.908 ± 0.769
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.633 ± 1.303
GlnCys: 0.908 ± 0.643
GlnAsp: 3.633 ± 1.875
GlnGlu: 3.633 ± 2.272
GlnPhe: 3.633 ± 1.723
GlnGly: 2.725 ± 1.294
GlnHis: 1.817 ± 1.354
GlnIle: 4.541 ± 1.831
GlnLys: 1.817 ± 1.948
GlnLeu: 1.817 ± 1.297
GlnMet: 0.908 ± 0.93
GlnAsn: 2.725 ± 1.368
GlnPro: 4.541 ± 3.575
GlnGln: 0.908 ± 0.806
GlnArg: 3.633 ± 1.044
GlnSer: 5.45 ± 1.179
GlnThr: 1.817 ± 1.297
GlnVal: 5.45 ± 1.508
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.908 ± 0.769
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.541 ± 1.73
ArgCys: 0.908 ± 0.974
ArgAsp: 4.541 ± 1.27
ArgGlu: 3.633 ± 1.871
ArgPhe: 4.541 ± 2.054
ArgGly: 3.633 ± 1.307
ArgHis: 2.725 ± 1.237
ArgIle: 2.725 ± 1.375
ArgLys: 1.817 ± 1.123
ArgLeu: 3.633 ± 1.755
ArgMet: 1.817 ± 1.537
ArgAsn: 2.725 ± 1.984
ArgPro: 7.266 ± 2.159
ArgGln: 1.817 ± 1.328
ArgArg: 6.358 ± 4.062
ArgSer: 6.358 ± 1.873
ArgThr: 2.725 ± 1.539
ArgVal: 3.633 ± 1.581
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.908 ± 0.974
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.266 ± 2.544
SerCys: 0.908 ± 0.974
SerAsp: 2.725 ± 0.881
SerGlu: 2.725 ± 2.196
SerPhe: 1.817 ± 0.861
SerGly: 4.541 ± 1.735
SerHis: 0.908 ± 0.93
SerIle: 6.358 ± 2.778
SerLys: 7.266 ± 2.022
SerLeu: 4.541 ± 1.967
SerMet: 0.908 ± 0.93
SerAsn: 4.541 ± 1.586
SerPro: 9.083 ± 3.055
SerGln: 1.817 ± 0.738
SerArg: 3.633 ± 1.403
SerSer: 11.807 ± 4.616
SerThr: 8.174 ± 3.612
SerVal: 2.725 ± 2.306
SerTrp: 0.908 ± 0.769
SerTyr: 2.725 ± 0.923
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.541 ± 2.137
ThrCys: 0.0 ± 0.0
ThrAsp: 1.817 ± 1.177
ThrGlu: 3.633 ± 1.051
ThrPhe: 1.817 ± 0.957
ThrGly: 3.633 ± 0.782
ThrHis: 5.45 ± 1.25
ThrIle: 2.725 ± 1.051
ThrLys: 2.725 ± 1.152
ThrLeu: 4.541 ± 1.684
ThrMet: 0.908 ± 0.643
ThrAsn: 4.541 ± 1.389
ThrPro: 2.725 ± 1.363
ThrGln: 3.633 ± 1.853
ThrArg: 3.633 ± 1.491
ThrSer: 4.541 ± 1.967
ThrThr: 1.817 ± 1.297
ThrVal: 4.541 ± 2.353
ThrTrp: 1.817 ± 1.297
ThrTyr: 2.725 ± 0.858
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.817 ± 1.354
ValCys: 0.908 ± 0.974
ValAsp: 2.725 ± 1.289
ValGlu: 1.817 ± 1.948
ValPhe: 1.817 ± 0.861
ValGly: 2.725 ± 1.651
ValHis: 0.908 ± 0.974
ValIle: 5.45 ± 1.766
ValLys: 2.725 ± 0.858
ValLeu: 7.266 ± 1.834
ValMet: 0.0 ± 0.0
ValAsn: 0.908 ± 0.974
ValPro: 4.541 ± 1.14
ValGln: 4.541 ± 2.353
ValArg: 4.541 ± 1.276
ValSer: 4.541 ± 1.27
ValThr: 2.725 ± 1.363
ValVal: 0.908 ± 0.974
ValTrp: 0.0 ± 0.0
ValTyr: 4.541 ± 1.955
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.633 ± 1.714
TrpCys: 0.0 ± 0.0
TrpAsp: 0.908 ± 0.974
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.908 ± 0.769
TrpLeu: 0.908 ± 0.769
TrpMet: 0.908 ± 0.769
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.908 ± 0.643
TrpArg: 2.725 ± 1.229
TrpSer: 0.0 ± 0.0
TrpThr: 1.817 ± 1.611
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.908 ± 0.643
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.725 ± 1.363
TyrCys: 0.0 ± 0.0
TyrAsp: 2.725 ± 1.359
TyrGlu: 1.817 ± 1.537
TyrPhe: 2.725 ± 0.793
TyrGly: 2.725 ± 1.152
TyrHis: 0.0 ± 0.0
TyrIle: 0.908 ± 0.769
TyrLys: 0.0 ± 0.0
TyrLeu: 5.45 ± 1.672
TyrMet: 2.725 ± 0.937
TyrAsn: 2.725 ± 0.881
TyrPro: 1.817 ± 0.957
TyrGln: 0.908 ± 0.643
TyrArg: 3.633 ± 2.321
TyrSer: 2.725 ± 1.294
TyrThr: 1.817 ± 0.861
TyrVal: 2.725 ± 1.597
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1102 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
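A protein-level bootstrap of this kind can be sketched as follows: resample whole proteins with replacement, recompute the frequency of interest for each resample, and take the spread over the replicates. The function names, the fixed seed, and the use of the population standard deviation as the reported error are assumptions for illustration; the database's actual procedure may differ in detail.

```python
import random
import statistics


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_per_mille(sample, pair))
    return statistics.pstdev(replicates)


toy = ["MSSAL", "ASSK", "MKLV"]  # made-up sequences, not from this proteome
print(dipeptide_per_mille(toy, "SS"))
print(bootstrap_error(toy, "SS"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in a small proteome such as this one.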

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski