Amino acid dipeptide frequency for Wissadula yellow mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
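As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping pairs of adjacent residues in a list of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, and the normalisation by the total number of pairs are assumptions made for this example; the database may count or normalise differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: sequences are assumed to use one-letter codes.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real viral sequences.
freqs = dipeptide_frequencies(["MKKA", "AKK"])
```

In the toy input there are five pairs in total (MK, KK, KA, AK, KK), so `freqs["KK"]` is 2 out of 5 pairs expressed in per mille.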

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
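The uniform expectation and the over/under classification can be sketched as follows. The page does not state the exact rule used for the red and blue colouring, so the simple comparison against the uniform expectation is an assumption, as are the function names.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation.

    Assumed rule: strictly above is "over", strictly below is "under".
    """
    expected = expected_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


print(expected_per_mille(20))  # 400 dipeptides -> 2.5 per mille (0.25%)
print(classify(7.984))         # e.g. the IleLys value from the table
print(classify(0.998))         # e.g. the AlaCys value from the table
```

With 21 symbols (including Xaa) the expectation drops to 1000/441, roughly 2.27‰.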

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
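For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical and only illustrate the arithmetic.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


fraction = per_mille_to_fraction(3.992)  # the AlaAla value from the table
percent = per_mille_to_percent(2.5)      # the uniform expectation for 400 dipeptides
```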

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.992AlaAla: 3.992 ± 1.838
0.998AlaCys: 0.998 ± 0.927
1.996AlaAsp: 1.996 ± 1.197
2.994AlaGlu: 2.994 ± 1.411
0.0AlaPhe: 0.0 ± 0.0
2.994AlaGly: 2.994 ± 1.329
1.996AlaHis: 1.996 ± 1.237
0.998AlaIle: 0.998 ± 1.236
5.988AlaLys: 5.988 ± 1.297
5.988AlaLeu: 5.988 ± 2.658
0.998AlaMet: 0.998 ± 0.927
1.996AlaAsn: 1.996 ± 0.919
2.994AlaPro: 2.994 ± 1.329
1.996AlaGln: 1.996 ± 0.919
4.99AlaArg: 4.99 ± 2.534
5.988AlaSer: 5.988 ± 1.937
2.994AlaThr: 2.994 ± 1.844
3.992AlaVal: 3.992 ± 2.138
0.0AlaTrp: 0.0 ± 0.0
0.998AlaTyr: 0.998 ± 1.158
0.0AlaXaa: 0.0 ± 0.0
Cys
1.996CysAla: 1.996 ± 2.471
0.0CysCys: 0.0 ± 0.0
0.998CysAsp: 0.998 ± 0.684
1.996CysGlu: 1.996 ± 0.919
0.998CysPhe: 0.998 ± 1.194
0.998CysGly: 0.998 ± 1.236
0.0CysHis: 0.0 ± 0.0
1.996CysIle: 1.996 ± 1.237
1.996CysLys: 1.996 ± 0.919
1.996CysLeu: 1.996 ± 1.063
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.998CysSer: 0.998 ± 1.236
1.996CysThr: 1.996 ± 1.237
0.998CysVal: 0.998 ± 0.927
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.998AspAla: 0.998 ± 0.927
0.0AspCys: 0.0 ± 0.0
3.992AspAsp: 3.992 ± 1.145
2.994AspGlu: 2.994 ± 2.053
3.992AspPhe: 3.992 ± 1.11
0.998AspGly: 0.998 ± 0.684
0.0AspHis: 0.0 ± 0.0
1.996AspIle: 1.996 ± 1.237
0.998AspLys: 0.998 ± 0.684
5.988AspLeu: 5.988 ± 1.878
0.0AspMet: 0.0 ± 0.0
2.994AspAsn: 2.994 ± 2.001
1.996AspPro: 1.996 ± 1.369
1.996AspGln: 1.996 ± 1.363
2.994AspArg: 2.994 ± 1.844
3.992AspSer: 3.992 ± 1.145
1.996AspThr: 1.996 ± 1.217
5.988AspVal: 5.988 ± 1.697
0.998AspTrp: 0.998 ± 0.684
0.998AspTyr: 0.998 ± 0.684
0.0AspXaa: 0.0 ± 0.0
Glu
2.994GluAla: 2.994 ± 1.329
0.998GluCys: 0.998 ± 1.236
0.998GluAsp: 0.998 ± 1.158
3.992GluGlu: 3.992 ± 2.014
1.996GluPhe: 1.996 ± 1.197
4.99GluGly: 4.99 ± 1.525
0.998GluHis: 0.998 ± 0.684
2.994GluIle: 2.994 ± 1.361
0.0GluLys: 0.0 ± 0.0
1.996GluLeu: 1.996 ± 1.217
0.998GluMet: 0.998 ± 0.684
4.99GluAsn: 4.99 ± 2.211
1.996GluPro: 1.996 ± 0.919
1.996GluGln: 1.996 ± 1.363
2.994GluArg: 2.994 ± 1.33
4.99GluSer: 4.99 ± 2.381
0.998GluThr: 0.998 ± 0.684
0.998GluVal: 0.998 ± 0.684
2.994GluTrp: 2.994 ± 0.976
1.996GluTyr: 1.996 ± 1.197
0.0GluXaa: 0.0 ± 0.0
Phe
1.996PheAla: 1.996 ± 1.217
0.998PheCys: 0.998 ± 0.927
0.998PheAsp: 0.998 ± 0.684
0.0PheGlu: 0.0 ± 0.0
0.998PhePhe: 0.998 ± 0.684
1.996PheGly: 1.996 ± 0.919
1.996PheHis: 1.996 ± 1.369
0.998PheIle: 0.998 ± 0.684
1.996PheLys: 1.996 ± 2.315
2.994PheLeu: 2.994 ± 2.053
0.998PheMet: 0.998 ± 1.194
5.988PheAsn: 5.988 ± 2.308
0.998PhePro: 0.998 ± 1.236
3.992PheGln: 3.992 ± 1.042
2.994PheArg: 2.994 ± 1.361
1.996PheSer: 1.996 ± 1.824
4.99PheThr: 4.99 ± 3.339
0.0PheVal: 0.0 ± 0.0
2.994PheTrp: 2.994 ± 1.804
0.998PheTyr: 0.998 ± 0.927
0.0PheXaa: 0.0 ± 0.0
Gly
0.998GlyAla: 0.998 ± 0.684
1.996GlyCys: 1.996 ± 1.237
2.994GlyAsp: 2.994 ± 2.053
3.992GlyGlu: 3.992 ± 1.732
0.0GlyPhe: 0.0 ± 0.0
3.992GlyGly: 3.992 ± 1.838
2.994GlyHis: 2.994 ± 1.329
1.996GlyIle: 1.996 ± 0.919
5.988GlyLys: 5.988 ± 2.757
3.992GlyLeu: 3.992 ± 1.248
0.998GlyMet: 0.998 ± 0.684
3.992GlyAsn: 3.992 ± 1.48
1.996GlyPro: 1.996 ± 0.919
4.99GlyGln: 4.99 ± 0.968
1.996GlyArg: 1.996 ± 1.197
1.996GlySer: 1.996 ± 1.197
2.994GlyThr: 2.994 ± 0.976
1.996GlyVal: 1.996 ± 2.315
0.0GlyTrp: 0.0 ± 0.0
1.996GlyTyr: 1.996 ± 1.237
0.0GlyXaa: 0.0 ± 0.0
His
1.996HisAla: 1.996 ± 0.919
1.996HisCys: 1.996 ± 1.197
0.998HisAsp: 0.998 ± 0.927
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.998HisHis: 0.998 ± 1.236
0.998HisIle: 0.998 ± 1.236
0.0HisLys: 0.0 ± 0.0
4.99HisLeu: 4.99 ± 2.107
0.0HisMet: 0.0 ± 0.0
4.99HisAsn: 4.99 ± 2.054
1.996HisPro: 1.996 ± 1.369
1.996HisGln: 1.996 ± 1.063
2.994HisArg: 2.994 ± 2.292
3.992HisSer: 3.992 ± 2.466
5.988HisThr: 5.988 ± 3.092
1.996HisVal: 1.996 ± 1.229
0.998HisTrp: 0.998 ± 0.684
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.994IleAla: 2.994 ± 1.804
0.0IleCys: 0.0 ± 0.0
1.996IleAsp: 1.996 ± 1.197
0.0IleGlu: 0.0 ± 0.0
0.998IlePhe: 0.998 ± 0.684
1.996IleGly: 1.996 ± 0.919
0.998IleHis: 0.998 ± 1.236
3.992IleIle: 3.992 ± 1.11
7.984IleLys: 7.984 ± 1.417
0.0IleLeu: 0.0 ± 0.0
0.0IleMet: 0.0 ± 0.0
1.996IleAsn: 1.996 ± 1.636
1.996IlePro: 1.996 ± 2.471
3.992IleGln: 3.992 ± 2.04
3.992IleArg: 3.992 ± 1.042
4.99IleSer: 4.99 ± 0.968
4.99IleThr: 4.99 ± 2.288
1.996IleVal: 1.996 ± 1.063
1.996IleTrp: 1.996 ± 2.315
3.992IleTyr: 3.992 ± 1.48
0.0IleXaa: 0.0 ± 0.0
Lys
3.992LysAla: 3.992 ± 1.145
0.0LysCys: 0.0 ± 0.0
5.988LysAsp: 5.988 ± 4.107
2.994LysGlu: 2.994 ± 2.053
1.996LysPhe: 1.996 ± 1.229
2.994LysGly: 2.994 ± 1.508
0.998LysHis: 0.998 ± 0.684
1.996LysIle: 1.996 ± 1.854
1.996LysLys: 1.996 ± 2.471
5.988LysLeu: 5.988 ± 3.09
0.0LysMet: 0.0 ± 0.0
4.99LysAsn: 4.99 ± 2.18
3.992LysPro: 3.992 ± 1.904
1.996LysGln: 1.996 ± 2.389
3.992LysArg: 3.992 ± 2.048
2.994LysSer: 2.994 ± 0.976
4.99LysThr: 4.99 ± 2.15
4.99LysVal: 4.99 ± 4.635
0.998LysTrp: 0.998 ± 0.684
0.998LysTyr: 0.998 ± 0.684
0.0LysXaa: 0.0 ± 0.0
Leu
0.0LeuAla: 0.0 ± 0.0
0.0LeuCys: 0.0 ± 0.0
5.988LeuAsp: 5.988 ± 1.853
4.99LeuGlu: 4.99 ± 2.456
3.992LeuPhe: 3.992 ± 1.248
3.992LeuGly: 3.992 ± 1.732
5.988LeuHis: 5.988 ± 1.898
2.994LeuIle: 2.994 ± 1.599
7.984LeuLys: 7.984 ± 2.551
2.994LeuLeu: 2.994 ± 2.001
0.0LeuMet: 0.0 ± 0.0
1.996LeuAsn: 1.996 ± 1.229
3.992LeuPro: 3.992 ± 1.732
1.996LeuGln: 1.996 ± 1.369
3.992LeuArg: 3.992 ± 1.605
6.986LeuSer: 6.986 ± 2.898
3.992LeuThr: 3.992 ± 1.471
4.99LeuVal: 4.99 ± 2.211
0.0LeuTrp: 0.0 ± 0.0
4.99LeuTyr: 4.99 ± 1.975
0.0LeuXaa: 0.0 ± 0.0
Met
1.996MetAla: 1.996 ± 1.854
0.998MetCys: 0.998 ± 0.927
3.992MetAsp: 3.992 ± 1.605
0.0MetGlu: 0.0 ± 0.0
0.998MetPhe: 0.998 ± 0.927
0.0MetGly: 0.0 ± 0.0
0.998MetHis: 0.998 ± 1.194
0.0MetIle: 0.0 ± 0.0
0.998MetLys: 0.998 ± 1.194
1.996MetLeu: 1.996 ± 1.063
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
3.992MetPro: 3.992 ± 1.248
0.998MetGln: 0.998 ± 0.684
0.0MetArg: 0.0 ± 0.0
0.0MetSer: 0.0 ± 0.0
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
0.998MetTrp: 0.998 ± 0.684
3.992MetTyr: 3.992 ± 1.918
0.0MetXaa: 0.0 ± 0.0
Asn
8.982AsnAla: 8.982 ± 3.974
1.996AsnCys: 1.996 ± 1.197
0.998AsnAsp: 0.998 ± 0.927
2.994AsnGlu: 2.994 ± 1.714
0.998AsnPhe: 0.998 ± 1.158
3.992AsnGly: 3.992 ± 1.918
4.99AsnHis: 4.99 ± 2.994
3.992AsnIle: 3.992 ± 1.11
3.992AsnLys: 3.992 ± 2.028
3.992AsnLeu: 3.992 ± 1.664
3.992AsnMet: 3.992 ± 1.502
4.99AsnAsn: 4.99 ± 2.388
2.994AsnPro: 2.994 ± 1.489
1.996AsnGln: 1.996 ± 1.363
2.994AsnArg: 2.994 ± 1.081
0.998AsnSer: 0.998 ± 0.927
1.996AsnThr: 1.996 ± 1.063
3.992AsnVal: 3.992 ± 2.138
0.998AsnTrp: 0.998 ± 0.684
4.99AsnTyr: 4.99 ± 1.815
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
2.994ProCys: 2.994 ± 2.292
3.992ProAsp: 3.992 ± 1.248
3.992ProGlu: 3.992 ± 1.595
1.996ProPhe: 1.996 ± 1.217
0.0ProGly: 0.0 ± 0.0
2.994ProHis: 2.994 ± 1.508
0.998ProIle: 0.998 ± 0.684
3.992ProLys: 3.992 ± 2.04
1.996ProLeu: 1.996 ± 1.217
0.998ProMet: 0.998 ± 0.927
2.994ProAsn: 2.994 ± 0.976
1.996ProPro: 1.996 ± 1.197
3.992ProGln: 3.992 ± 1.664
4.99ProArg: 4.99 ± 2.59
7.984ProSer: 7.984 ± 2.897
1.996ProThr: 1.996 ± 1.063
3.992ProVal: 3.992 ± 1.248
1.996ProTrp: 1.996 ± 0.919
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
4.99GlnAla: 4.99 ± 1.057
1.996GlnCys: 1.996 ± 1.063
1.996GlnAsp: 1.996 ± 1.824
7.984GlnGlu: 7.984 ± 2.964
0.0GlnPhe: 0.0 ± 0.0
0.998GlnGly: 0.998 ± 1.236
0.998GlnHis: 0.998 ± 1.194
3.992GlnIle: 3.992 ± 2.253
0.998GlnLys: 0.998 ± 0.684
2.994GlnLeu: 2.994 ± 1.508
1.996GlnMet: 1.996 ± 1.063
0.0GlnAsn: 0.0 ± 0.0
1.996GlnPro: 1.996 ± 1.824
0.0GlnGln: 0.0 ± 0.0
2.994GlnArg: 2.994 ± 2.001
3.992GlnSer: 3.992 ± 1.11
1.996GlnThr: 1.996 ± 1.574
2.994GlnVal: 2.994 ± 1.489
0.0GlnTrp: 0.0 ± 0.0
2.994GlnTyr: 2.994 ± 1.329
0.0GlnXaa: 0.0 ± 0.0
Arg
2.994ArgAla: 2.994 ± 1.501
0.998ArgCys: 0.998 ± 1.194
2.994ArgAsp: 2.994 ± 2.781
0.998ArgGlu: 0.998 ± 0.684
7.984ArgPhe: 7.984 ± 3.886
5.988ArgGly: 5.988 ± 2.569
1.996ArgHis: 1.996 ± 1.229
4.99ArgIle: 4.99 ± 1.539
1.996ArgLys: 1.996 ± 0.919
3.992ArgLeu: 3.992 ± 1.664
0.998ArgMet: 0.998 ± 1.194
2.994ArgAsn: 2.994 ± 1.037
3.992ArgPro: 3.992 ± 1.838
0.998ArgGln: 0.998 ± 1.236
7.984ArgArg: 7.984 ± 3.091
5.988ArgSer: 5.988 ± 1.305
3.992ArgThr: 3.992 ± 1.605
5.988ArgVal: 5.988 ± 3.036
0.0ArgTrp: 0.0 ± 0.0
0.998ArgTyr: 0.998 ± 0.927
0.0ArgXaa: 0.0 ± 0.0
Ser
2.994SerAla: 2.994 ± 2.053
0.0SerCys: 0.0 ± 0.0
1.996SerAsp: 1.996 ± 0.919
0.998SerGlu: 0.998 ± 1.194
4.99SerPhe: 4.99 ± 1.887
6.986SerGly: 6.986 ± 3.152
1.996SerHis: 1.996 ± 1.824
5.988SerIle: 5.988 ± 4.01
2.994SerLys: 2.994 ± 1.037
4.99SerLeu: 4.99 ± 1.658
2.994SerMet: 2.994 ± 1.569
8.982SerAsn: 8.982 ± 2.579
6.986SerPro: 6.986 ± 2.682
1.996SerGln: 1.996 ± 2.389
4.99SerArg: 4.99 ± 2.323
6.986SerSer: 6.986 ± 4.525
5.988SerThr: 5.988 ± 3.418
4.99SerVal: 4.99 ± 2.59
1.996SerTrp: 1.996 ± 1.063
2.994SerTyr: 2.994 ± 1.081
0.0SerXaa: 0.0 ± 0.0
Thr
3.992ThrAla: 3.992 ± 2.043
0.0ThrCys: 0.0 ± 0.0
0.0ThrAsp: 0.0 ± 0.0
0.998ThrGlu: 0.998 ± 0.927
1.996ThrPhe: 1.996 ± 1.063
3.992ThrGly: 3.992 ± 1.605
3.992ThrHis: 3.992 ± 2.028
1.996ThrIle: 1.996 ± 1.197
0.998ThrLys: 0.998 ± 0.684
2.994ThrLeu: 2.994 ± 1.714
0.998ThrMet: 0.998 ± 0.684
4.99ThrAsn: 4.99 ± 2.107
4.99ThrPro: 4.99 ± 3.769
3.992ThrGln: 3.992 ± 3.318
2.994ThrArg: 2.994 ± 2.274
7.984ThrSer: 7.984 ± 4.412
4.99ThrThr: 4.99 ± 4.003
5.988ThrVal: 5.988 ± 2.138
0.0ThrTrp: 0.0 ± 0.0
4.99ThrTyr: 4.99 ± 2.107
0.0ThrXaa: 0.0 ± 0.0
Val
0.998ValAla: 0.998 ± 0.684
0.0ValCys: 0.0 ± 0.0
0.0ValAsp: 0.0 ± 0.0
2.994ValGlu: 2.994 ± 1.599
3.992ValPhe: 3.992 ± 1.595
1.996ValGly: 1.996 ± 0.919
0.0ValHis: 0.0 ± 0.0
3.992ValIle: 3.992 ± 2.292
3.992ValLys: 3.992 ± 1.838
3.992ValLeu: 3.992 ± 2.459
1.996ValMet: 1.996 ± 1.854
6.986ValAsn: 6.986 ± 1.769
3.992ValPro: 3.992 ± 1.327
4.99ValGln: 4.99 ± 2.456
5.988ValArg: 5.988 ± 3.144
5.988ValSer: 5.988 ± 2.075
1.996ValThr: 1.996 ± 1.854
0.998ValVal: 0.998 ± 0.684
1.996ValTrp: 1.996 ± 1.363
3.992ValTyr: 3.992 ± 2.459
0.0ValXaa: 0.0 ± 0.0
Trp
3.992TrpAla: 3.992 ± 1.595
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.998TrpGlu: 0.998 ± 1.158
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.996TrpLys: 1.996 ± 0.919
0.998TrpLeu: 0.998 ± 0.927
1.996TrpMet: 1.996 ± 1.246
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.998TrpGln: 0.998 ± 0.684
1.996TrpArg: 1.996 ± 1.237
1.996TrpSer: 1.996 ± 1.369
1.996TrpThr: 1.996 ± 1.217
1.996TrpVal: 1.996 ± 0.919
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.996TyrAla: 1.996 ± 1.854
0.998TyrCys: 0.998 ± 0.684
2.994TyrAsp: 2.994 ± 2.781
0.998TyrGlu: 0.998 ± 0.927
2.994TyrPhe: 2.994 ± 1.081
2.994TyrGly: 2.994 ± 0.976
1.996TyrHis: 1.996 ± 1.217
3.992TyrIle: 3.992 ± 1.327
1.996TyrLys: 1.996 ± 1.063
6.986TyrLeu: 6.986 ± 3.831
0.998TyrMet: 0.998 ± 1.217
0.998TyrAsn: 0.998 ± 0.927
0.998TyrPro: 0.998 ± 0.684
0.998TyrGln: 0.998 ± 0.684
2.994TyrArg: 2.994 ± 1.844
1.996TyrSer: 1.996 ± 0.919
1.996TyrThr: 1.996 ± 1.197
1.996TyrVal: 1.996 ± 2.315
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1003 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
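A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. That the spread is a standard deviation, and the names `pair_per_mille` and `bootstrap_error`, are assumptions for this example; the page only states that 100 bootstrap replicates at the protein level were used.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide over all adjacent pairs, in per mille."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy input, not real viral sequences.
toy = ["MKKA", "AKKLL", "MLLA"]
err_kk = bootstrap_error(toy, "KK")
err_ww = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs has frequency 0 in every resample, so its estimated error is also 0, which is consistent with entries such as `0.0 ± 0.0` in the table.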

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski