Amino acid dipeptide frequency for Epirus cherry virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
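As a rough illustration of how such frequencies could be computed, here is a minimal Python sketch. It is not the Proteome-pI pipeline: it assumes one-letter protein sequences, counts overlapping pairs within each protein only, and maps every non-standard letter to X (Xaa). The function name `dipeptide_permille` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins, alphabet=STANDARD + "X"):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs: positions (0,1), (1,2), ... within one protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKKLA", "AAB"]
freqs = dipeptide_permille(toy)
print(len(freqs), round(freqs["KK"], 3))
```

The returned dictionary always has 441 keys, so dipeptides that never occur appear with a frequency of 0.0, as in the table.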

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
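The expected value and the colouring rule can be sketched in a few lines of Python. This assumes the threshold is the uniform expectation over the 400 standard dipeptides (1000/400 = 2.5‰); the exact rule used to colour the table is not stated here, and the function name `colour` is hypothetical. The three example values are taken from the table.

```python
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def colour(freq_permille, expected=EXPECTED_PERMILLE):
    """Classify a dipeptide frequency against the uniform expectation."""
    if freq_permille > expected:
        return "red"   # more frequent than expected
    if freq_permille < expected:
        return "blue"  # underrepresented
    return "none"      # exactly at the expectation


# Values copied from the table (per mille).
examples = {"ArgArg": 12.763, "AlaCys": 0.751, "AlaLys": 0.0}
labels = {name: colour(value) for name, value in examples.items()}
print(labels)
```

If Xaa is counted as a 21st residue, the same function could be called with `expected=1000.0 / 441` instead.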

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
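As a small worked example of the unit conversion, the Python snippet below converts the AlaAla value from the table into a fraction and a percentage. The helper name `permille_to_fraction` is hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala_fraction = permille_to_fraction(4.505)  # AlaAla value from the table
ala_ala_percent = ala_ala_fraction * 100        # the same value as a percentage
print(ala_ala_fraction, ala_ala_percent)
```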

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.505AlaAla: 4.505 ± 1.511
0.751AlaCys: 0.751 ± 0.422
3.003AlaAsp: 3.003 ± 1.9
2.252AlaGlu: 2.252 ± 1.267
1.502AlaPhe: 1.502 ± 0.671
5.255AlaGly: 5.255 ± 1.368
1.502AlaHis: 1.502 ± 0.845
3.003AlaIle: 3.003 ± 3.267
0.0AlaLys: 0.0 ± 0.0
7.508AlaLeu: 7.508 ± 2.184
5.255AlaMet: 5.255 ± 1.906
0.0AlaAsn: 0.0 ± 0.0
6.757AlaPro: 6.757 ± 1.065
2.252AlaGln: 2.252 ± 0.611
9.76AlaArg: 9.76 ± 0.366
2.252AlaSer: 2.252 ± 0.611
3.003AlaThr: 3.003 ± 1.69
3.754AlaVal: 3.754 ± 1.426
0.751AlaTrp: 0.751 ± 0.422
3.754AlaTyr: 3.754 ± 1.211
0.0AlaXaa: 0.0 ± 0.0
Cys
0.751CysAla: 0.751 ± 1.178
0.0CysCys: 0.0 ± 0.0
0.751CysAsp: 0.751 ± 0.422
0.751CysGlu: 0.751 ± 0.422
0.0CysPhe: 0.0 ± 0.0
1.502CysGly: 1.502 ± 0.845
0.0CysHis: 0.0 ± 0.0
1.502CysIle: 1.502 ± 0.671
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.751CysAsn: 0.751 ± 0.422
1.502CysPro: 1.502 ± 0.845
0.751CysGln: 0.751 ± 0.422
0.751CysArg: 0.751 ± 0.422
0.751CysSer: 0.751 ± 0.422
0.0CysThr: 0.0 ± 0.0
1.502CysVal: 1.502 ± 0.845
0.0CysTrp: 0.0 ± 0.0
0.751CysTyr: 0.751 ± 0.422
0.0CysXaa: 0.0 ± 0.0
Asp
1.502AspAla: 1.502 ± 0.95
0.751AspCys: 0.751 ± 0.422
2.252AspAsp: 2.252 ± 0.881
3.754AspGlu: 3.754 ± 1.211
0.751AspPhe: 0.751 ± 1.178
3.003AspGly: 3.003 ± 0.808
2.252AspHis: 2.252 ± 0.611
3.003AspIle: 3.003 ± 0.808
0.0AspLys: 0.0 ± 0.0
6.006AspLeu: 6.006 ± 4.145
0.0AspMet: 0.0 ± 0.0
0.751AspAsn: 0.751 ± 0.422
0.751AspPro: 0.751 ± 0.422
1.502AspGln: 1.502 ± 0.845
3.754AspArg: 3.754 ± 0.37
1.502AspSer: 1.502 ± 2.356
3.754AspThr: 3.754 ± 1.426
3.754AspVal: 3.754 ± 0.37
0.751AspTrp: 0.751 ± 0.422
1.502AspTyr: 1.502 ± 0.845
0.0AspXaa: 0.0 ± 0.0
Glu
4.505GluAla: 4.505 ± 1.511
0.751GluCys: 0.751 ± 0.422
0.751GluAsp: 0.751 ± 0.422
6.757GluGlu: 6.757 ± 1.925
2.252GluPhe: 2.252 ± 1.267
3.003GluGly: 3.003 ± 0.808
3.754GluHis: 3.754 ± 1.211
3.003GluIle: 3.003 ± 1.002
2.252GluLys: 2.252 ± 0.611
6.006GluLeu: 6.006 ± 1.616
2.252GluMet: 2.252 ± 0.611
0.0GluAsn: 0.0 ± 0.0
3.003GluPro: 3.003 ± 2.509
3.003GluGln: 3.003 ± 0.808
6.006GluArg: 6.006 ± 1.797
4.505GluSer: 4.505 ± 1.221
0.0GluThr: 0.0 ± 0.0
4.505GluVal: 4.505 ± 0.333
0.751GluTrp: 0.751 ± 0.422
1.502GluTyr: 1.502 ± 0.671
0.0GluXaa: 0.0 ± 0.0
Phe
1.502PheAla: 1.502 ± 2.356
0.0PheCys: 0.0 ± 0.0
0.751PheAsp: 0.751 ± 0.422
2.252PheGlu: 2.252 ± 0.611
0.751PhePhe: 0.751 ± 0.422
0.0PheGly: 0.0 ± 0.0
0.0PheHis: 0.0 ± 0.0
0.751PheIle: 0.751 ± 0.422
2.252PheLys: 2.252 ± 1.267
6.757PheLeu: 6.757 ± 2.239
0.751PheMet: 0.751 ± 0.422
0.751PheAsn: 0.751 ± 0.422
3.754PhePro: 3.754 ± 0.37
0.751PheGln: 0.751 ± 0.422
3.003PheArg: 3.003 ± 1.002
2.252PheSer: 2.252 ± 0.611
0.751PheThr: 0.751 ± 1.178
3.003PheVal: 3.003 ± 1.341
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
0.751GlyAla: 0.751 ± 0.94
1.502GlyCys: 1.502 ± 0.95
4.505GlyAsp: 4.505 ± 1.592
4.505GlyGlu: 4.505 ± 1.221
1.502GlyPhe: 1.502 ± 0.95
9.009GlyGly: 9.009 ± 2.036
2.252GlyHis: 2.252 ± 1.267
3.003GlyIle: 3.003 ± 1.002
5.255GlyLys: 5.255 ± 1.368
6.757GlyLeu: 6.757 ± 2.239
0.0GlyMet: 0.0 ± 0.0
1.502GlyAsn: 1.502 ± 0.671
5.255GlyPro: 5.255 ± 1.839
8.258GlyGln: 8.258 ± 2.558
5.255GlyArg: 5.255 ± 0.631
4.505GlySer: 4.505 ± 1.761
5.255GlyThr: 5.255 ± 1.306
6.006GlyVal: 6.006 ± 1.172
1.502GlyTrp: 1.502 ± 0.845
2.252GlyTyr: 2.252 ± 0.611
0.0GlyXaa: 0.0 ± 0.0
His
3.003HisAla: 3.003 ± 1.341
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.502HisGlu: 1.502 ± 0.671
0.0HisPhe: 0.0 ± 0.0
1.502HisGly: 1.502 ± 0.845
0.751HisHis: 0.751 ± 0.422
1.502HisIle: 1.502 ± 0.671
0.0HisLys: 0.0 ± 0.0
5.255HisLeu: 5.255 ± 1.368
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.502HisPro: 1.502 ± 0.845
0.0HisGln: 0.0 ± 0.0
0.751HisArg: 0.751 ± 0.422
3.754HisSer: 3.754 ± 2.235
1.502HisThr: 1.502 ± 0.671
0.751HisVal: 0.751 ± 0.422
0.751HisTrp: 0.751 ± 0.422
1.502HisTyr: 1.502 ± 0.845
0.0HisXaa: 0.0 ± 0.0
Ile
0.751IleAla: 0.751 ± 0.422
0.0IleCys: 0.0 ± 0.0
0.751IleAsp: 0.751 ± 0.94
3.754IleGlu: 3.754 ± 2.235
2.252IlePhe: 2.252 ± 0.881
1.502IleGly: 1.502 ± 0.845
2.252IleHis: 2.252 ± 0.611
3.754IleIle: 3.754 ± 0.37
2.252IleLys: 2.252 ± 1.122
3.003IleLeu: 3.003 ± 1.69
1.502IleMet: 1.502 ± 0.671
2.252IleAsn: 2.252 ± 0.881
7.508IlePro: 7.508 ± 5.146
4.505IleGln: 4.505 ± 1.548
3.754IleArg: 3.754 ± 1.426
5.255IleSer: 5.255 ± 3.981
6.006IleThr: 6.006 ± 1.566
1.502IleVal: 1.502 ± 0.95
0.751IleTrp: 0.751 ± 0.422
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.252LysAla: 2.252 ± 1.267
0.0LysCys: 0.0 ± 0.0
3.003LysAsp: 3.003 ± 0.808
3.754LysGlu: 3.754 ± 2.112
2.252LysPhe: 2.252 ± 0.611
3.003LysGly: 3.003 ± 1.341
0.0LysHis: 0.0 ± 0.0
1.502LysIle: 1.502 ± 0.845
0.0LysLys: 0.0 ± 0.0
2.252LysLeu: 2.252 ± 1.267
0.0LysMet: 0.0 ± 0.0
2.252LysAsn: 2.252 ± 1.267
0.751LysPro: 0.751 ± 0.422
1.502LysGln: 1.502 ± 0.845
2.252LysArg: 2.252 ± 0.611
1.502LysSer: 1.502 ± 0.671
3.003LysThr: 3.003 ± 0.721
1.502LysVal: 1.502 ± 0.671
0.751LysTrp: 0.751 ± 0.94
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
9.009LeuAla: 9.009 ± 2.036
2.252LeuCys: 2.252 ± 1.267
2.252LeuAsp: 2.252 ± 1.578
6.757LeuGlu: 6.757 ± 1.832
2.252LeuPhe: 2.252 ± 0.611
5.255LeuGly: 5.255 ± 0.665
3.754LeuHis: 3.754 ± 1.136
6.757LeuIle: 6.757 ± 2.254
8.258LeuLys: 8.258 ± 2.64
7.508LeuLeu: 7.508 ± 3.143
2.252LeuMet: 2.252 ± 0.881
5.255LeuAsn: 5.255 ± 1.839
6.006LeuPro: 6.006 ± 2.311
1.502LeuGln: 1.502 ± 1.535
8.258LeuArg: 8.258 ± 2.31
9.009LeuSer: 9.009 ± 1.669
3.003LeuThr: 3.003 ± 1.69
8.258LeuVal: 8.258 ± 0.563
2.252LeuTrp: 2.252 ± 0.611
3.003LeuTyr: 3.003 ± 1.69
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
0.751MetAsp: 0.751 ± 0.422
3.003MetGlu: 3.003 ± 1.341
0.0MetPhe: 0.0 ± 0.0
3.003MetGly: 3.003 ± 1.69
0.751MetHis: 0.751 ± 0.422
1.502MetIle: 1.502 ± 0.671
0.0MetLys: 0.0 ± 0.0
2.252MetLeu: 2.252 ± 1.267
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
2.252MetPro: 2.252 ± 1.578
0.0MetGln: 0.0 ± 0.0
3.003MetArg: 3.003 ± 1.341
2.252MetSer: 2.252 ± 2.098
0.751MetThr: 0.751 ± 0.422
2.252MetVal: 2.252 ± 0.881
0.0MetTrp: 0.0 ± 0.0
0.751MetTyr: 0.751 ± 0.422
0.0MetXaa: 0.0 ± 0.0
Asn
0.751AsnAla: 0.751 ± 0.422
0.0AsnCys: 0.0 ± 0.0
0.751AsnAsp: 0.751 ± 1.178
0.0AsnGlu: 0.0 ± 0.0
0.751AsnPhe: 0.751 ± 0.422
1.502AsnGly: 1.502 ± 0.95
0.0AsnHis: 0.0 ± 0.0
2.252AsnIle: 2.252 ± 0.881
0.751AsnLys: 0.751 ± 0.422
6.006AsnLeu: 6.006 ± 1.063
0.0AsnMet: 0.0 ± 0.0
0.751AsnAsn: 0.751 ± 0.94
2.252AsnPro: 2.252 ± 2.57
2.252AsnGln: 2.252 ± 1.122
6.757AsnArg: 6.757 ± 2.642
1.502AsnSer: 1.502 ± 0.671
0.751AsnThr: 0.751 ± 0.422
0.751AsnVal: 0.751 ± 0.422
0.0AsnTrp: 0.0 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
6.006ProAla: 6.006 ± 1.442
0.0ProCys: 0.0 ± 0.0
1.502ProAsp: 1.502 ± 1.535
6.006ProGlu: 6.006 ± 1.616
3.754ProPhe: 3.754 ± 2.112
6.757ProGly: 6.757 ± 3.636
2.252ProHis: 2.252 ± 1.578
6.006ProIle: 6.006 ± 4.837
1.502ProLys: 1.502 ± 0.671
4.505ProLeu: 4.505 ± 2.012
0.751ProMet: 0.751 ± 0.94
1.502ProAsn: 1.502 ± 0.845
3.754ProPro: 3.754 ± 2.235
1.502ProGln: 1.502 ± 0.845
3.754ProArg: 3.754 ± 1.426
6.757ProSer: 6.757 ± 1.065
3.754ProThr: 3.754 ± 1.211
6.757ProVal: 6.757 ± 1.475
1.502ProTrp: 1.502 ± 1.535
1.502ProTyr: 1.502 ± 0.671
0.0ProXaa: 0.0 ± 0.0
Gln
3.003GlnAla: 3.003 ± 1.9
0.0GlnCys: 0.0 ± 0.0
2.252GlnAsp: 2.252 ± 0.611
1.502GlnGlu: 1.502 ± 0.845
0.751GlnPhe: 0.751 ± 1.178
3.003GlnGly: 3.003 ± 0.721
0.0GlnHis: 0.0 ± 0.0
5.255GlnIle: 5.255 ± 3.219
0.0GlnLys: 0.0 ± 0.0
3.003GlnLeu: 3.003 ± 0.721
1.502GlnMet: 1.502 ± 0.814
0.0GlnAsn: 0.0 ± 0.0
3.754GlnPro: 3.754 ± 0.37
1.502GlnGln: 1.502 ± 0.95
3.754GlnArg: 3.754 ± 0.37
3.754GlnSer: 3.754 ± 2.112
2.252GlnThr: 2.252 ± 1.122
3.754GlnVal: 3.754 ± 2.112
0.0GlnTrp: 0.0 ± 0.0
1.502GlnTyr: 1.502 ± 0.671
0.0GlnXaa: 0.0 ± 0.0
Arg
6.757ArgAla: 6.757 ± 2.722
2.252ArgCys: 2.252 ± 1.267
2.252ArgAsp: 2.252 ± 1.122
3.754ArgGlu: 3.754 ± 1.262
1.502ArgPhe: 1.502 ± 0.845
9.009ArgGly: 9.009 ± 0.788
0.751ArgHis: 0.751 ± 0.422
6.757ArgIle: 6.757 ± 1.065
1.502ArgLys: 1.502 ± 0.845
8.258ArgLeu: 8.258 ± 0.563
1.502ArgMet: 1.502 ± 0.845
3.003ArgAsn: 3.003 ± 3.07
6.006ArgPro: 6.006 ± 1.566
3.003ArgGln: 3.003 ± 1.002
12.763ArgArg: 12.763 ± 4.705
7.508ArgSer: 7.508 ± 2.851
6.006ArgThr: 6.006 ± 1.616
8.258ArgVal: 8.258 ± 1.918
3.003ArgTrp: 3.003 ± 1.002
3.003ArgTyr: 3.003 ± 1.341
0.0ArgXaa: 0.0 ± 0.0
Ser
10.511SerAla: 10.511 ± 2.998
0.751SerCys: 0.751 ± 0.422
3.003SerAsp: 3.003 ± 0.721
4.505SerGlu: 4.505 ± 1.221
3.003SerPhe: 3.003 ± 1.69
9.009SerGly: 9.009 ± 3.095
2.252SerHis: 2.252 ± 1.578
0.751SerIle: 0.751 ± 0.422
1.502SerLys: 1.502 ± 0.845
8.258SerLeu: 8.258 ± 3.203
2.252SerMet: 2.252 ± 0.595
1.502SerAsn: 1.502 ± 0.95
4.505SerPro: 4.505 ± 1.592
2.252SerGln: 2.252 ± 0.611
7.508SerArg: 7.508 ± 1.632
6.006SerSer: 6.006 ± 0.341
2.252SerThr: 2.252 ± 2.098
2.252SerVal: 2.252 ± 0.611
2.252SerTrp: 2.252 ± 0.881
2.252SerTyr: 2.252 ± 0.881
0.0SerXaa: 0.0 ± 0.0
Thr
4.505ThrAla: 4.505 ± 0.333
0.751ThrCys: 0.751 ± 0.422
4.505ThrAsp: 4.505 ± 1.221
1.502ThrGlu: 1.502 ± 0.671
0.751ThrPhe: 0.751 ± 1.178
4.505ThrGly: 4.505 ± 1.548
0.751ThrHis: 0.751 ± 0.422
2.252ThrIle: 2.252 ± 1.578
1.502ThrLys: 1.502 ± 0.845
6.757ThrLeu: 6.757 ± 1.475
1.502ThrMet: 1.502 ± 0.95
0.751ThrAsn: 0.751 ± 1.178
1.502ThrPro: 1.502 ± 0.671
2.252ThrGln: 2.252 ± 1.122
6.006ThrArg: 6.006 ± 0.341
3.003ThrSer: 3.003 ± 1.69
1.502ThrThr: 1.502 ± 0.845
3.754ThrVal: 3.754 ± 0.37
0.0ThrTrp: 0.0 ± 0.0
2.252ThrTyr: 2.252 ± 0.611
0.0ThrXaa: 0.0 ± 0.0
Val
5.255ValAla: 5.255 ± 1.368
0.751ValCys: 0.751 ± 0.422
3.754ValAsp: 3.754 ± 2.235
0.751ValGlu: 0.751 ± 1.178
3.003ValPhe: 3.003 ± 0.721
3.754ValGly: 3.754 ± 1.783
0.0ValHis: 0.0 ± 0.0
0.0ValIle: 0.0 ± 0.0
3.003ValLys: 3.003 ± 1.69
7.508ValLeu: 7.508 ± 3.143
0.751ValMet: 0.751 ± 0.656
4.505ValAsn: 4.505 ± 1.548
6.757ValPro: 6.757 ± 1.065
2.252ValGln: 2.252 ± 0.611
7.508ValArg: 7.508 ± 1.335
6.757ValSer: 6.757 ± 0.438
5.255ValThr: 5.255 ± 0.665
4.505ValVal: 4.505 ± 2.305
0.751ValTrp: 0.751 ± 1.178
3.754ValTyr: 3.754 ± 2.112
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.502TrpAsp: 1.502 ± 0.845
0.751TrpGlu: 0.751 ± 0.422
1.502TrpPhe: 1.502 ± 0.95
1.502TrpGly: 1.502 ± 0.95
0.751TrpHis: 0.751 ± 0.94
0.0TrpIle: 0.0 ± 0.0
0.751TrpLys: 0.751 ± 0.94
2.252TrpLeu: 2.252 ± 1.267
0.751TrpMet: 0.751 ± 0.422
2.252TrpAsn: 2.252 ± 0.881
0.751TrpPro: 0.751 ± 0.94
0.0TrpGln: 0.0 ± 0.0
0.751TrpArg: 0.751 ± 1.178
0.751TrpSer: 0.751 ± 0.422
0.751TrpThr: 0.751 ± 0.422
0.751TrpVal: 0.751 ± 0.422
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.502TyrAla: 1.502 ± 0.845
1.502TyrCys: 1.502 ± 0.671
3.003TyrAsp: 3.003 ± 0.721
0.751TyrGlu: 0.751 ± 0.422
1.502TyrPhe: 1.502 ± 0.671
3.754TyrGly: 3.754 ± 1.136
0.0TyrHis: 0.0 ± 0.0
0.751TyrIle: 0.751 ± 0.94
0.751TyrLys: 0.751 ± 0.94
3.003TyrLeu: 3.003 ± 0.808
0.751TyrMet: 0.751 ± 0.422
0.0TyrAsn: 0.0 ± 0.0
1.502TyrPro: 1.502 ± 0.671
1.502TyrGln: 1.502 ± 0.671
1.502TyrArg: 1.502 ± 0.845
3.754TyrSer: 3.754 ± 2.112
0.751TyrThr: 0.751 ± 0.422
3.003TyrVal: 3.003 ± 1.69
0.0TyrTrp: 0.0 ± 0.0
0.751TyrTyr: 0.751 ± 0.422
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1333 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
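A protein-level bootstrap can be sketched as follows in Python. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread is reported as the population standard deviation. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample at the protein level: draw len(proteins) proteins with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    # Assumption: the error is the standard deviation of the resampled estimates.
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy_proteins = ["AAKLAA", "KKLLAA", "MAAAK"]
err_aa = bootstrap_error(toy_proteins, "AA")
err_identical = bootstrap_error(["AAAA", "AAAA"], "AA")
err_absent = bootstrap_error(toy_proteins, "WW")
print(err_aa, err_identical, err_absent)
```

With only 3 proteins, as here, the number of distinct resamples is small, which is one reason several different dipeptides can end up with identical error values.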

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski