Amino acid dipepetide frequency for Citrus endogenous pararetrovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.052AlaAla: 2.052 ± 2.181
1.026AlaCys: 1.026 ± 0.429
1.026AlaAsp: 1.026 ± 2.61
2.565AlaGlu: 2.565 ± 1.967
1.539AlaPhe: 1.539 ± 0.643
1.026AlaGly: 1.026 ± 0.429
1.539AlaHis: 1.539 ± 0.643
1.539AlaIle: 1.539 ± 0.643
4.618AlaLys: 4.618 ± 1.93
5.131AlaLeu: 5.131 ± 2.144
0.0AlaMet: 0.0 ± 0.0
0.513AlaAsn: 0.513 ± 0.214
2.052AlaPro: 2.052 ± 0.858
8.209AlaGln: 8.209 ± 0.391
1.539AlaArg: 1.539 ± 0.643
6.157AlaSer: 6.157 ± 0.466
4.618AlaThr: 4.618 ± 1.109
3.079AlaVal: 3.079 ± 1.286
0.513AlaTrp: 0.513 ± 0.214
0.513AlaTyr: 0.513 ± 0.214
0.0AlaXaa: 0.0 ± 0.0
Cys
1.026CysAla: 1.026 ± 0.429
0.513CysCys: 0.513 ± 0.214
1.026CysAsp: 1.026 ± 0.429
0.513CysGlu: 0.513 ± 0.214
0.513CysPhe: 0.513 ± 0.214
0.0CysGly: 0.0 ± 0.0
0.513CysHis: 0.513 ± 0.214
0.513CysIle: 0.513 ± 0.214
1.026CysLys: 1.026 ± 0.429
0.513CysLeu: 0.513 ± 0.214
0.513CysMet: 0.513 ± 0.214
0.0CysAsn: 0.0 ± 0.0
1.539CysPro: 1.539 ± 0.643
0.513CysGln: 0.513 ± 0.214
0.513CysArg: 0.513 ± 0.214
1.539CysSer: 1.539 ± 0.643
1.539CysThr: 1.539 ± 2.396
0.513CysVal: 0.513 ± 0.214
0.513CysTrp: 0.513 ± 0.214
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.052AspAla: 2.052 ± 2.181
2.052AspCys: 2.052 ± 0.858
4.105AspAsp: 4.105 ± 1.715
2.052AspGlu: 2.052 ± 2.181
3.592AspPhe: 3.592 ± 1.538
1.026AspGly: 1.026 ± 0.429
2.052AspHis: 2.052 ± 0.858
4.618AspIle: 4.618 ± 1.109
3.079AspLys: 3.079 ± 1.286
3.079AspLeu: 3.079 ± 1.286
0.0AspMet: 0.0 ± 0.0
2.052AspAsn: 2.052 ± 0.858
3.592AspPro: 3.592 ± 1.538
4.618AspGln: 4.618 ± 1.109
0.513AspArg: 0.513 ± 0.214
6.157AspSer: 6.157 ± 0.466
4.618AspThr: 4.618 ± 1.109
2.565AspVal: 2.565 ± 1.967
1.026AspTrp: 1.026 ± 0.429
3.079AspTyr: 3.079 ± 1.753
0.0AspXaa: 0.0 ± 0.0
Glu
1.026GluAla: 1.026 ± 2.61
0.513GluCys: 0.513 ± 0.214
1.539GluAsp: 1.539 ± 2.396
3.079GluGlu: 3.079 ± 1.286
3.592GluPhe: 3.592 ± 1.501
0.513GluGly: 0.513 ± 0.214
2.565GluHis: 2.565 ± 1.072
3.079GluIle: 3.079 ± 1.753
2.052GluLys: 2.052 ± 0.858
4.618GluLeu: 4.618 ± 1.109
1.026GluMet: 1.026 ± 0.429
2.052GluAsn: 2.052 ± 5.22
5.644GluPro: 5.644 ± 2.358
3.079GluGln: 3.079 ± 1.753
1.539GluArg: 1.539 ± 0.643
5.131GluSer: 5.131 ± 2.144
1.539GluThr: 1.539 ± 0.643
1.026GluVal: 1.026 ± 2.61
1.026GluTrp: 1.026 ± 0.429
1.026GluTyr: 1.026 ± 0.429
0.0GluXaa: 0.0 ± 0.0
Phe
2.565PheAla: 2.565 ± 1.072
1.539PheCys: 1.539 ± 0.643
2.565PheAsp: 2.565 ± 1.072
2.052PheGlu: 2.052 ± 0.858
1.539PhePhe: 1.539 ± 0.643
1.026PheGly: 1.026 ± 0.429
2.052PheHis: 2.052 ± 0.858
1.539PheIle: 1.539 ± 0.643
5.131PheLys: 5.131 ± 2.144
5.131PheLeu: 5.131 ± 2.144
0.0PheMet: 0.0 ± 0.0
1.026PheAsn: 1.026 ± 0.429
3.592PhePro: 3.592 ± 1.501
4.618PheGln: 4.618 ± 4.148
2.052PheArg: 2.052 ± 0.858
3.079PheSer: 3.079 ± 1.286
6.157PheThr: 6.157 ± 2.573
1.026PheVal: 1.026 ± 0.429
0.513PheTrp: 0.513 ± 0.214
2.565PheTyr: 2.565 ± 1.072
0.0PheXaa: 0.0 ± 0.0
Gly
2.052GlyAla: 2.052 ± 0.858
0.513GlyCys: 0.513 ± 2.825
1.026GlyAsp: 1.026 ± 2.61
0.0GlyGlu: 0.0 ± 0.0
2.565GlyPhe: 2.565 ± 1.072
0.513GlyGly: 0.513 ± 0.214
2.565GlyHis: 2.565 ± 1.072
4.105GlyIle: 4.105 ± 1.324
3.079GlyLys: 3.079 ± 1.753
2.052GlyLeu: 2.052 ± 0.858
1.539GlyMet: 1.539 ± 0.643
0.513GlyAsn: 0.513 ± 0.214
2.565GlyPro: 2.565 ± 1.967
0.513GlyGln: 0.513 ± 0.214
0.513GlyArg: 0.513 ± 0.214
2.565GlySer: 2.565 ± 1.967
1.026GlyThr: 1.026 ± 0.429
0.0GlyVal: 0.0 ± 0.0
0.0GlyTrp: 0.0 ± 0.0
1.026GlyTyr: 1.026 ± 0.429
0.0GlyXaa: 0.0 ± 0.0
His
2.052HisAla: 2.052 ± 0.858
0.513HisCys: 0.513 ± 0.214
1.026HisAsp: 1.026 ± 0.429
0.513HisGlu: 0.513 ± 0.214
4.105HisPhe: 4.105 ± 1.715
2.052HisGly: 2.052 ± 0.858
2.565HisHis: 2.565 ± 1.967
1.026HisIle: 1.026 ± 0.429
0.513HisLys: 0.513 ± 2.825
5.644HisLeu: 5.644 ± 2.358
0.513HisMet: 0.513 ± 2.825
1.026HisAsn: 1.026 ± 0.429
3.079HisPro: 3.079 ± 1.286
3.079HisGln: 3.079 ± 1.286
0.513HisArg: 0.513 ± 0.214
3.079HisSer: 3.079 ± 1.286
5.131HisThr: 5.131 ± 6.973
2.052HisVal: 2.052 ± 0.858
0.0HisTrp: 0.0 ± 0.0
1.539HisTyr: 1.539 ± 0.643
0.0HisXaa: 0.0 ± 0.0
Ile
2.565IleAla: 2.565 ± 1.072
0.513IleCys: 0.513 ± 0.214
3.592IleAsp: 3.592 ± 1.501
2.565IleGlu: 2.565 ± 1.072
3.592IlePhe: 3.592 ± 1.501
3.079IleGly: 3.079 ± 1.753
1.026IleHis: 1.026 ± 2.61
3.079IleIle: 3.079 ± 1.753
3.592IleLys: 3.592 ± 1.501
6.157IleLeu: 6.157 ± 3.505
0.513IleMet: 0.513 ± 0.214
3.592IleAsn: 3.592 ± 1.501
9.236IlePro: 9.236 ± 0.82
3.079IleGln: 3.079 ± 1.286
4.105IleArg: 4.105 ± 1.324
6.67IleSer: 6.67 ± 2.787
2.565IleThr: 2.565 ± 1.072
1.026IleVal: 1.026 ± 0.429
0.513IleTrp: 0.513 ± 0.214
1.026IleTyr: 1.026 ± 0.429
0.0IleXaa: 0.0 ± 0.0
Lys
2.565LysAla: 2.565 ± 1.072
1.026LysCys: 1.026 ± 0.429
5.131LysAsp: 5.131 ± 0.895
3.079LysGlu: 3.079 ± 1.286
3.079LysPhe: 3.079 ± 1.286
1.539LysGly: 1.539 ± 0.643
3.079LysHis: 3.079 ± 1.286
3.592LysIle: 3.592 ± 1.501
11.288LysLys: 11.288 ± 4.717
5.644LysLeu: 5.644 ± 2.358
0.0LysMet: 0.0 ± 0.0
2.052LysAsn: 2.052 ± 0.858
7.183LysPro: 7.183 ± 3.002
3.592LysGln: 3.592 ± 1.501
6.157LysArg: 6.157 ± 0.466
6.67LysSer: 6.67 ± 2.787
2.565LysThr: 2.565 ± 1.967
2.052LysVal: 2.052 ± 0.858
0.513LysTrp: 0.513 ± 0.214
1.539LysTyr: 1.539 ± 2.396
0.0LysXaa: 0.0 ± 0.0
Leu
5.131LeuAla: 5.131 ± 2.144
0.513LeuCys: 0.513 ± 0.214
5.644LeuAsp: 5.644 ± 0.681
3.592LeuGlu: 3.592 ± 1.538
3.592LeuPhe: 3.592 ± 1.538
4.105LeuGly: 4.105 ± 1.715
4.618LeuHis: 4.618 ± 1.109
6.67LeuIle: 6.67 ± 2.787
6.67LeuLys: 6.67 ± 2.787
11.288LeuLeu: 11.288 ± 1.678
2.052LeuMet: 2.052 ± 0.778
5.131LeuAsn: 5.131 ± 0.895
7.696LeuPro: 7.696 ± 3.216
8.209LeuGln: 8.209 ± 3.43
2.565LeuArg: 2.565 ± 1.072
7.696LeuSer: 7.696 ± 3.216
4.105LeuThr: 4.105 ± 1.715
3.079LeuVal: 3.079 ± 1.753
1.026LeuTrp: 1.026 ± 0.429
1.539LeuTyr: 1.539 ± 0.643
0.0LeuXaa: 0.0 ± 0.0
Met
1.026MetAla: 1.026 ± 0.429
0.0MetCys: 0.0 ± 0.0
0.513MetAsp: 0.513 ± 0.214
0.513MetGlu: 0.513 ± 2.825
2.565MetPhe: 2.565 ± 1.072
0.0MetGly: 0.0 ± 0.0
0.513MetHis: 0.513 ± 0.214
2.052MetIle: 2.052 ± 0.858
0.513MetLys: 0.513 ± 0.214
2.052MetLeu: 2.052 ± 0.858
0.513MetMet: 0.513 ± 0.214
0.513MetAsn: 0.513 ± 0.214
1.026MetPro: 1.026 ± 0.429
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
1.539MetSer: 1.539 ± 0.643
1.539MetThr: 1.539 ± 2.396
0.513MetVal: 0.513 ± 0.214
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.565AsnAla: 2.565 ± 1.072
0.513AsnCys: 0.513 ± 0.214
2.052AsnAsp: 2.052 ± 0.858
1.539AsnGlu: 1.539 ± 0.643
0.513AsnPhe: 0.513 ± 0.214
3.079AsnGly: 3.079 ± 1.286
0.513AsnHis: 0.513 ± 0.214
2.565AsnIle: 2.565 ± 1.967
3.079AsnLys: 3.079 ± 1.753
5.644AsnLeu: 5.644 ± 2.358
1.026AsnMet: 1.026 ± 0.429
1.026AsnAsn: 1.026 ± 2.61
3.592AsnPro: 3.592 ± 1.538
2.052AsnGln: 2.052 ± 0.858
0.513AsnArg: 0.513 ± 0.214
1.539AsnSer: 1.539 ± 0.643
0.513AsnThr: 0.513 ± 0.214
1.026AsnVal: 1.026 ± 2.61
1.026AsnTrp: 1.026 ± 0.429
1.539AsnTyr: 1.539 ± 0.643
0.0AsnXaa: 0.0 ± 0.0
Pro
4.105ProAla: 4.105 ± 4.363
0.513ProCys: 0.513 ± 0.214
5.131ProAsp: 5.131 ± 3.934
5.131ProGlu: 5.131 ± 2.144
4.105ProPhe: 4.105 ± 1.715
2.565ProGly: 2.565 ± 1.967
2.052ProHis: 2.052 ± 0.858
3.079ProIle: 3.079 ± 1.286
5.131ProLys: 5.131 ± 0.895
5.644ProLeu: 5.644 ± 2.358
1.026ProMet: 1.026 ± 1.005
4.618ProAsn: 4.618 ± 1.93
11.801ProPro: 11.801 ± 4.931
3.079ProGln: 3.079 ± 1.753
4.105ProArg: 4.105 ± 1.324
12.827ProSer: 12.827 ± 6.796
4.618ProThr: 4.618 ± 1.93
3.079ProVal: 3.079 ± 1.286
1.026ProTrp: 1.026 ± 2.61
3.079ProTyr: 3.079 ± 1.286
0.0ProXaa: 0.0 ± 0.0
Gln
1.539GlnAla: 1.539 ± 2.396
0.513GlnCys: 0.513 ± 0.214
5.131GlnAsp: 5.131 ± 3.934
2.052GlnGlu: 2.052 ± 0.858
4.618GlnPhe: 4.618 ± 1.93
2.052GlnGly: 2.052 ± 5.22
1.026GlnHis: 1.026 ± 2.61
7.696GlnIle: 7.696 ± 0.177
3.079GlnLys: 3.079 ± 1.286
7.183GlnLeu: 7.183 ± 3.002
0.513GlnMet: 0.513 ± 0.214
3.592GlnAsn: 3.592 ± 1.501
4.105GlnPro: 4.105 ± 1.715
4.618GlnGln: 4.618 ± 1.109
4.105GlnArg: 4.105 ± 4.363
5.131GlnSer: 5.131 ± 0.895
4.618GlnThr: 4.618 ± 1.93
1.539GlnVal: 1.539 ± 0.643
1.539GlnTrp: 1.539 ± 0.643
2.052GlnTyr: 2.052 ± 2.181
0.0GlnXaa: 0.0 ± 0.0
Arg
1.026ArgAla: 1.026 ± 0.429
1.026ArgCys: 1.026 ± 0.429
2.565ArgAsp: 2.565 ± 1.072
1.539ArgGlu: 1.539 ± 0.643
1.539ArgPhe: 1.539 ± 0.643
0.513ArgGly: 0.513 ± 0.214
3.592ArgHis: 3.592 ± 1.538
1.539ArgIle: 1.539 ± 0.643
3.079ArgLys: 3.079 ± 1.286
4.105ArgLeu: 4.105 ± 1.715
1.026ArgMet: 1.026 ± 0.429
0.513ArgAsn: 0.513 ± 0.214
4.618ArgPro: 4.618 ± 1.109
4.105ArgGln: 4.105 ± 4.363
3.079ArgArg: 3.079 ± 1.753
3.079ArgSer: 3.079 ± 1.286
2.052ArgThr: 2.052 ± 0.858
1.026ArgVal: 1.026 ± 0.429
0.0ArgTrp: 0.0 ± 0.0
2.052ArgTyr: 2.052 ± 5.22
0.0ArgXaa: 0.0 ± 0.0
Ser
8.722SerAla: 8.722 ± 3.645
1.539SerCys: 1.539 ± 0.643
6.67SerAsp: 6.67 ± 2.787
6.157SerGlu: 6.157 ± 2.573
3.079SerPhe: 3.079 ± 1.286
2.565SerGly: 2.565 ± 1.072
4.105SerHis: 4.105 ± 1.324
4.105SerIle: 4.105 ± 1.324
5.644SerLys: 5.644 ± 2.358
7.696SerLeu: 7.696 ± 2.862
4.105SerMet: 4.105 ± 1.715
4.105SerAsn: 4.105 ± 1.715
8.209SerPro: 8.209 ± 8.725
3.592SerGln: 3.592 ± 1.538
4.105SerArg: 4.105 ± 1.324
14.879SerSer: 14.879 ± 2.899
8.209SerThr: 8.209 ± 2.648
1.539SerVal: 1.539 ± 0.643
0.513SerTrp: 0.513 ± 0.214
2.565SerTyr: 2.565 ± 1.072
0.0SerXaa: 0.0 ± 0.0
Thr
5.131ThrAla: 5.131 ± 2.144
0.513ThrCys: 0.513 ± 0.214
3.592ThrAsp: 3.592 ± 1.538
4.105ThrGlu: 4.105 ± 7.402
2.052ThrPhe: 2.052 ± 0.858
2.052ThrGly: 2.052 ± 2.181
3.079ThrHis: 3.079 ± 1.286
4.105ThrIle: 4.105 ± 1.715
4.618ThrLys: 4.618 ± 1.93
7.696ThrLeu: 7.696 ± 0.177
0.513ThrMet: 0.513 ± 0.214
1.539ThrAsn: 1.539 ± 2.396
3.592ThrPro: 3.592 ± 7.616
2.052ThrGln: 2.052 ± 2.181
3.079ThrArg: 3.079 ± 1.286
8.722ThrSer: 8.722 ± 2.433
5.131ThrThr: 5.131 ± 0.895
4.105ThrVal: 4.105 ± 1.324
0.513ThrTrp: 0.513 ± 0.214
2.052ThrTyr: 2.052 ± 0.858
0.0ThrXaa: 0.0 ± 0.0
Val
1.026ValAla: 1.026 ± 0.429
0.0ValCys: 0.0 ± 0.0
1.026ValAsp: 1.026 ± 0.429
2.565ValGlu: 2.565 ± 1.072
1.026ValPhe: 1.026 ± 0.429
0.0ValGly: 0.0 ± 0.0
0.513ValHis: 0.513 ± 0.214
4.105ValIle: 4.105 ± 1.715
1.539ValLys: 1.539 ± 0.643
4.105ValLeu: 4.105 ± 1.715
0.0ValMet: 0.0 ± 0.0
1.026ValAsn: 1.026 ± 0.429
2.052ValPro: 2.052 ± 2.181
3.592ValGln: 3.592 ± 1.501
1.026ValArg: 1.026 ± 0.429
3.079ValSer: 3.079 ± 4.792
3.592ValThr: 3.592 ± 4.577
0.513ValVal: 0.513 ± 0.214
0.0ValTrp: 0.0 ± 0.0
1.026ValTyr: 1.026 ± 0.429
0.0ValXaa: 0.0 ± 0.0
Trp
0.513TrpAla: 0.513 ± 0.214
0.0TrpCys: 0.0 ± 0.0
1.026TrpAsp: 1.026 ± 0.429
1.026TrpGlu: 1.026 ± 2.61
1.539TrpPhe: 1.539 ± 0.643
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.026TrpIle: 1.026 ± 0.429
1.026TrpLys: 1.026 ± 0.429
0.513TrpLeu: 0.513 ± 0.214
0.0TrpMet: 0.0 ± 0.0
0.513TrpAsn: 0.513 ± 0.214
0.0TrpPro: 0.0 ± 0.0
1.026TrpGln: 1.026 ± 0.429
0.513TrpArg: 0.513 ± 0.214
0.513TrpSer: 0.513 ± 0.214
1.539TrpThr: 1.539 ± 0.643
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.513TyrAla: 0.513 ± 0.214
0.0TyrCys: 0.0 ± 0.0
1.539TyrAsp: 1.539 ± 0.643
1.026TyrGlu: 1.026 ± 0.429
1.026TyrPhe: 1.026 ± 0.429
1.026TyrGly: 1.026 ± 2.61
2.052TyrHis: 2.052 ± 0.858
2.052TyrIle: 2.052 ± 0.858
3.592TyrLys: 3.592 ± 1.501
1.026TyrLeu: 1.026 ± 0.429
0.0TyrMet: 0.0 ± 0.0
0.513TyrAsn: 0.513 ± 0.214
2.052TyrPro: 2.052 ± 2.181
3.079TyrGln: 3.079 ± 1.753
1.539TyrArg: 1.539 ± 0.643
2.052TyrSer: 2.052 ± 0.858
2.565TyrThr: 2.565 ± 5.006
2.052TyrVal: 2.052 ± 0.858
0.513TyrTrp: 0.513 ± 0.214
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1950 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski