Amino acid dipeptide frequency for Xanthophyllomyces dendrorhous virus L1B

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
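As an illustration, the counting can be sketched as follows. This is a minimal sketch, not the database's own code: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions. It uses one-letter codes, with `X` standing in for Xaa.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues is mapped to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count every adjacent pair inside this protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input, not the proteome described on this page.
toy = ["MAAV", "AVL"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The toy input yields five dipeptides in total, two of which are `AV`, so `AV` gets 400‰.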

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides occurring more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
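The expected value and the over/under comparison can be written out as a short sketch. The `classify` helper and the rule of comparing each value directly against 2.5‰ are assumptions; the page does not state the exact threshold used for the colours. The three example values are taken from the table on this page.

```python
N_STANDARD = 20

# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
expected_per_mille = 1000.0 / N_STANDARD ** 2

def classify(freq_per_mille, expected=expected_per_mille):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"       # would be marked in red
    if freq_per_mille < expected:
        return "under"      # would be marked in blue
    return "as expected"

# Values copied from the table (in per mille).
examples = {"AlaThr": 9.321, "AlaHis": 0.666, "ValLeu": 9.987}
labels = {name: classify(value) for name, value in examples.items()}
print(expected_per_mille, labels)
```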

All values are presented in per mille (‰); to obtain fractions, multiply them by 10⁻³.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.326 ± 1.512
AlaCys: 0.0 ± 0.0
AlaAsp: 1.997 ± 0.589
AlaGlu: 5.326 ± 0.337
AlaPhe: 7.324 ± 0.926
AlaGly: 4.66 ± 2.016
AlaHis: 0.666 ± 0.504
AlaIle: 5.326 ± 1.512
AlaLys: 5.992 ± 0.083
AlaLeu: 5.326 ± 0.337
AlaMet: 3.995 ± 0.671
AlaAsn: 4.66 ± 0.757
AlaPro: 1.997 ± 0.589
AlaGln: 1.997 ± 0.589
AlaArg: 7.324 ± 0.002
AlaSer: 3.995 ± 0.253
AlaThr: 9.321 ± 0.59
AlaVal: 8.655 ± 1.01
AlaTrp: 1.332 ± 0.84
AlaTyr: 1.332 ± 0.084
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.666 ± 0.42
CysCys: 0.0 ± 0.0
CysAsp: 0.666 ± 0.42
CysGlu: 1.332 ± 0.84
CysPhe: 1.332 ± 1.009
CysGly: 1.997 ± 0.336
CysHis: 0.0 ± 0.0
CysIle: 0.666 ± 0.42
CysLys: 1.332 ± 0.084
CysLeu: 1.332 ± 0.84
CysMet: 0.0 ± 0.0
CysAsn: 0.666 ± 0.42
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.666 ± 0.504
CysThr: 0.666 ± 0.42
CysVal: 0.666 ± 0.504
CysTrp: 0.0 ± 0.0
CysTyr: 0.666 ± 0.42
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.992 ± 0.083
AspCys: 1.332 ± 0.084
AspAsp: 3.329 ± 1.176
AspGlu: 3.329 ± 1.597
AspPhe: 3.995 ± 0.671
AspGly: 2.663 ± 0.169
AspHis: 1.332 ± 0.84
AspIle: 3.329 ± 0.673
AspLys: 3.329 ± 2.1
AspLeu: 3.995 ± 0.253
AspMet: 1.997 ± 0.589
AspAsn: 2.663 ± 0.169
AspPro: 3.329 ± 0.673
AspGln: 0.666 ± 0.42
AspArg: 2.663 ± 1.68
AspSer: 3.329 ± 0.673
AspThr: 3.995 ± 0.253
AspVal: 5.326 ± 1.512
AspTrp: 1.997 ± 0.336
AspTyr: 2.663 ± 0.169
AspXaa: 0.0 ± 0.0
Glu
GluAla: 9.321 ± 1.259
GluCys: 1.332 ± 1.009
GluAsp: 2.663 ± 0.756
GluGlu: 4.66 ± 1.091
GluPhe: 1.997 ± 0.589
GluGly: 0.0 ± 0.0
GluHis: 0.666 ± 0.504
GluIle: 1.332 ± 0.084
GluLys: 3.329 ± 0.673
GluLeu: 7.324 ± 0.926
GluMet: 3.329 ± 0.559
GluAsn: 2.663 ± 1.68
GluPro: 3.995 ± 2.102
GluGln: 2.663 ± 1.093
GluArg: 1.332 ± 0.084
GluSer: 5.992 ± 1.007
GluThr: 0.666 ± 0.504
GluVal: 2.663 ± 0.169
GluTrp: 0.666 ± 0.42
GluTyr: 1.997 ± 0.589
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.329 ± 1.597
PheCys: 0.666 ± 0.42
PheAsp: 2.663 ± 2.017
PheGlu: 3.329 ± 0.673
PhePhe: 0.666 ± 0.42
PheGly: 1.997 ± 0.589
PheHis: 0.666 ± 0.504
PheIle: 2.663 ± 1.093
PheLys: 2.663 ± 0.756
PheLeu: 2.663 ± 0.756
PheMet: 1.332 ± 0.084
PheAsn: 3.329 ± 0.251
PhePro: 1.332 ± 0.084
PheGln: 0.666 ± 0.504
PheArg: 1.997 ± 0.589
PheSer: 1.997 ± 0.589
PheThr: 3.329 ± 0.673
PheVal: 1.332 ± 0.084
PheTrp: 1.997 ± 1.513
PheTyr: 2.663 ± 0.169
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.66 ± 2.606
GlyCys: 0.0 ± 0.0
GlyAsp: 2.663 ± 0.756
GlyGlu: 3.329 ± 0.673
GlyPhe: 1.997 ± 1.513
GlyGly: 2.663 ± 0.169
GlyHis: 0.0 ± 0.0
GlyIle: 5.326 ± 0.587
GlyLys: 3.995 ± 1.596
GlyLeu: 3.995 ± 0.671
GlyMet: 3.995 ± 0.253
GlyAsn: 1.332 ± 1.009
GlyPro: 0.666 ± 0.504
GlyGln: 0.666 ± 0.42
GlyArg: 3.329 ± 1.597
GlySer: 7.324 ± 0.926
GlyThr: 1.332 ± 0.084
GlyVal: 5.992 ± 0.083
GlyTrp: 1.332 ± 0.084
GlyTyr: 3.329 ± 0.251
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.332 ± 0.84
HisCys: 0.666 ± 0.42
HisAsp: 1.332 ± 0.084
HisGlu: 1.332 ± 1.009
HisPhe: 2.663 ± 0.169
HisGly: 0.0 ± 0.0
HisHis: 0.666 ± 0.42
HisIle: 1.332 ± 0.084
HisLys: 0.0 ± 0.0
HisLeu: 0.666 ± 0.504
HisMet: 0.0 ± 0.0
HisAsn: 0.666 ± 0.42
HisPro: 0.666 ± 0.504
HisGln: 0.666 ± 0.504
HisArg: 1.997 ± 0.336
HisSer: 1.997 ± 1.26
HisThr: 0.0 ± 0.0
HisVal: 2.663 ± 1.093
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.66 ± 2.016
IleCys: 1.997 ± 1.26
IleAsp: 5.992 ± 0.083
IleGlu: 3.329 ± 0.673
IlePhe: 1.332 ± 0.084
IleGly: 1.332 ± 0.084
IleHis: 1.332 ± 0.84
IleIle: 1.332 ± 0.084
IleLys: 5.992 ± 1.932
IleLeu: 2.663 ± 1.093
IleMet: 2.663 ± 0.169
IleAsn: 1.332 ± 0.084
IlePro: 1.332 ± 0.084
IleGln: 2.663 ± 0.756
IleArg: 1.997 ± 1.26
IleSer: 2.663 ± 1.093
IleThr: 1.997 ± 0.336
IleVal: 2.663 ± 0.169
IleTrp: 0.666 ± 0.504
IleTyr: 3.329 ± 0.251
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.324 ± 1.847
LysCys: 1.332 ± 0.84
LysAsp: 3.995 ± 1.596
LysGlu: 2.663 ± 0.756
LysPhe: 3.995 ± 2.102
LysGly: 1.997 ± 0.336
LysHis: 0.666 ± 0.504
LysIle: 5.326 ± 2.436
LysLys: 2.663 ± 0.756
LysLeu: 3.995 ± 1.596
LysMet: 2.663 ± 0.756
LysAsn: 0.666 ± 0.42
LysPro: 3.329 ± 0.673
LysGln: 2.663 ± 0.169
LysArg: 2.663 ± 0.756
LysSer: 4.66 ± 1.091
LysThr: 4.66 ± 0.757
LysVal: 1.997 ± 0.336
LysTrp: 1.332 ± 0.84
LysTyr: 4.66 ± 0.167
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.326 ± 1.262
LeuCys: 0.0 ± 0.0
LeuAsp: 5.326 ± 0.587
LeuGlu: 3.329 ± 0.673
LeuPhe: 2.663 ± 1.093
LeuGly: 6.658 ± 4.119
LeuHis: 0.666 ± 0.42
LeuIle: 2.663 ± 0.756
LeuLys: 5.992 ± 1.007
LeuLeu: 5.326 ± 1.262
LeuMet: 2.663 ± 0.756
LeuAsn: 5.326 ± 0.587
LeuPro: 3.329 ± 0.251
LeuGln: 1.997 ± 1.26
LeuArg: 5.992 ± 1.932
LeuSer: 5.992 ± 0.083
LeuThr: 5.326 ± 1.262
LeuVal: 4.66 ± 0.167
LeuTrp: 1.997 ± 0.589
LeuTyr: 2.663 ± 0.756
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.329 ± 0.251
MetCys: 0.666 ± 0.42
MetAsp: 1.997 ± 0.589
MetGlu: 3.995 ± 0.671
MetPhe: 0.666 ± 0.42
MetGly: 1.332 ± 0.084
MetHis: 1.332 ± 0.84
MetIle: 1.332 ± 0.084
MetLys: 3.329 ± 1.176
MetLeu: 2.663 ± 2.017
MetMet: 1.332 ± 0.084
MetAsn: 0.0 ± 0.0
MetPro: 3.329 ± 1.176
MetGln: 0.0 ± 0.0
MetArg: 1.997 ± 0.589
MetSer: 3.995 ± 1.596
MetThr: 2.663 ± 1.68
MetVal: 1.997 ± 1.513
MetTrp: 0.0 ± 0.0
MetTyr: 1.332 ± 0.084
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.663 ± 0.169
AsnCys: 1.997 ± 0.336
AsnAsp: 3.329 ± 0.251
AsnGlu: 1.997 ± 0.336
AsnPhe: 2.663 ± 1.093
AsnGly: 3.329 ± 0.251
AsnHis: 0.666 ± 0.504
AsnIle: 2.663 ± 0.756
AsnLys: 4.66 ± 0.757
AsnLeu: 3.329 ± 0.673
AsnMet: 3.329 ± 0.251
AsnAsn: 1.997 ± 1.513
AsnPro: 1.332 ± 0.084
AsnGln: 0.0 ± 0.0
AsnArg: 4.66 ± 1.091
AsnSer: 4.66 ± 1.682
AsnThr: 0.666 ± 0.42
AsnVal: 5.992 ± 1.007
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.332 ± 0.084
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.66 ± 2.606
ProCys: 0.0 ± 0.0
ProAsp: 1.997 ± 0.336
ProGlu: 0.0 ± 0.0
ProPhe: 0.0 ± 0.0
ProGly: 2.663 ± 0.169
ProHis: 0.666 ± 0.504
ProIle: 1.332 ± 0.084
ProLys: 3.995 ± 1.596
ProLeu: 2.663 ± 1.093
ProMet: 0.666 ± 0.42
ProAsn: 3.995 ± 1.177
ProPro: 0.666 ± 0.504
ProGln: 0.666 ± 0.42
ProArg: 0.0 ± 0.0
ProSer: 1.332 ± 0.084
ProThr: 3.329 ± 2.522
ProVal: 5.992 ± 0.842
ProTrp: 0.0 ± 0.0
ProTyr: 1.332 ± 0.084
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.997 ± 1.26
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 2.663 ± 2.017
GlnPhe: 0.666 ± 0.504
GlnGly: 1.997 ± 0.589
GlnHis: 2.663 ± 0.756
GlnIle: 0.666 ± 0.504
GlnLys: 0.0 ± 0.0
GlnLeu: 1.332 ± 0.084
GlnMet: 0.666 ± 0.42
GlnAsn: 1.332 ± 0.84
GlnPro: 0.666 ± 0.504
GlnGln: 1.332 ± 0.084
GlnArg: 0.666 ± 0.504
GlnSer: 0.666 ± 0.42
GlnThr: 2.663 ± 1.68
GlnVal: 1.332 ± 0.084
GlnTrp: 0.666 ± 0.42
GlnTyr: 1.997 ± 1.26
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.326 ± 0.337
ArgCys: 0.0 ± 0.0
ArgAsp: 4.66 ± 0.167
ArgGlu: 1.332 ± 0.084
ArgPhe: 2.663 ± 0.169
ArgGly: 3.329 ± 1.176
ArgHis: 1.332 ± 0.84
ArgIle: 2.663 ± 1.68
ArgLys: 2.663 ± 0.756
ArgLeu: 6.658 ± 0.422
ArgMet: 0.666 ± 0.42
ArgAsn: 1.997 ± 0.336
ArgPro: 1.997 ± 0.589
ArgGln: 1.332 ± 0.84
ArgArg: 1.997 ± 0.589
ArgSer: 3.329 ± 0.251
ArgThr: 5.992 ± 2.69
ArgVal: 5.992 ± 0.083
ArgTrp: 0.666 ± 0.42
ArgTyr: 0.666 ± 0.42
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.995 ± 2.102
SerCys: 0.666 ± 0.504
SerAsp: 5.326 ± 0.587
SerGlu: 5.992 ± 2.856
SerPhe: 2.663 ± 1.093
SerGly: 4.66 ± 1.682
SerHis: 0.0 ± 0.0
SerIle: 2.663 ± 0.169
SerLys: 4.66 ± 0.167
SerLeu: 5.326 ± 0.337
SerMet: 3.995 ± 1.596
SerAsn: 5.992 ± 1.766
SerPro: 2.663 ± 0.169
SerGln: 1.997 ± 0.336
SerArg: 3.995 ± 0.253
SerSer: 3.995 ± 1.177
SerThr: 1.997 ± 0.589
SerVal: 7.324 ± 3.696
SerTrp: 0.666 ± 0.504
SerTyr: 3.995 ± 1.177
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.66 ± 1.091
ThrCys: 0.0 ± 0.0
ThrAsp: 6.658 ± 0.503
ThrGlu: 1.332 ± 0.084
ThrPhe: 1.997 ± 0.589
ThrGly: 3.329 ± 2.522
ThrHis: 2.663 ± 1.093
ThrIle: 0.666 ± 0.42
ThrLys: 1.997 ± 0.336
ThrLeu: 3.995 ± 1.596
ThrMet: 1.997 ± 0.589
ThrAsn: 1.997 ± 0.589
ThrPro: 0.0 ± 0.0
ThrGln: 0.666 ± 0.504
ThrArg: 5.992 ± 0.083
ThrSer: 4.66 ± 1.091
ThrThr: 1.997 ± 1.26
ThrVal: 5.326 ± 2.186
ThrTrp: 0.0 ± 0.0
ThrTyr: 5.326 ± 1.262
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.326 ± 0.337
ValCys: 0.666 ± 0.504
ValAsp: 2.663 ± 0.169
ValGlu: 5.326 ± 1.262
ValPhe: 1.997 ± 1.26
ValGly: 5.326 ± 0.587
ValHis: 1.997 ± 0.589
ValIle: 7.324 ± 0.002
ValLys: 2.663 ± 1.093
ValLeu: 9.987 ± 1.679
ValMet: 1.332 ± 0.835
ValAsn: 5.992 ± 0.083
ValPro: 3.995 ± 0.253
ValGln: 2.663 ± 0.756
ValArg: 3.995 ± 1.177
ValSer: 6.658 ± 0.422
ValThr: 2.663 ± 0.756
ValVal: 3.329 ± 1.176
ValTrp: 0.0 ± 0.0
ValTyr: 5.326 ± 0.587
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.663 ± 0.756
TrpCys: 0.666 ± 0.42
TrpAsp: 0.666 ± 0.42
TrpGlu: 0.666 ± 0.42
TrpPhe: 0.0 ± 0.0
TrpGly: 0.666 ± 0.42
TrpHis: 0.0 ± 0.0
TrpIle: 1.332 ± 1.009
TrpLys: 0.666 ± 0.504
TrpLeu: 1.332 ± 1.009
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.666 ± 0.42
TrpArg: 0.666 ± 0.42
TrpSer: 1.997 ± 0.589
TrpThr: 1.332 ± 0.084
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.666 ± 0.42
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.995 ± 0.253
TyrCys: 0.666 ± 0.42
TyrAsp: 3.329 ± 0.673
TyrGlu: 3.329 ± 0.251
TyrPhe: 0.0 ± 0.0
TyrGly: 7.324 ± 0.002
TyrHis: 0.666 ± 0.504
TyrIle: 1.332 ± 0.084
TyrLys: 3.329 ± 1.176
TyrLeu: 3.329 ± 1.176
TyrMet: 0.0 ± 0.0
TyrAsn: 4.66 ± 0.167
TyrPro: 1.332 ± 1.009
TyrGln: 0.0 ± 0.0
TyrArg: 1.997 ± 0.336
TyrSer: 2.663 ± 1.093
TyrThr: 0.666 ± 0.42
TyrVal: 5.992 ± 0.083
TyrTrp: 0.666 ± 0.42
TyrTyr: 0.666 ± 0.42
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1503 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
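A protein-level bootstrap can be sketched roughly as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread is summarised as a population standard deviation. The page does not say which spread estimator is used, and the names `pair_freq` and `bootstrap_error` and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_freq(proteins, pair):
    """Frequency of one dipeptide, in per mille, over all given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_freq(sample, pair))
    # Assumption: the error is reported as a standard deviation.
    return statistics.pstdev(estimates)

# Hypothetical toy proteome with two proteins.
toy = ["MAAVLAAK", "MKVLSTAV"]
err = bootstrap_error(toy, "AA")
print(round(pair_freq(toy, "AA"), 3), "±", round(err, 3))
```

With only two proteins, as in this proteome, each resample can only be one of a handful of combinations, so the estimated error is necessarily coarse.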

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski