Amino acid dipepetide frequency for Human smacovirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.546AlaAla: 3.546 ± 2.157
0.0AlaCys: 0.0 ± 0.0
1.773AlaAsp: 1.773 ± 1.078
0.0AlaGlu: 0.0 ± 0.0
0.0AlaPhe: 0.0 ± 0.0
5.319AlaGly: 5.319 ± 0.778
0.0AlaHis: 0.0 ± 0.0
5.319AlaIle: 5.319 ± 4.137
7.092AlaLys: 7.092 ± 5.516
1.773AlaLeu: 1.773 ± 1.078
0.0AlaMet: 0.0 ± 0.848
0.0AlaAsn: 0.0 ± 0.0
1.773AlaPro: 1.773 ± 1.379
3.546AlaGln: 3.546 ± 0.301
0.0AlaArg: 0.0 ± 0.0
5.319AlaSer: 5.319 ± 3.235
1.773AlaThr: 1.773 ± 1.078
5.319AlaVal: 5.319 ± 3.235
1.773AlaTrp: 1.773 ± 1.078
5.319AlaTyr: 5.319 ± 0.778
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.773CysIle: 1.773 ± 1.379
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
5.319CysMet: 5.319 ± 1.68
0.0CysAsn: 0.0 ± 0.0
1.773CysPro: 1.773 ± 1.379
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
3.546CysSer: 3.546 ± 2.758
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.546AspAla: 3.546 ± 0.301
1.773AspCys: 1.773 ± 1.379
1.773AspAsp: 1.773 ± 1.379
3.546AspGlu: 3.546 ± 0.301
1.773AspPhe: 1.773 ± 1.379
3.546AspGly: 3.546 ± 0.301
0.0AspHis: 0.0 ± 0.0
8.865AspIle: 8.865 ± 1.98
1.773AspLys: 1.773 ± 1.078
7.092AspLeu: 7.092 ± 3.059
1.773AspMet: 1.773 ± 1.078
0.0AspAsn: 0.0 ± 0.0
7.092AspPro: 7.092 ± 4.314
3.546AspGln: 3.546 ± 2.157
10.638AspArg: 10.638 ± 0.902
1.773AspSer: 1.773 ± 1.078
5.319AspThr: 5.319 ± 3.235
0.0AspVal: 0.0 ± 0.0
1.773AspTrp: 1.773 ± 1.078
3.546AspTyr: 3.546 ± 0.301
0.0AspXaa: 0.0 ± 0.0
Glu
1.773GluAla: 1.773 ± 1.078
1.773GluCys: 1.773 ± 1.078
7.092GluAsp: 7.092 ± 0.601
3.546GluGlu: 3.546 ± 2.758
0.0GluPhe: 0.0 ± 0.0
3.546GluGly: 3.546 ± 0.301
1.773GluHis: 1.773 ± 1.379
3.546GluIle: 3.546 ± 2.758
0.0GluLys: 0.0 ± 0.0
1.773GluLeu: 1.773 ± 1.379
1.773GluMet: 1.773 ± 1.379
3.546GluAsn: 3.546 ± 0.301
3.546GluPro: 3.546 ± 0.301
0.0GluGln: 0.0 ± 0.0
5.319GluArg: 5.319 ± 1.68
3.546GluSer: 3.546 ± 2.758
5.319GluThr: 5.319 ± 1.68
5.319GluVal: 5.319 ± 0.778
0.0GluTrp: 0.0 ± 0.0
3.546GluTyr: 3.546 ± 0.301
0.0GluXaa: 0.0 ± 0.0
Phe
1.773PheAla: 1.773 ± 1.078
0.0PheCys: 0.0 ± 0.0
3.546PheAsp: 3.546 ± 2.758
0.0PheGlu: 0.0 ± 0.0
3.546PhePhe: 3.546 ± 2.157
0.0PheGly: 0.0 ± 0.0
0.0PheHis: 0.0 ± 0.0
5.319PheIle: 5.319 ± 0.778
3.546PheLys: 3.546 ± 2.157
1.773PheLeu: 1.773 ± 1.078
0.0PheMet: 0.0 ± 0.0
3.546PheAsn: 3.546 ± 2.157
3.546PhePro: 3.546 ± 2.157
3.546PheGln: 3.546 ± 0.301
1.773PheArg: 1.773 ± 1.379
1.773PheSer: 1.773 ± 1.078
1.773PheThr: 1.773 ± 1.379
5.319PheVal: 5.319 ± 1.68
1.773PheTrp: 1.773 ± 1.379
1.773PheTyr: 1.773 ± 1.078
0.0PheXaa: 0.0 ± 0.0
Gly
1.773GlyAla: 1.773 ± 1.078
0.0GlyCys: 0.0 ± 0.0
1.773GlyAsp: 1.773 ± 1.379
10.638GlyGlu: 10.638 ± 0.902
3.546GlyPhe: 3.546 ± 2.157
3.546GlyGly: 3.546 ± 2.157
5.319GlyHis: 5.319 ± 1.68
0.0GlyIle: 0.0 ± 0.0
3.546GlyLys: 3.546 ± 2.758
10.638GlyLeu: 10.638 ± 1.555
1.773GlyMet: 1.773 ± 1.078
5.319GlyAsn: 5.319 ± 0.778
0.0GlyPro: 0.0 ± 0.0
0.0GlyGln: 0.0 ± 0.0
3.546GlyArg: 3.546 ± 0.301
7.092GlySer: 7.092 ± 1.856
1.773GlyThr: 1.773 ± 1.379
3.546GlyVal: 3.546 ± 2.157
0.0GlyTrp: 0.0 ± 0.0
5.319GlyTyr: 5.319 ± 1.68
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.773HisCys: 1.773 ± 1.379
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.773HisPhe: 1.773 ± 1.379
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
1.773HisLeu: 1.773 ± 1.379
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
1.773HisGln: 1.773 ± 1.078
0.0HisArg: 0.0 ± 0.0
1.773HisSer: 1.773 ± 1.379
0.0HisThr: 0.0 ± 0.0
1.773HisVal: 1.773 ± 1.078
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.546IleAla: 3.546 ± 2.157
1.773IleCys: 1.773 ± 1.379
1.773IleAsp: 1.773 ± 1.379
3.546IleGlu: 3.546 ± 2.758
1.773IlePhe: 1.773 ± 1.379
1.773IleGly: 1.773 ± 1.078
0.0IleHis: 0.0 ± 0.0
3.546IleIle: 3.546 ± 0.301
3.546IleLys: 3.546 ± 2.758
3.546IleLeu: 3.546 ± 0.301
1.773IleMet: 1.773 ± 1.379
3.546IleAsn: 3.546 ± 0.301
5.319IlePro: 5.319 ± 1.68
0.0IleGln: 0.0 ± 0.0
1.773IleArg: 1.773 ± 1.379
1.773IleSer: 1.773 ± 1.078
5.319IleThr: 5.319 ± 0.778
1.773IleVal: 1.773 ± 1.078
1.773IleTrp: 1.773 ± 1.379
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
1.773LysAla: 1.773 ± 1.379
1.773LysCys: 1.773 ± 1.379
1.773LysAsp: 1.773 ± 1.379
5.319LysGlu: 5.319 ± 1.68
1.773LysPhe: 1.773 ± 1.078
0.0LysGly: 0.0 ± 0.0
0.0LysHis: 0.0 ± 0.0
0.0LysIle: 0.0 ± 0.0
0.0LysLys: 0.0 ± 0.0
5.319LysLeu: 5.319 ± 0.778
1.773LysMet: 1.773 ± 0.952
3.546LysAsn: 3.546 ± 2.758
0.0LysPro: 0.0 ± 0.0
3.546LysGln: 3.546 ± 2.758
1.773LysArg: 1.773 ± 1.379
1.773LysSer: 1.773 ± 1.379
1.773LysThr: 1.773 ± 1.078
5.319LysVal: 5.319 ± 0.778
3.546LysTrp: 3.546 ± 2.758
7.092LysTyr: 7.092 ± 0.601
0.0LysXaa: 0.0 ± 0.0
Leu
3.546LeuAla: 3.546 ± 0.301
0.0LeuCys: 0.0 ± 0.0
8.865LeuAsp: 8.865 ± 2.934
0.0LeuGlu: 0.0 ± 0.0
0.0LeuPhe: 0.0 ± 0.0
5.319LeuGly: 5.319 ± 1.68
1.773LeuHis: 1.773 ± 1.078
0.0LeuIle: 0.0 ± 0.0
5.319LeuLys: 5.319 ± 1.68
3.546LeuLeu: 3.546 ± 0.301
3.546LeuMet: 3.546 ± 2.157
1.773LeuAsn: 1.773 ± 1.078
8.865LeuPro: 8.865 ± 5.392
1.773LeuGln: 1.773 ± 1.078
8.865LeuArg: 8.865 ± 4.438
5.319LeuSer: 5.319 ± 0.778
1.773LeuThr: 1.773 ± 1.379
7.092LeuVal: 7.092 ± 0.601
0.0LeuTrp: 0.0 ± 0.0
0.0LeuTyr: 0.0 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.773MetAla: 1.773 ± 1.379
3.546MetCys: 3.546 ± 2.758
1.773MetAsp: 1.773 ± 1.078
1.773MetGlu: 1.773 ± 1.078
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
3.546MetLys: 3.546 ± 2.758
1.773MetLeu: 1.773 ± 1.078
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
5.319MetArg: 5.319 ± 1.68
0.0MetSer: 0.0 ± 0.0
5.319MetThr: 5.319 ± 0.778
0.0MetVal: 0.0 ± 0.0
3.546MetTrp: 3.546 ± 2.758
1.773MetTyr: 1.773 ± 1.078
0.0MetXaa: 0.0 ± 0.0
Asn
1.773AsnAla: 1.773 ± 1.379
0.0AsnCys: 0.0 ± 0.0
5.319AsnAsp: 5.319 ± 0.778
1.773AsnGlu: 1.773 ± 1.379
3.546AsnPhe: 3.546 ± 0.301
7.092AsnGly: 7.092 ± 0.601
0.0AsnHis: 0.0 ± 0.0
3.546AsnIle: 3.546 ± 0.301
0.0AsnLys: 0.0 ± 0.0
1.773AsnLeu: 1.773 ± 1.078
1.773AsnMet: 1.773 ± 1.379
1.773AsnAsn: 1.773 ± 1.078
5.319AsnPro: 5.319 ± 0.778
0.0AsnGln: 0.0 ± 0.0
1.773AsnArg: 1.773 ± 1.078
0.0AsnSer: 0.0 ± 0.0
5.319AsnThr: 5.319 ± 0.778
1.773AsnVal: 1.773 ± 1.078
0.0AsnTrp: 0.0 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
5.319ProAla: 5.319 ± 3.235
0.0ProCys: 0.0 ± 0.0
5.319ProAsp: 5.319 ± 0.778
3.546ProGlu: 3.546 ± 0.301
1.773ProPhe: 1.773 ± 1.078
7.092ProGly: 7.092 ± 1.856
0.0ProHis: 0.0 ± 0.0
0.0ProIle: 0.0 ± 0.0
5.319ProLys: 5.319 ± 1.68
5.319ProLeu: 5.319 ± 3.235
0.0ProMet: 0.0 ± 0.0
1.773ProAsn: 1.773 ± 1.078
1.773ProPro: 1.773 ± 1.078
1.773ProGln: 1.773 ± 1.078
8.865ProArg: 8.865 ± 0.477
3.546ProSer: 3.546 ± 0.301
8.865ProThr: 8.865 ± 0.477
5.319ProVal: 5.319 ± 3.235
1.773ProTrp: 1.773 ± 1.379
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.546GlnAla: 3.546 ± 0.301
0.0GlnCys: 0.0 ± 0.0
1.773GlnAsp: 1.773 ± 1.078
0.0GlnGlu: 0.0 ± 0.0
1.773GlnPhe: 1.773 ± 1.078
1.773GlnGly: 1.773 ± 1.078
1.773GlnHis: 1.773 ± 1.379
0.0GlnIle: 0.0 ± 0.0
0.0GlnLys: 0.0 ± 0.0
5.319GlnLeu: 5.319 ± 1.68
0.0GlnMet: 0.0 ± 0.0
3.546GlnAsn: 3.546 ± 0.301
0.0GlnPro: 0.0 ± 0.0
1.773GlnGln: 1.773 ± 1.379
0.0GlnArg: 0.0 ± 0.0
3.546GlnSer: 3.546 ± 2.157
0.0GlnThr: 0.0 ± 0.0
5.319GlnVal: 5.319 ± 0.778
0.0GlnTrp: 0.0 ± 0.0
1.773GlnTyr: 1.773 ± 1.078
0.0GlnXaa: 0.0 ± 0.0
Arg
3.546ArgAla: 3.546 ± 2.758
0.0ArgCys: 0.0 ± 0.0
3.546ArgAsp: 3.546 ± 2.157
8.865ArgGlu: 8.865 ± 1.98
8.865ArgPhe: 8.865 ± 0.477
5.319ArgGly: 5.319 ± 1.68
0.0ArgHis: 0.0 ± 0.0
3.546ArgIle: 3.546 ± 2.157
1.773ArgLys: 1.773 ± 1.379
3.546ArgLeu: 3.546 ± 2.157
1.773ArgMet: 1.773 ± 1.379
7.092ArgAsn: 7.092 ± 3.059
3.546ArgPro: 3.546 ± 0.301
1.773ArgGln: 1.773 ± 1.078
3.546ArgArg: 3.546 ± 2.157
0.0ArgSer: 0.0 ± 0.0
0.0ArgThr: 0.0 ± 0.0
7.092ArgVal: 7.092 ± 0.601
1.773ArgTrp: 1.773 ± 1.078
3.546ArgTyr: 3.546 ± 2.758
0.0ArgXaa: 0.0 ± 0.0
Ser
7.092SerAla: 7.092 ± 1.856
0.0SerCys: 0.0 ± 0.0
5.319SerAsp: 5.319 ± 3.235
1.773SerGlu: 1.773 ± 1.379
3.546SerPhe: 3.546 ± 0.301
8.865SerGly: 8.865 ± 0.477
0.0SerHis: 0.0 ± 0.0
3.546SerIle: 3.546 ± 0.301
3.546SerLys: 3.546 ± 2.758
1.773SerLeu: 1.773 ± 1.078
0.0SerMet: 0.0 ± 0.0
0.0SerAsn: 0.0 ± 0.0
8.865SerPro: 8.865 ± 0.477
0.0SerGln: 0.0 ± 0.0
1.773SerArg: 1.773 ± 1.078
7.092SerSer: 7.092 ± 4.314
1.773SerThr: 1.773 ± 1.078
5.319SerVal: 5.319 ± 0.778
5.319SerTrp: 5.319 ± 4.137
3.546SerTyr: 3.546 ± 2.157
0.0SerXaa: 0.0 ± 0.0
Thr
1.773ThrAla: 1.773 ± 1.078
0.0ThrCys: 0.0 ± 0.0
1.773ThrAsp: 1.773 ± 1.078
3.546ThrGlu: 3.546 ± 0.301
1.773ThrPhe: 1.773 ± 1.078
7.092ThrGly: 7.092 ± 0.601
0.0ThrHis: 0.0 ± 0.0
0.0ThrIle: 0.0 ± 0.0
1.773ThrLys: 1.773 ± 1.379
1.773ThrLeu: 1.773 ± 1.078
3.546ThrMet: 3.546 ± 0.301
3.546ThrAsn: 3.546 ± 0.301
3.546ThrPro: 3.546 ± 2.157
3.546ThrGln: 3.546 ± 2.157
1.773ThrArg: 1.773 ± 1.078
5.319ThrSer: 5.319 ± 1.68
5.319ThrThr: 5.319 ± 3.235
5.319ThrVal: 5.319 ± 0.778
1.773ThrTrp: 1.773 ± 1.379
3.546ThrTyr: 3.546 ± 0.301
0.0ThrXaa: 0.0 ± 0.0
Val
1.773ValAla: 1.773 ± 1.078
0.0ValCys: 0.0 ± 0.0
8.865ValAsp: 8.865 ± 0.477
1.773ValGlu: 1.773 ± 1.078
3.546ValPhe: 3.546 ± 2.758
3.546ValGly: 3.546 ± 2.157
0.0ValHis: 0.0 ± 0.0
0.0ValIle: 0.0 ± 0.0
1.773ValLys: 1.773 ± 1.078
3.546ValLeu: 3.546 ± 2.758
0.0ValMet: 0.0 ± 0.0
1.773ValAsn: 1.773 ± 1.078
10.638ValPro: 10.638 ± 4.013
3.546ValGln: 3.546 ± 0.301
7.092ValArg: 7.092 ± 0.601
8.865ValSer: 8.865 ± 2.934
1.773ValThr: 1.773 ± 1.379
8.865ValVal: 8.865 ± 2.934
1.773ValTrp: 1.773 ± 1.078
7.092ValTyr: 7.092 ± 4.314
0.0ValXaa: 0.0 ± 0.0
Trp
1.773TrpAla: 1.773 ± 1.379
0.0TrpCys: 0.0 ± 0.0
3.546TrpAsp: 3.546 ± 0.301
3.546TrpGlu: 3.546 ± 2.758
1.773TrpPhe: 1.773 ± 1.078
1.773TrpGly: 1.773 ± 1.379
0.0TrpHis: 0.0 ± 0.0
3.546TrpIle: 3.546 ± 2.758
0.0TrpLys: 0.0 ± 0.0
3.546TrpLeu: 3.546 ± 0.301
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.773TrpGln: 1.773 ± 1.379
1.773TrpArg: 1.773 ± 1.078
1.773TrpSer: 1.773 ± 1.379
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.773TrpTyr: 1.773 ± 1.379
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.773TyrAla: 1.773 ± 1.379
0.0TyrCys: 0.0 ± 0.0
3.546TyrAsp: 3.546 ± 2.758
3.546TyrGlu: 3.546 ± 0.301
3.546TyrPhe: 3.546 ± 2.157
3.546TyrGly: 3.546 ± 0.301
0.0TyrHis: 0.0 ± 0.0
5.319TyrIle: 5.319 ± 0.778
5.319TyrLys: 5.319 ± 3.235
1.773TyrLeu: 1.773 ± 1.078
3.546TyrMet: 3.546 ± 2.758
1.773TyrAsn: 1.773 ± 1.078
1.773TyrPro: 1.773 ± 1.379
0.0TyrGln: 0.0 ± 0.0
3.546TyrArg: 3.546 ± 2.157
5.319TyrSer: 5.319 ± 0.778
3.546TyrThr: 3.546 ± 2.157
1.773TyrVal: 1.773 ± 1.379
0.0TyrTrp: 0.0 ± 0.0
5.319TyrTyr: 5.319 ± 3.235
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (565 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski