Amino acid dipepetide frequency for Badger associated gemykibivirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.587AlaAla: 1.587 ± 1.18
0.0AlaCys: 0.0 ± 0.0
4.762AlaAsp: 4.762 ± 0.986
0.0AlaGlu: 0.0 ± 0.0
3.175AlaPhe: 3.175 ± 2.166
4.762AlaGly: 4.762 ± 3.249
0.0AlaHis: 0.0 ± 0.0
4.762AlaIle: 4.762 ± 0.986
3.175AlaLys: 3.175 ± 0.097
6.349AlaLeu: 6.349 ± 0.194
0.0AlaMet: 0.0 ± 0.0
3.175AlaAsn: 3.175 ± 2.166
4.762AlaPro: 4.762 ± 1.277
4.762AlaGln: 4.762 ± 0.986
6.349AlaArg: 6.349 ± 2.457
6.349AlaSer: 6.349 ± 2.457
6.349AlaThr: 6.349 ± 2.069
1.587AlaVal: 1.587 ± 1.083
0.0AlaTrp: 0.0 ± 0.0
3.175AlaTyr: 3.175 ± 2.166
0.0AlaXaa: 0.0 ± 0.0
Cys
4.762CysAla: 4.762 ± 0.986
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
1.587CysPhe: 1.587 ± 1.18
1.587CysGly: 1.587 ± 1.18
0.0CysHis: 0.0 ± 0.0
3.175CysIle: 3.175 ± 2.166
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
3.175CysGln: 3.175 ± 0.097
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
1.587CysVal: 1.587 ± 1.083
0.0CysTrp: 0.0 ± 0.0
1.587CysTyr: 1.587 ± 1.18
0.0CysXaa: 0.0 ± 0.0
Asp
4.762AspAla: 4.762 ± 1.277
0.0AspCys: 0.0 ± 0.0
4.762AspAsp: 4.762 ± 0.986
4.762AspGlu: 4.762 ± 0.986
1.587AspPhe: 1.587 ± 1.18
6.349AspGly: 6.349 ± 2.069
3.175AspHis: 3.175 ± 0.097
3.175AspIle: 3.175 ± 2.166
6.349AspLys: 6.349 ± 2.457
1.587AspLeu: 1.587 ± 1.18
1.587AspMet: 1.587 ± 1.18
1.587AspAsn: 1.587 ± 1.18
9.524AspPro: 9.524 ± 1.972
0.0AspGln: 0.0 ± 0.0
3.175AspArg: 3.175 ± 0.097
1.587AspSer: 1.587 ± 1.083
6.349AspThr: 6.349 ± 4.719
7.937AspVal: 7.937 ± 0.889
6.349AspTrp: 6.349 ± 0.194
4.762AspTyr: 4.762 ± 3.249
0.0AspXaa: 0.0 ± 0.0
Glu
1.587GluAla: 1.587 ± 1.083
1.587GluCys: 1.587 ± 1.083
1.587GluAsp: 1.587 ± 1.18
0.0GluGlu: 0.0 ± 0.0
3.175GluPhe: 3.175 ± 2.166
4.762GluGly: 4.762 ± 1.277
3.175GluHis: 3.175 ± 0.097
0.0GluIle: 0.0 ± 0.0
1.587GluLys: 1.587 ± 1.083
1.587GluLeu: 1.587 ± 1.083
0.0GluMet: 0.0 ± 0.813
1.587GluAsn: 1.587 ± 1.083
1.587GluPro: 1.587 ± 1.083
0.0GluGln: 0.0 ± 0.0
3.175GluArg: 3.175 ± 0.097
1.587GluSer: 1.587 ± 1.18
3.175GluThr: 3.175 ± 2.36
0.0GluVal: 0.0 ± 0.0
0.0GluTrp: 0.0 ± 0.0
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.587PheAla: 1.587 ± 1.18
1.587PheCys: 1.587 ± 1.18
7.937PheAsp: 7.937 ± 3.152
0.0PheGlu: 0.0 ± 0.0
1.587PhePhe: 1.587 ± 1.083
4.762PheGly: 4.762 ± 3.249
4.762PheHis: 4.762 ± 0.986
0.0PheIle: 0.0 ± 0.0
1.587PheLys: 1.587 ± 1.18
0.0PheLeu: 0.0 ± 0.0
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
0.0PhePro: 0.0 ± 0.0
0.0PheGln: 0.0 ± 0.0
6.349PheArg: 6.349 ± 0.194
1.587PheSer: 1.587 ± 1.18
1.587PheThr: 1.587 ± 1.18
7.937PheVal: 7.937 ± 0.889
3.175PheTrp: 3.175 ± 2.166
4.762PheTyr: 4.762 ± 3.249
0.0PheXaa: 0.0 ± 0.0
Gly
4.762GlyAla: 4.762 ± 0.986
1.587GlyCys: 1.587 ± 1.18
7.937GlyAsp: 7.937 ± 1.374
1.587GlyGlu: 1.587 ± 1.083
6.349GlyPhe: 6.349 ± 2.069
9.524GlyGly: 9.524 ± 1.972
1.587GlyHis: 1.587 ± 1.083
0.0GlyIle: 0.0 ± 0.0
3.175GlyLys: 3.175 ± 0.097
7.937GlyLeu: 7.937 ± 3.152
4.762GlyMet: 4.762 ± 0.986
4.762GlyAsn: 4.762 ± 0.986
3.175GlyPro: 3.175 ± 0.097
3.175GlyGln: 3.175 ± 2.36
9.524GlyArg: 9.524 ± 1.972
4.762GlySer: 4.762 ± 0.986
3.175GlyThr: 3.175 ± 2.36
9.524GlyVal: 9.524 ± 0.291
0.0GlyTrp: 0.0 ± 0.0
3.175GlyTyr: 3.175 ± 0.097
0.0GlyXaa: 0.0 ± 0.0
His
4.762HisAla: 4.762 ± 3.249
0.0HisCys: 0.0 ± 0.0
1.587HisAsp: 1.587 ± 1.083
1.587HisGlu: 1.587 ± 1.18
3.175HisPhe: 3.175 ± 2.166
3.175HisGly: 3.175 ± 0.097
1.587HisHis: 1.587 ± 1.083
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
3.175HisLeu: 3.175 ± 2.166
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.587HisPro: 1.587 ± 1.083
3.175HisGln: 3.175 ± 2.166
3.175HisArg: 3.175 ± 0.097
0.0HisSer: 0.0 ± 0.0
1.587HisThr: 1.587 ± 1.18
0.0HisVal: 0.0 ± 0.0
1.587HisTrp: 1.587 ± 1.083
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
1.587IleCys: 1.587 ± 1.083
1.587IleAsp: 1.587 ± 1.18
0.0IleGlu: 0.0 ± 0.0
1.587IlePhe: 1.587 ± 1.083
3.175IleGly: 3.175 ± 2.36
3.175IleHis: 3.175 ± 2.166
0.0IleIle: 0.0 ± 0.0
1.587IleLys: 1.587 ± 1.083
3.175IleLeu: 3.175 ± 2.36
0.0IleMet: 0.0 ± 0.0
0.0IleAsn: 0.0 ± 0.0
1.587IlePro: 1.587 ± 1.18
1.587IleGln: 1.587 ± 1.083
3.175IleArg: 3.175 ± 2.36
3.175IleSer: 3.175 ± 0.097
3.175IleThr: 3.175 ± 2.36
4.762IleVal: 4.762 ± 0.986
3.175IleTrp: 3.175 ± 0.097
1.587IleTyr: 1.587 ± 1.083
0.0IleXaa: 0.0 ± 0.0
Lys
0.0LysAla: 0.0 ± 0.0
1.587LysCys: 1.587 ± 1.18
4.762LysAsp: 4.762 ± 3.249
1.587LysGlu: 1.587 ± 1.18
4.762LysPhe: 4.762 ± 0.986
3.175LysGly: 3.175 ± 0.097
0.0LysHis: 0.0 ± 0.0
1.587LysIle: 1.587 ± 1.18
1.587LysLys: 1.587 ± 1.083
1.587LysLeu: 1.587 ± 1.083
0.0LysMet: 0.0 ± 0.0
0.0LysAsn: 0.0 ± 0.0
0.0LysPro: 0.0 ± 0.0
0.0LysGln: 0.0 ± 0.0
3.175LysArg: 3.175 ± 2.36
3.175LysSer: 3.175 ± 0.097
3.175LysThr: 3.175 ± 0.097
0.0LysVal: 0.0 ± 0.0
1.587LysTrp: 1.587 ± 1.18
4.762LysTyr: 4.762 ± 0.986
0.0LysXaa: 0.0 ± 0.0
Leu
4.762LeuAla: 4.762 ± 3.249
0.0LeuCys: 0.0 ± 0.0
11.111LeuAsp: 11.111 ± 1.471
7.937LeuGlu: 7.937 ± 3.152
1.587LeuPhe: 1.587 ± 1.18
9.524LeuGly: 9.524 ± 1.972
1.587LeuHis: 1.587 ± 1.083
4.762LeuIle: 4.762 ± 1.277
0.0LeuLys: 0.0 ± 0.0
6.349LeuLeu: 6.349 ± 0.194
0.0LeuMet: 0.0 ± 0.0
1.587LeuAsn: 1.587 ± 1.18
0.0LeuPro: 0.0 ± 0.0
1.587LeuGln: 1.587 ± 1.083
4.762LeuArg: 4.762 ± 0.986
3.175LeuSer: 3.175 ± 2.166
1.587LeuThr: 1.587 ± 1.083
3.175LeuVal: 3.175 ± 0.097
1.587LeuTrp: 1.587 ± 1.18
3.175LeuTyr: 3.175 ± 2.166
0.0LeuXaa: 0.0 ± 0.0
Met
4.762MetAla: 4.762 ± 1.277
1.587MetCys: 1.587 ± 1.083
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
1.587MetPhe: 1.587 ± 1.18
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
0.0MetLeu: 0.0 ± 0.0
3.175MetMet: 3.175 ± 2.36
0.0MetAsn: 0.0 ± 0.0
4.762MetPro: 4.762 ± 1.277
0.0MetGln: 0.0 ± 0.0
3.175MetArg: 3.175 ± 2.36
4.762MetSer: 4.762 ± 1.277
0.0MetThr: 0.0 ± 0.0
1.587MetVal: 1.587 ± 1.18
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.587AsnAla: 1.587 ± 1.18
1.587AsnCys: 1.587 ± 1.083
1.587AsnAsp: 1.587 ± 1.18
0.0AsnGlu: 0.0 ± 0.0
0.0AsnPhe: 0.0 ± 0.0
1.587AsnGly: 1.587 ± 1.18
1.587AsnHis: 1.587 ± 1.083
3.175AsnIle: 3.175 ± 0.097
0.0AsnLys: 0.0 ± 0.0
1.587AsnLeu: 1.587 ± 1.18
1.587AsnMet: 1.587 ± 1.18
1.587AsnAsn: 1.587 ± 1.083
1.587AsnPro: 1.587 ± 1.083
0.0AsnGln: 0.0 ± 0.0
1.587AsnArg: 1.587 ± 1.18
1.587AsnSer: 1.587 ± 1.083
4.762AsnThr: 4.762 ± 0.986
1.587AsnVal: 1.587 ± 1.083
0.0AsnTrp: 0.0 ± 0.0
1.587AsnTyr: 1.587 ± 1.18
0.0AsnXaa: 0.0 ± 0.0
Pro
3.175ProAla: 3.175 ± 2.166
0.0ProCys: 0.0 ± 0.0
1.587ProAsp: 1.587 ± 1.18
4.762ProGlu: 4.762 ± 0.986
1.587ProPhe: 1.587 ± 1.083
6.349ProGly: 6.349 ± 0.194
0.0ProHis: 0.0 ± 0.0
1.587ProIle: 1.587 ± 1.18
0.0ProLys: 0.0 ± 0.0
1.587ProLeu: 1.587 ± 1.083
0.0ProMet: 0.0 ± 0.0
1.587ProAsn: 1.587 ± 1.083
3.175ProPro: 3.175 ± 0.097
1.587ProGln: 1.587 ± 1.18
6.349ProArg: 6.349 ± 0.194
6.349ProSer: 6.349 ± 2.069
4.762ProThr: 4.762 ± 1.277
3.175ProVal: 3.175 ± 2.36
1.587ProTrp: 1.587 ± 1.18
1.587ProTyr: 1.587 ± 1.083
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
0.0GlnCys: 0.0 ± 0.0
3.175GlnAsp: 3.175 ± 2.36
0.0GlnGlu: 0.0 ± 0.0
1.587GlnPhe: 1.587 ± 1.083
3.175GlnGly: 3.175 ± 0.097
3.175GlnHis: 3.175 ± 2.166
1.587GlnIle: 1.587 ± 1.083
1.587GlnLys: 1.587 ± 1.083
0.0GlnLeu: 0.0 ± 0.0
3.175GlnMet: 3.175 ± 0.843
0.0GlnAsn: 0.0 ± 0.0
3.175GlnPro: 3.175 ± 0.097
0.0GlnGln: 0.0 ± 0.0
0.0GlnArg: 0.0 ± 0.0
4.762GlnSer: 4.762 ± 3.249
3.175GlnThr: 3.175 ± 2.36
0.0GlnVal: 0.0 ± 0.0
1.587GlnTrp: 1.587 ± 1.18
1.587GlnTyr: 1.587 ± 1.18
0.0GlnXaa: 0.0 ± 0.0
Arg
6.349ArgAla: 6.349 ± 2.457
1.587ArgCys: 1.587 ± 1.18
6.349ArgAsp: 6.349 ± 0.194
1.587ArgGlu: 1.587 ± 1.083
1.587ArgPhe: 1.587 ± 1.18
3.175ArgGly: 3.175 ± 0.097
0.0ArgHis: 0.0 ± 0.0
4.762ArgIle: 4.762 ± 3.539
6.349ArgLys: 6.349 ± 2.457
3.175ArgLeu: 3.175 ± 0.097
3.175ArgMet: 3.175 ± 2.36
3.175ArgAsn: 3.175 ± 2.36
3.175ArgPro: 3.175 ± 2.166
3.175ArgGln: 3.175 ± 0.097
11.111ArgArg: 11.111 ± 8.259
7.937ArgSer: 7.937 ± 1.374
9.524ArgThr: 9.524 ± 4.816
4.762ArgVal: 4.762 ± 0.986
0.0ArgTrp: 0.0 ± 0.0
4.762ArgTyr: 4.762 ± 1.277
0.0ArgXaa: 0.0 ± 0.0
Ser
4.762SerAla: 4.762 ± 0.986
1.587SerCys: 1.587 ± 1.083
4.762SerAsp: 4.762 ± 1.277
1.587SerGlu: 1.587 ± 1.18
4.762SerPhe: 4.762 ± 0.986
9.524SerGly: 9.524 ± 0.291
0.0SerHis: 0.0 ± 0.0
1.587SerIle: 1.587 ± 1.083
1.587SerLys: 1.587 ± 1.083
11.111SerLeu: 11.111 ± 7.58
1.587SerMet: 1.587 ± 1.18
1.587SerAsn: 1.587 ± 1.18
3.175SerPro: 3.175 ± 0.097
3.175SerGln: 3.175 ± 2.166
9.524SerArg: 9.524 ± 2.554
0.0SerSer: 0.0 ± 0.0
1.587SerThr: 1.587 ± 1.083
3.175SerVal: 3.175 ± 2.36
1.587SerTrp: 1.587 ± 1.083
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.175ThrAla: 3.175 ± 2.36
1.587ThrCys: 1.587 ± 1.18
7.937ThrAsp: 7.937 ± 1.374
3.175ThrGlu: 3.175 ± 2.36
3.175ThrPhe: 3.175 ± 2.36
3.175ThrGly: 3.175 ± 0.097
0.0ThrHis: 0.0 ± 0.0
1.587ThrIle: 1.587 ± 1.18
1.587ThrLys: 1.587 ± 1.083
6.349ThrLeu: 6.349 ± 4.719
3.175ThrMet: 3.175 ± 2.36
3.175ThrAsn: 3.175 ± 0.097
4.762ThrPro: 4.762 ± 1.277
4.762ThrGln: 4.762 ± 3.539
1.587ThrArg: 1.587 ± 1.18
6.349ThrSer: 6.349 ± 0.194
6.349ThrThr: 6.349 ± 2.457
3.175ThrVal: 3.175 ± 2.166
0.0ThrTrp: 0.0 ± 0.0
3.175ThrTyr: 3.175 ± 0.097
0.0ThrXaa: 0.0 ± 0.0
Val
1.587ValAla: 1.587 ± 1.083
1.587ValCys: 1.587 ± 1.083
7.937ValAsp: 7.937 ± 3.152
1.587ValGlu: 1.587 ± 1.083
4.762ValPhe: 4.762 ± 0.986
7.937ValGly: 7.937 ± 0.889
3.175ValHis: 3.175 ± 2.166
1.587ValIle: 1.587 ± 1.083
0.0ValLys: 0.0 ± 0.0
3.175ValLeu: 3.175 ± 0.097
1.587ValMet: 1.587 ± 1.083
3.175ValAsn: 3.175 ± 0.097
1.587ValPro: 1.587 ± 1.18
0.0ValGln: 0.0 ± 0.0
3.175ValArg: 3.175 ± 2.36
1.587ValSer: 1.587 ± 1.083
6.349ValThr: 6.349 ± 2.457
4.762ValVal: 4.762 ± 1.277
3.175ValTrp: 3.175 ± 2.166
4.762ValTyr: 4.762 ± 3.539
0.0ValXaa: 0.0 ± 0.0
Trp
3.175TrpAla: 3.175 ± 0.097
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.587TrpGly: 1.587 ± 1.083
3.175TrpHis: 3.175 ± 0.097
3.175TrpIle: 3.175 ± 2.36
0.0TrpLys: 0.0 ± 0.0
4.762TrpLeu: 4.762 ± 3.249
0.0TrpMet: 0.0 ± 0.0
1.587TrpAsn: 1.587 ± 1.18
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.587TrpArg: 1.587 ± 1.18
4.762TrpSer: 4.762 ± 0.986
0.0TrpThr: 0.0 ± 0.0
1.587TrpVal: 1.587 ± 1.083
0.0TrpTrp: 0.0 ± 0.0
1.587TrpTyr: 1.587 ± 1.18
0.0TrpXaa: 0.0 ± 0.0
Tyr
7.937TyrAla: 7.937 ± 5.414
0.0TyrCys: 0.0 ± 0.0
0.0TyrAsp: 0.0 ± 0.0
1.587TyrGlu: 1.587 ± 1.18
1.587TyrPhe: 1.587 ± 1.083
3.175TyrGly: 3.175 ± 0.097
0.0TyrHis: 0.0 ± 0.0
1.587TyrIle: 1.587 ± 1.18
6.349TyrLys: 6.349 ± 0.194
4.762TyrLeu: 4.762 ± 0.986
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
3.175TyrPro: 3.175 ± 0.097
1.587TyrGln: 1.587 ± 1.083
4.762TyrArg: 4.762 ± 3.539
3.175TyrSer: 3.175 ± 2.166
1.587TyrThr: 1.587 ± 1.18
3.175TyrVal: 3.175 ± 2.166
1.587TyrTrp: 1.587 ± 1.18
1.587TyrTyr: 1.587 ± 1.18
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (631 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski