Amino acid dipeptide frequency for Lake Sarah-associated circular virus-39

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
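The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to use one-letter codes, dipeptides are assumed to be counted as overlapping pairs within each protein, and the over/under-representation baseline is assumed to be the uniform 1/400 expectation mentioned in the text.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (non-standard/unknown)

# Uniform expectation from the text: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (assumption).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping windows of length 2, counted within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the assumed uniform baseline."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"
```

For example, `classify(14.36)` labels the SerSer value from the table as over-represented, while any dipeptide with a frequency of 0.0 is labelled under-represented.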

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
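As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the SerSer value of 14.36‰ is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value_per_mille / 10.0


ser_ser = 14.36               # SerSer frequency from the table, in per mille
uniform_fraction = 1 / 400    # 0.25 % expected if all dipeptides were equally likely

# Ratio of the observed fraction to the uniform expectation.
enrichment = per_mille_to_fraction(ser_ser) / uniform_fraction
```

Under these assumptions, SerSer occurs roughly 5.7 times more often than the uniform expectation.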

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.611 ± 1.53
AlaCys: 0.0 ± 0.0
AlaAsp: 5.222 ± 1.074
AlaGlu: 2.611 ± 0.537
AlaPhe: 1.305 ± 0.765
AlaGly: 3.916 ± 0.228
AlaHis: 0.0 ± 0.0
AlaIle: 3.916 ± 2.296
AlaLys: 2.611 ± 2.605
AlaLeu: 3.916 ± 2.296
AlaMet: 2.611 ± 1.53
AlaAsn: 3.916 ± 2.296
AlaPro: 1.305 ± 1.302
AlaGln: 3.916 ± 2.296
AlaArg: 2.611 ± 2.605
AlaSer: 6.527 ± 3.826
AlaThr: 2.611 ± 1.53
AlaVal: 3.916 ± 2.296
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.305 ± 0.765
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.305 ± 0.765
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.305 ± 0.765
CysGly: 0.0 ± 0.0
CysHis: 1.305 ± 1.302
CysIle: 0.0 ± 0.0
CysLys: 1.305 ± 1.302
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.305 ± 0.765
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 3.916 ± 1.839
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 3.916 ± 3.907
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.305 ± 1.302
AspCys: 0.0 ± 0.0
AspAsp: 2.611 ± 2.605
AspGlu: 2.611 ± 2.605
AspPhe: 2.611 ± 0.537
AspGly: 5.222 ± 0.993
AspHis: 0.0 ± 0.0
AspIle: 6.527 ± 2.376
AspLys: 1.305 ± 1.302
AspLeu: 1.305 ± 0.765
AspMet: 0.0 ± 0.0
AspAsn: 1.305 ± 1.302
AspPro: 5.222 ± 1.074
AspGln: 0.0 ± 0.0
AspArg: 5.222 ± 0.993
AspSer: 6.527 ± 0.309
AspThr: 1.305 ± 0.765
AspVal: 3.916 ± 0.228
AspTrp: 0.0 ± 0.0
AspTyr: 5.222 ± 1.074
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.0 ± 0.0
GluCys: 0.0 ± 0.0
GluAsp: 5.222 ± 5.209
GluGlu: 1.305 ± 1.302
GluPhe: 1.305 ± 0.765
GluGly: 1.305 ± 1.302
GluHis: 2.611 ± 1.53
GluIle: 2.611 ± 2.605
GluLys: 0.0 ± 0.0
GluLeu: 3.916 ± 0.228
GluMet: 1.305 ± 0.765
GluAsn: 1.305 ± 1.302
GluPro: 0.0 ± 0.0
GluGln: 6.527 ± 2.376
GluArg: 1.305 ± 0.765
GluSer: 2.611 ± 2.605
GluThr: 3.916 ± 3.907
GluVal: 0.0 ± 0.0
GluTrp: 0.0 ± 0.0
GluTyr: 2.611 ± 1.53
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 3.916 ± 0.228
PheAsp: 2.611 ± 2.605
PheGlu: 1.305 ± 1.302
PhePhe: 1.305 ± 0.765
PheGly: 2.611 ± 1.53
PheHis: 0.0 ± 0.0
PheIle: 2.611 ± 0.537
PheLys: 6.527 ± 3.826
PheLeu: 2.611 ± 0.537
PheMet: 0.0 ± 0.0
PheAsn: 1.305 ± 0.765
PhePro: 1.305 ± 1.302
PheGln: 3.916 ± 2.296
PheArg: 1.305 ± 0.765
PheSer: 1.305 ± 0.765
PheThr: 5.222 ± 3.142
PheVal: 3.916 ± 0.228
PheTrp: 0.0 ± 0.0
PheTyr: 1.305 ± 0.765
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.916 ± 2.296
GlyCys: 1.305 ± 1.302
GlyAsp: 0.0 ± 0.0
GlyGlu: 3.916 ± 1.839
GlyPhe: 3.916 ± 0.228
GlyGly: 2.611 ± 0.537
GlyHis: 0.0 ± 0.0
GlyIle: 2.611 ± 1.53
GlyLys: 5.222 ± 0.993
GlyLeu: 5.222 ± 0.993
GlyMet: 0.0 ± 0.0
GlyAsn: 1.305 ± 0.765
GlyPro: 1.305 ± 1.302
GlyGln: 2.611 ± 0.537
GlyArg: 3.916 ± 0.228
GlySer: 6.527 ± 0.309
GlyThr: 6.527 ± 2.376
GlyVal: 9.138 ± 1.221
GlyTrp: 0.0 ± 0.0
GlyTyr: 5.222 ± 1.074
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.305 ± 0.765
HisPhe: 0.0 ± 0.0
HisGly: 1.305 ± 0.765
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 1.305 ± 1.302
HisMet: 1.305 ± 0.765
HisAsn: 0.0 ± 0.0
HisPro: 2.611 ± 1.53
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 2.611 ± 0.537
HisThr: 1.305 ± 0.765
HisVal: 1.305 ± 1.302
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.611 ± 0.537
IleCys: 0.0 ± 0.0
IleAsp: 6.527 ± 2.376
IleGlu: 0.0 ± 0.0
IlePhe: 3.916 ± 0.228
IleGly: 2.611 ± 1.53
IleHis: 2.611 ± 1.53
IleIle: 2.611 ± 0.537
IleLys: 2.611 ± 0.537
IleLeu: 3.916 ± 1.839
IleMet: 0.0 ± 0.0
IleAsn: 1.305 ± 1.302
IlePro: 0.0 ± 0.0
IleGln: 1.305 ± 0.765
IleArg: 3.916 ± 3.907
IleSer: 2.611 ± 0.537
IleThr: 6.527 ± 3.826
IleVal: 2.611 ± 0.537
IleTrp: 1.305 ± 1.302
IleTyr: 1.305 ± 1.302
IleXaa: 0.0 ± 0.0
Lys
LysAla: 9.138 ± 3.289
LysCys: 0.0 ± 0.0
LysAsp: 0.0 ± 0.0
LysGlu: 2.611 ± 2.605
LysPhe: 2.611 ± 1.53
LysGly: 3.916 ± 1.839
LysHis: 0.0 ± 0.0
LysIle: 2.611 ± 1.53
LysLys: 6.527 ± 0.309
LysLeu: 6.527 ± 1.758
LysMet: 1.305 ± 0.765
LysAsn: 3.916 ± 0.228
LysPro: 3.916 ± 0.228
LysGln: 1.305 ± 0.765
LysArg: 6.527 ± 2.376
LysSer: 5.222 ± 1.074
LysThr: 3.916 ± 2.296
LysVal: 0.0 ± 0.0
LysTrp: 2.611 ± 2.605
LysTyr: 3.916 ± 1.839
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.916 ± 0.228
LeuCys: 2.611 ± 0.537
LeuAsp: 7.833 ± 0.456
LeuGlu: 2.611 ± 0.537
LeuPhe: 2.611 ± 1.53
LeuGly: 3.916 ± 1.839
LeuHis: 0.0 ± 0.0
LeuIle: 2.611 ± 0.537
LeuLys: 5.222 ± 3.142
LeuLeu: 5.222 ± 3.142
LeuMet: 1.305 ± 0.592
LeuAsn: 3.916 ± 2.296
LeuPro: 2.611 ± 1.53
LeuGln: 3.916 ± 0.228
LeuArg: 5.222 ± 3.142
LeuSer: 3.916 ± 2.296
LeuThr: 6.527 ± 3.826
LeuVal: 3.916 ± 1.839
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.305 ± 0.765
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.611 ± 0.537
MetCys: 0.0 ± 0.0
MetAsp: 2.611 ± 0.537
MetGlu: 1.305 ± 0.765
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 3.916 ± 2.296
MetLeu: 1.305 ± 1.302
MetMet: 1.305 ± 0.765
MetAsn: 2.611 ± 1.53
MetPro: 0.0 ± 0.0
MetGln: 2.611 ± 0.537
MetArg: 2.611 ± 1.53
MetSer: 0.0 ± 0.0
MetThr: 1.305 ± 0.765
MetVal: 2.611 ± 1.53
MetTrp: 0.0 ± 0.0
MetTyr: 3.916 ± 0.228
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.611 ± 1.53
AsnCys: 0.0 ± 0.0
AsnAsp: 5.222 ± 0.993
AsnGlu: 1.305 ± 1.302
AsnPhe: 2.611 ± 0.537
AsnGly: 2.611 ± 1.53
AsnHis: 0.0 ± 0.0
AsnIle: 1.305 ± 0.765
AsnLys: 3.916 ± 1.839
AsnLeu: 5.222 ± 0.993
AsnMet: 2.611 ± 0.537
AsnAsn: 2.611 ± 0.537
AsnPro: 3.916 ± 2.296
AsnGln: 1.305 ± 0.765
AsnArg: 2.611 ± 0.537
AsnSer: 7.833 ± 4.591
AsnThr: 3.916 ± 0.228
AsnVal: 2.611 ± 1.53
AsnTrp: 1.305 ± 0.765
AsnTyr: 6.527 ± 2.376
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.305 ± 0.765
ProCys: 0.0 ± 0.0
ProAsp: 0.0 ± 0.0
ProGlu: 1.305 ± 1.302
ProPhe: 1.305 ± 0.765
ProGly: 3.916 ± 2.296
ProHis: 1.305 ± 1.302
ProIle: 3.916 ± 2.296
ProLys: 7.833 ± 0.456
ProLeu: 5.222 ± 3.142
ProMet: 0.0 ± 0.0
ProAsn: 1.305 ± 0.765
ProPro: 7.833 ± 0.456
ProGln: 0.0 ± 0.0
ProArg: 0.0 ± 0.0
ProSer: 3.916 ± 0.228
ProThr: 1.305 ± 1.302
ProVal: 2.611 ± 0.537
ProTrp: 0.0 ± 0.0
ProTyr: 1.305 ± 0.765
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.305 ± 0.765
GlnCys: 0.0 ± 0.0
GlnAsp: 1.305 ± 0.765
GlnGlu: 3.916 ± 0.228
GlnPhe: 3.916 ± 1.839
GlnGly: 3.916 ± 1.839
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 0.0 ± 0.0
GlnLeu: 1.305 ± 1.302
GlnMet: 2.611 ± 1.53
GlnAsn: 3.916 ± 0.228
GlnPro: 3.916 ± 0.228
GlnGln: 1.305 ± 1.302
GlnArg: 1.305 ± 1.302
GlnSer: 2.611 ± 1.53
GlnThr: 3.916 ± 2.296
GlnVal: 3.916 ± 3.907
GlnTrp: 1.305 ± 0.765
GlnTyr: 2.611 ± 0.537
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.222 ± 1.074
ArgCys: 1.305 ± 1.302
ArgAsp: 1.305 ± 0.765
ArgGlu: 1.305 ± 1.302
ArgPhe: 1.305 ± 1.302
ArgGly: 0.0 ± 0.0
ArgHis: 0.0 ± 0.0
ArgIle: 2.611 ± 2.605
ArgLys: 5.222 ± 3.061
ArgLeu: 3.916 ± 0.228
ArgMet: 2.611 ± 1.53
ArgAsn: 5.222 ± 1.074
ArgPro: 2.611 ± 0.537
ArgGln: 0.0 ± 0.0
ArgArg: 6.527 ± 1.758
ArgSer: 3.916 ± 0.228
ArgThr: 3.916 ± 0.228
ArgVal: 3.916 ± 1.839
ArgTrp: 2.611 ± 2.605
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.527 ± 1.758
SerCys: 0.0 ± 0.0
SerAsp: 5.222 ± 0.993
SerGlu: 1.305 ± 1.302
SerPhe: 2.611 ± 0.537
SerGly: 9.138 ± 3.289
SerHis: 2.611 ± 0.537
SerIle: 2.611 ± 0.537
SerLys: 3.916 ± 0.228
SerLeu: 3.916 ± 2.296
SerMet: 3.916 ± 0.228
SerAsn: 9.138 ± 3.289
SerPro: 3.916 ± 2.296
SerGln: 3.916 ± 1.839
SerArg: 1.305 ± 1.302
SerSer: 14.36 ± 8.417
SerThr: 13.055 ± 3.517
SerVal: 3.916 ± 1.839
SerTrp: 0.0 ± 0.0
SerTyr: 2.611 ± 1.53
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.611 ± 0.537
ThrCys: 1.305 ± 1.302
ThrAsp: 3.916 ± 0.228
ThrGlu: 3.916 ± 0.228
ThrPhe: 6.527 ± 1.758
ThrGly: 6.527 ± 2.376
ThrHis: 1.305 ± 0.765
ThrIle: 1.305 ± 0.765
ThrLys: 5.222 ± 0.993
ThrLeu: 6.527 ± 0.309
ThrMet: 3.916 ± 0.619
ThrAsn: 7.833 ± 2.524
ThrPro: 1.305 ± 1.302
ThrGln: 1.305 ± 0.765
ThrArg: 2.611 ± 1.53
ThrSer: 6.527 ± 1.758
ThrThr: 7.833 ± 2.524
ThrVal: 10.444 ± 1.986
ThrTrp: 1.305 ± 1.302
ThrTyr: 2.611 ± 1.53
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.833 ± 4.591
ValCys: 0.0 ± 0.0
ValAsp: 1.305 ± 1.302
ValGlu: 3.916 ± 1.839
ValPhe: 2.611 ± 0.537
ValGly: 5.222 ± 1.074
ValHis: 0.0 ± 0.0
ValIle: 6.527 ± 4.444
ValLys: 1.305 ± 1.302
ValLeu: 1.305 ± 0.765
ValMet: 0.0 ± 0.0
ValAsn: 2.611 ± 0.537
ValPro: 1.305 ± 0.765
ValGln: 3.916 ± 1.839
ValArg: 3.916 ± 0.228
ValSer: 6.527 ± 3.826
ValThr: 7.833 ± 1.611
ValVal: 2.611 ± 1.53
ValTrp: 0.0 ± 0.0
ValTyr: 5.222 ± 0.993
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.305 ± 1.302
TrpGly: 1.305 ± 1.302
TrpHis: 0.0 ± 0.0
TrpIle: 1.305 ± 1.302
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 1.305 ± 1.302
TrpPro: 1.305 ± 1.302
TrpGln: 2.611 ± 0.537
TrpArg: 1.305 ± 0.765
TrpSer: 1.305 ± 1.302
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 3.916 ± 1.839
TyrAsp: 1.305 ± 0.765
TyrGlu: 1.305 ± 0.765
TyrPhe: 0.0 ± 0.0
TyrGly: 5.222 ± 1.074
TyrHis: 1.305 ± 0.765
TyrIle: 2.611 ± 0.537
TyrLys: 3.916 ± 0.228
TyrLeu: 6.527 ± 1.758
TyrMet: 2.611 ± 0.537
TyrAsn: 3.916 ± 0.228
TyrPro: 0.0 ± 0.0
TyrGln: 2.611 ± 2.605
TyrArg: 1.305 ± 0.765
TyrSer: 5.222 ± 0.993
TyrThr: 5.222 ± 1.074
TyrVal: 2.611 ± 0.537
TyrTrp: 1.305 ± 1.302
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (767 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
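A protein-level bootstrap of this kind could look roughly like the sketch below. The page does not describe the exact procedure, so the details are assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of the 100 resamples, and the standard deviation of those estimates is reported as the error. The function names `pair_per_mille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement (assumed procedure).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)
```

With only two proteins, as here, each resample can take just a handful of distinct compositions, so error estimates obtained this way are necessarily coarse.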

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski