Amino acid dipeptide frequency for Giant house spider associated circular virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
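As a rough illustration of what such a calculation can look like, here is a minimal Python sketch. It assumes one-letter amino acid codes, overlapping pairs of adjacent residues, and pairs that never span two proteins. The function name `dipeptide_per_mille` and the toy sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over adjacent residue pairs.

    Assumption: pairs overlap (residues 1-2, 2-3, ...) and are counted
    within each protein separately, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences (hypothetical), not the proteins of this virus.
freqs = dipeptide_per_mille(["MKRAA", "AAR"])
print(round(freqs["AA"], 3))
```

In this toy input there are six pairs in total and "AA" occurs twice, so it accounts for one third of all dipeptides.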

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented dipeptides are marked in blue in the table.
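The arithmetic behind the expected value can be checked with a short Python sketch. The `classify` helper is hypothetical: it simply compares a value with the uniform expectation, and the exact rule the page uses for colouring is an assumption here. The four example values are copied from the table.

```python
# Uniform expectation: every dipeptide equally likely.
expected_400 = 1000.0 / 20 ** 2   # 20 standard amino acids -> 2.5 per mille (0.25%)
expected_441 = 1000.0 / 21 ** 2   # with Xaa as a 21st symbol -> about 2.268 per mille

def classify(value_per_mille, expected=expected_400):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"

# Values taken from the table (per mille).
examples = {"LysArg": 14.516, "AlaArg": 11.29, "AlaCys": 1.613, "AlaLeu": 0.0}
labels = {pair: classify(v) for pair, v in examples.items()}
print(labels)
```

Under this assumed rule LysArg and AlaArg come out as overrepresented, while AlaCys and AlaLeu come out as underrepresented.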

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
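For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical; the sample value 14.516 is the AspVal entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

asp_val_fraction = per_mille_to_fraction(14.516)   # AspVal from the table
uniform_percent = per_mille_to_percent(2.5)        # uniform expectation, 1/400
print(asp_val_fraction, uniform_percent)
```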

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.839AlaAla: 4.839 ± 1.265
1.613AlaCys: 1.613 ± 1.058
4.839AlaAsp: 4.839 ± 1.265
3.226AlaGlu: 3.226 ± 2.115
1.613AlaPhe: 1.613 ± 1.161
1.613AlaGly: 1.613 ± 1.058
1.613AlaHis: 1.613 ± 1.058
6.452AlaIle: 6.452 ± 2.011
1.613AlaLys: 1.613 ± 1.161
0.0AlaLeu: 0.0 ± 0.0
1.613AlaMet: 1.613 ± 1.161
4.839AlaAsn: 4.839 ± 1.265
1.613AlaPro: 1.613 ± 1.161
3.226AlaGln: 3.226 ± 0.104
11.29AlaArg: 11.29 ± 0.746
0.0AlaSer: 0.0 ± 0.0
1.613AlaThr: 1.613 ± 1.161
1.613AlaVal: 1.613 ± 1.058
1.613AlaTrp: 1.613 ± 1.161
1.613AlaTyr: 1.613 ± 1.161
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.613CysAsp: 1.613 ± 1.058
1.613CysGlu: 1.613 ± 1.058
0.0CysPhe: 0.0 ± 0.0
3.226CysGly: 3.226 ± 2.115
0.0CysHis: 0.0 ± 0.0
3.226CysIle: 3.226 ± 2.115
0.0CysLys: 0.0 ± 0.0
1.613CysLeu: 1.613 ± 1.161
0.0CysMet: 0.0 ± 0.0
3.226CysAsn: 3.226 ± 0.104
0.0CysPro: 0.0 ± 0.0
1.613CysGln: 1.613 ± 1.058
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
1.613CysThr: 1.613 ± 1.058
1.613CysVal: 1.613 ± 1.058
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.613AspAla: 1.613 ± 1.161
1.613AspCys: 1.613 ± 1.161
4.839AspAsp: 4.839 ± 1.265
6.452AspGlu: 6.452 ± 2.011
0.0AspPhe: 0.0 ± 0.0
9.677AspGly: 9.677 ± 4.127
0.0AspHis: 0.0 ± 0.0
1.613AspIle: 1.613 ± 1.058
4.839AspLys: 4.839 ± 1.265
3.226AspLeu: 3.226 ± 0.104
1.613AspMet: 1.613 ± 1.161
0.0AspAsn: 0.0 ± 0.0
3.226AspPro: 3.226 ± 2.115
0.0AspGln: 0.0 ± 0.0
0.0AspArg: 0.0 ± 0.0
4.839AspSer: 4.839 ± 0.954
3.226AspThr: 3.226 ± 2.323
14.516AspVal: 14.516 ± 0.642
3.226AspTrp: 3.226 ± 0.104
6.452AspTyr: 6.452 ± 2.011
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
3.226GluCys: 3.226 ± 2.115
1.613GluAsp: 1.613 ± 1.058
0.0GluGlu: 0.0 ± 0.0
3.226GluPhe: 3.226 ± 2.115
1.613GluGly: 1.613 ± 1.161
1.613GluHis: 1.613 ± 1.058
1.613GluIle: 1.613 ± 1.161
3.226GluLys: 3.226 ± 2.115
1.613GluLeu: 1.613 ± 1.058
1.613GluMet: 1.613 ± 1.058
1.613GluAsn: 1.613 ± 1.161
4.839GluPro: 4.839 ± 3.173
0.0GluGln: 0.0 ± 0.0
6.452GluArg: 6.452 ± 0.208
1.613GluSer: 1.613 ± 1.058
0.0GluThr: 0.0 ± 0.0
1.613GluVal: 1.613 ± 1.058
3.226GluTrp: 3.226 ± 2.115
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.613PheAla: 1.613 ± 1.058
1.613PheCys: 1.613 ± 1.058
9.677PheAsp: 9.677 ± 1.908
1.613PheGlu: 1.613 ± 1.058
1.613PhePhe: 1.613 ± 1.058
0.0PheGly: 0.0 ± 0.0
1.613PheHis: 1.613 ± 1.058
1.613PheIle: 1.613 ± 1.058
3.226PheLys: 3.226 ± 0.104
0.0PheLeu: 0.0 ± 0.0
0.0PheMet: 0.0 ± 0.0
3.226PheAsn: 3.226 ± 0.104
1.613PhePro: 1.613 ± 1.058
1.613PheGln: 1.613 ± 1.161
4.839PheArg: 4.839 ± 0.954
1.613PheSer: 1.613 ± 1.161
0.0PheThr: 0.0 ± 0.0
0.0PheVal: 0.0 ± 0.0
3.226PheTrp: 3.226 ± 0.104
1.613PheTyr: 1.613 ± 1.161
0.0PheXaa: 0.0 ± 0.0
Gly
6.452GlyAla: 6.452 ± 0.208
1.613GlyCys: 1.613 ± 1.058
4.839GlyAsp: 4.839 ± 3.173
0.0GlyGlu: 0.0 ± 0.0
1.613GlyPhe: 1.613 ± 1.058
9.677GlyGly: 9.677 ± 4.127
0.0GlyHis: 0.0 ± 0.0
9.677GlyIle: 9.677 ± 4.127
6.452GlyLys: 6.452 ± 2.011
6.452GlyLeu: 6.452 ± 0.208
1.613GlyMet: 1.613 ± 1.058
3.226GlyAsn: 3.226 ± 0.104
1.613GlyPro: 1.613 ± 1.161
0.0GlyGln: 0.0 ± 0.0
3.226GlyArg: 3.226 ± 0.104
9.677GlySer: 9.677 ± 6.968
8.065GlyThr: 8.065 ± 3.588
3.226GlyVal: 3.226 ± 2.115
0.0GlyTrp: 0.0 ± 0.0
1.613GlyTyr: 1.613 ± 1.058
0.0GlyXaa: 0.0 ± 0.0
His
1.613HisAla: 1.613 ± 1.058
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
4.839HisGlu: 4.839 ± 0.954
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
1.613HisHis: 1.613 ± 1.058
1.613HisIle: 1.613 ± 1.058
1.613HisLys: 1.613 ± 1.058
1.613HisLeu: 1.613 ± 1.058
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.613HisPro: 1.613 ± 1.058
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.613HisSer: 1.613 ± 1.058
0.0HisThr: 0.0 ± 0.0
1.613HisVal: 1.613 ± 1.058
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.226IleAla: 3.226 ± 2.323
1.613IleCys: 1.613 ± 1.058
3.226IleAsp: 3.226 ± 0.104
3.226IleGlu: 3.226 ± 2.115
3.226IlePhe: 3.226 ± 2.115
1.613IleGly: 1.613 ± 1.058
0.0IleHis: 0.0 ± 0.0
3.226IleIle: 3.226 ± 2.323
6.452IleLys: 6.452 ± 2.011
1.613IleLeu: 1.613 ± 1.161
0.0IleMet: 0.0 ± 0.0
0.0IleAsn: 0.0 ± 0.0
4.839IlePro: 4.839 ± 0.954
1.613IleGln: 1.613 ± 1.058
3.226IleArg: 3.226 ± 0.104
6.452IleSer: 6.452 ± 4.23
6.452IleThr: 6.452 ± 0.208
3.226IleVal: 3.226 ± 0.104
1.613IleTrp: 1.613 ± 1.058
1.613IleTyr: 1.613 ± 1.058
0.0IleXaa: 0.0 ± 0.0
Lys
0.0LysAla: 0.0 ± 0.0
0.0LysCys: 0.0 ± 0.0
1.613LysAsp: 1.613 ± 1.058
1.613LysGlu: 1.613 ± 1.058
3.226LysPhe: 3.226 ± 2.115
3.226LysGly: 3.226 ± 2.115
0.0LysHis: 0.0 ± 0.0
3.226LysIle: 3.226 ± 0.104
1.613LysLys: 1.613 ± 1.161
4.839LysLeu: 4.839 ± 0.954
1.613LysMet: 1.613 ± 1.161
1.613LysAsn: 1.613 ± 1.161
1.613LysPro: 1.613 ± 1.058
1.613LysGln: 1.613 ± 1.161
14.516LysArg: 14.516 ± 8.234
3.226LysSer: 3.226 ± 0.104
6.452LysThr: 6.452 ± 2.011
0.0LysVal: 0.0 ± 0.0
1.613LysTrp: 1.613 ± 1.058
6.452LysTyr: 6.452 ± 2.011
0.0LysXaa: 0.0 ± 0.0
Leu
3.226LeuAla: 3.226 ± 0.104
0.0LeuCys: 0.0 ± 0.0
6.452LeuAsp: 6.452 ± 2.011
0.0LeuGlu: 0.0 ± 0.0
1.613LeuPhe: 1.613 ± 1.161
4.839LeuGly: 4.839 ± 3.173
1.613LeuHis: 1.613 ± 1.058
0.0LeuIle: 0.0 ± 0.0
0.0LeuLys: 0.0 ± 0.0
6.452LeuLeu: 6.452 ± 2.427
1.613LeuMet: 1.613 ± 1.161
4.839LeuAsn: 4.839 ± 1.265
4.839LeuPro: 4.839 ± 3.484
3.226LeuGln: 3.226 ± 2.115
4.839LeuArg: 4.839 ± 1.265
1.613LeuSer: 1.613 ± 1.058
1.613LeuThr: 1.613 ± 1.058
6.452LeuVal: 6.452 ± 2.011
4.839LeuTrp: 4.839 ± 0.954
6.452LeuTyr: 6.452 ± 2.427
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.613MetGlu: 1.613 ± 1.058
0.0MetPhe: 0.0 ± 0.0
1.613MetGly: 1.613 ± 1.161
0.0MetHis: 0.0 ± 0.0
4.839MetIle: 4.839 ± 1.265
0.0MetLys: 0.0 ± 0.0
1.613MetLeu: 1.613 ± 1.058
0.0MetMet: 0.0 ± 0.0
3.226MetAsn: 3.226 ± 0.104
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
6.452MetSer: 6.452 ± 2.427
3.226MetThr: 3.226 ± 2.323
1.613MetVal: 1.613 ± 1.161
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.613AsnAla: 1.613 ± 1.161
1.613AsnCys: 1.613 ± 1.058
3.226AsnAsp: 3.226 ± 2.323
1.613AsnGlu: 1.613 ± 1.058
1.613AsnPhe: 1.613 ± 1.161
6.452AsnGly: 6.452 ± 2.427
3.226AsnHis: 3.226 ± 2.115
3.226AsnIle: 3.226 ± 2.115
3.226AsnLys: 3.226 ± 2.323
4.839AsnLeu: 4.839 ± 1.265
0.0AsnMet: 0.0 ± 0.0
3.226AsnAsn: 3.226 ± 2.323
1.613AsnPro: 1.613 ± 1.161
0.0AsnGln: 0.0 ± 0.0
0.0AsnArg: 0.0 ± 0.0
1.613AsnSer: 1.613 ± 1.058
3.226AsnThr: 3.226 ± 2.115
3.226AsnVal: 3.226 ± 2.323
0.0AsnTrp: 0.0 ± 0.0
1.613AsnTyr: 1.613 ± 1.161
0.0AsnXaa: 0.0 ± 0.0
Pro
6.452ProAla: 6.452 ± 2.427
0.0ProCys: 0.0 ± 0.0
4.839ProAsp: 4.839 ± 1.265
1.613ProGlu: 1.613 ± 1.058
0.0ProPhe: 0.0 ± 0.0
1.613ProGly: 1.613 ± 1.161
0.0ProHis: 0.0 ± 0.0
0.0ProIle: 0.0 ± 0.0
3.226ProLys: 3.226 ± 2.115
3.226ProLeu: 3.226 ± 0.104
1.613ProMet: 1.613 ± 1.161
1.613ProAsn: 1.613 ± 1.058
1.613ProPro: 1.613 ± 1.058
1.613ProGln: 1.613 ± 1.161
3.226ProArg: 3.226 ± 0.104
8.065ProSer: 8.065 ± 3.069
6.452ProThr: 6.452 ± 0.208
1.613ProVal: 1.613 ± 1.161
1.613ProTrp: 1.613 ± 1.161
1.613ProTyr: 1.613 ± 1.161
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
1.613GlnCys: 1.613 ± 1.058
0.0GlnAsp: 0.0 ± 0.0
0.0GlnGlu: 0.0 ± 0.0
0.0GlnPhe: 0.0 ± 0.0
3.226GlnGly: 3.226 ± 0.104
0.0GlnHis: 0.0 ± 0.0
1.613GlnIle: 1.613 ± 1.058
0.0GlnLys: 0.0 ± 0.0
1.613GlnLeu: 1.613 ± 1.161
1.613GlnMet: 1.613 ± 0.791
1.613GlnAsn: 1.613 ± 1.161
0.0GlnPro: 0.0 ± 0.0
3.226GlnGln: 3.226 ± 2.115
1.613GlnArg: 1.613 ± 1.161
4.839GlnSer: 4.839 ± 0.954
0.0GlnThr: 0.0 ± 0.0
1.613GlnVal: 1.613 ± 1.161
1.613GlnTrp: 1.613 ± 1.058
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.226ArgAla: 3.226 ± 0.104
0.0ArgCys: 0.0 ± 0.0
4.839ArgAsp: 4.839 ± 0.954
4.839ArgGlu: 4.839 ± 3.173
4.839ArgPhe: 4.839 ± 1.265
4.839ArgGly: 4.839 ± 1.265
0.0ArgHis: 0.0 ± 0.0
3.226ArgIle: 3.226 ± 0.104
4.839ArgLys: 4.839 ± 1.265
3.226ArgLeu: 3.226 ± 2.115
6.452ArgMet: 6.452 ± 5.26
3.226ArgAsn: 3.226 ± 0.104
6.452ArgPro: 6.452 ± 0.208
0.0ArgGln: 0.0 ± 0.0
9.677ArgArg: 9.677 ± 4.749
8.065ArgSer: 8.065 ± 1.369
3.226ArgThr: 3.226 ± 2.323
0.0ArgVal: 0.0 ± 0.0
1.613ArgTrp: 1.613 ± 1.161
4.839ArgTyr: 4.839 ± 3.484
0.0ArgXaa: 0.0 ± 0.0
Ser
8.065SerAla: 8.065 ± 3.069
1.613SerCys: 1.613 ± 1.161
6.452SerAsp: 6.452 ± 2.427
0.0SerGlu: 0.0 ± 0.0
8.065SerPhe: 8.065 ± 0.85
12.903SerGly: 12.903 ± 2.634
0.0SerHis: 0.0 ± 0.0
1.613SerIle: 1.613 ± 1.058
3.226SerLys: 3.226 ± 0.104
6.452SerLeu: 6.452 ± 4.23
0.0SerMet: 0.0 ± 0.0
1.613SerAsn: 1.613 ± 1.058
4.839SerPro: 4.839 ± 3.484
4.839SerGln: 4.839 ± 0.954
8.065SerArg: 8.065 ± 1.369
9.677SerSer: 9.677 ± 2.53
1.613SerThr: 1.613 ± 1.161
3.226SerVal: 3.226 ± 2.323
1.613SerTrp: 1.613 ± 1.058
3.226SerTyr: 3.226 ± 0.104
0.0SerXaa: 0.0 ± 0.0
Thr
3.226ThrAla: 3.226 ± 2.323
0.0ThrCys: 0.0 ± 0.0
4.839ThrAsp: 4.839 ± 0.954
1.613ThrGlu: 1.613 ± 1.161
3.226ThrPhe: 3.226 ± 2.115
3.226ThrGly: 3.226 ± 0.104
0.0ThrHis: 0.0 ± 0.0
4.839ThrIle: 4.839 ± 3.484
6.452ThrLys: 6.452 ± 4.646
8.065ThrLeu: 8.065 ± 3.588
0.0ThrMet: 0.0 ± 0.0
3.226ThrAsn: 3.226 ± 0.104
4.839ThrPro: 4.839 ± 0.954
0.0ThrGln: 0.0 ± 0.0
0.0ThrArg: 0.0 ± 0.0
8.065ThrSer: 8.065 ± 1.369
9.677ThrThr: 9.677 ± 4.749
3.226ThrVal: 3.226 ± 0.104
0.0ThrTrp: 0.0 ± 0.0
1.613ThrTyr: 1.613 ± 1.058
0.0ThrXaa: 0.0 ± 0.0
Val
3.226ValAla: 3.226 ± 0.104
0.0ValCys: 0.0 ± 0.0
4.839ValAsp: 4.839 ± 0.954
3.226ValGlu: 3.226 ± 0.104
3.226ValPhe: 3.226 ± 2.115
4.839ValGly: 4.839 ± 0.954
4.839ValHis: 4.839 ± 3.173
1.613ValIle: 1.613 ± 1.058
3.226ValLys: 3.226 ± 2.115
3.226ValLeu: 3.226 ± 0.104
1.613ValMet: 1.613 ± 1.058
3.226ValAsn: 3.226 ± 0.104
3.226ValPro: 3.226 ± 2.323
0.0ValGln: 0.0 ± 0.0
3.226ValArg: 3.226 ± 2.323
4.839ValSer: 4.839 ± 3.484
1.613ValThr: 1.613 ± 1.161
0.0ValVal: 0.0 ± 0.0
0.0ValTrp: 0.0 ± 0.0
3.226ValTyr: 3.226 ± 2.323
0.0ValXaa: 0.0 ± 0.0
Trp
1.613TrpAla: 1.613 ± 1.058
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.613TrpGlu: 1.613 ± 1.161
1.613TrpPhe: 1.613 ± 1.161
3.226TrpGly: 3.226 ± 0.104
1.613TrpHis: 1.613 ± 1.161
3.226TrpIle: 3.226 ± 2.115
1.613TrpLys: 1.613 ± 1.058
3.226TrpLeu: 3.226 ± 2.115
1.613TrpMet: 1.613 ± 1.058
1.613TrpAsn: 1.613 ± 1.161
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
3.226TrpSer: 3.226 ± 2.115
1.613TrpThr: 1.613 ± 1.161
1.613TrpVal: 1.613 ± 1.161
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
6.452TyrAla: 6.452 ± 2.011
3.226TyrCys: 3.226 ± 2.115
3.226TyrAsp: 3.226 ± 0.104
0.0TyrGlu: 0.0 ± 0.0
1.613TyrPhe: 1.613 ± 1.161
1.613TyrGly: 1.613 ± 1.161
0.0TyrHis: 0.0 ± 0.0
0.0TyrIle: 0.0 ± 0.0
3.226TyrLys: 3.226 ± 0.104
1.613TyrLeu: 1.613 ± 1.161
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
1.613TyrPro: 1.613 ± 1.161
1.613TyrGln: 1.613 ± 1.161
4.839TyrArg: 4.839 ± 0.954
1.613TyrSer: 1.613 ± 1.161
6.452TyrThr: 6.452 ± 2.427
3.226TyrVal: 3.226 ± 0.104
1.613TyrTrp: 1.613 ± 1.161
1.613TyrTyr: 1.613 ± 1.161
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (621 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
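One plausible reading of bootstrapping at the protein level is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption about the procedure, not the database's actual code. The function names, the use of the sample standard deviation, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
from collections import Counter
from statistics import stdev

def dipeptide_per_mille(proteins):
    """Per mille frequency of each overlapping adjacent residue pair."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of a dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return stdev(estimates)

# Toy sequences (hypothetical), not the proteins of this virus.
toy = ["MKRAA", "GGGG"]
err_gg = bootstrap_error(toy, "GG")
err_absent = bootstrap_error(toy, "WW")
```

With only two proteins, as in this proteome, each resample can contain just a few distinct combinations, so errors estimated this way are coarse. A dipeptide that is absent from every protein always gets an error of 0.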

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski