Amino acid dipepetide frequency for Camponotus nipponicus virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.33AlaAla: 7.33 ± 1.939
1.571AlaCys: 1.571 ± 0.31
2.618AlaAsp: 2.618 ± 0.271
3.665AlaGlu: 3.665 ± 1.707
0.524AlaPhe: 0.524 ± 0.349
6.283AlaGly: 6.283 ± 2.446
2.618AlaHis: 2.618 ± 0.271
0.524AlaIle: 0.524 ± 0.349
3.665AlaLys: 3.665 ± 1.242
5.759AlaLeu: 5.759 ± 1.629
3.141AlaMet: 3.141 ± 1.357
2.618AlaAsn: 2.618 ± 0.271
5.759AlaPro: 5.759 ± 1.629
1.571AlaGln: 1.571 ± 1.047
4.188AlaArg: 4.188 ± 0.156
6.806AlaSer: 6.806 ± 0.852
3.665AlaThr: 3.665 ± 0.969
7.33AlaVal: 7.33 ± 1.939
1.571AlaTrp: 1.571 ± 0.31
1.047AlaTyr: 1.047 ± 0.039
0.0AlaXaa: 0.0 ± 0.0
Cys
1.571CysAla: 1.571 ± 0.427
1.047CysCys: 1.047 ± 0.776
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.524CysPhe: 0.524 ± 0.388
1.047CysGly: 1.047 ± 0.039
0.0CysHis: 0.0 ± 0.0
0.524CysIle: 0.524 ± 0.388
0.524CysLys: 0.524 ± 0.349
0.524CysLeu: 0.524 ± 0.349
0.524CysMet: 0.524 ± 0.388
1.571CysAsn: 1.571 ± 1.164
1.047CysPro: 1.047 ± 0.776
0.524CysGln: 0.524 ± 0.388
0.524CysArg: 0.524 ± 0.388
1.047CysSer: 1.047 ± 0.039
1.571CysThr: 1.571 ± 0.31
2.618CysVal: 2.618 ± 0.466
1.047CysTrp: 1.047 ± 0.776
0.524CysTyr: 0.524 ± 0.388
0.0CysXaa: 0.0 ± 0.0
Asp
2.618AspAla: 2.618 ± 0.466
1.047AspCys: 1.047 ± 0.698
3.141AspAsp: 3.141 ± 0.854
4.712AspGlu: 4.712 ± 1.281
1.047AspPhe: 1.047 ± 0.776
3.665AspGly: 3.665 ± 0.232
1.047AspHis: 1.047 ± 0.776
4.712AspIle: 4.712 ± 0.193
2.094AspLys: 2.094 ± 0.078
5.236AspLeu: 5.236 ± 0.542
1.571AspMet: 1.571 ± 0.892
1.571AspAsn: 1.571 ± 1.047
4.188AspPro: 4.188 ± 1.318
1.047AspGln: 1.047 ± 0.039
2.094AspArg: 2.094 ± 1.552
2.618AspSer: 2.618 ± 0.271
3.665AspThr: 3.665 ± 0.969
3.141AspVal: 3.141 ± 0.62
2.094AspTrp: 2.094 ± 0.815
2.618AspTyr: 2.618 ± 0.271
0.0AspXaa: 0.0 ± 0.0
Glu
2.618GluAla: 2.618 ± 1.008
1.047GluCys: 1.047 ± 0.039
2.094GluAsp: 2.094 ± 0.659
4.712GluGlu: 4.712 ± 2.018
2.094GluPhe: 2.094 ± 1.552
3.141GluGly: 3.141 ± 0.62
3.141GluHis: 3.141 ± 0.117
1.571GluIle: 1.571 ± 1.164
3.665GluLys: 3.665 ± 1.242
3.141GluLeu: 3.141 ± 0.117
1.571GluMet: 1.571 ± 0.31
2.094GluAsn: 2.094 ± 0.078
2.618GluPro: 2.618 ± 0.466
0.524GluGln: 0.524 ± 0.388
4.188GluArg: 4.188 ± 0.893
2.618GluSer: 2.618 ± 0.466
6.283GluThr: 6.283 ± 1.978
1.571GluVal: 1.571 ± 0.427
1.571GluTrp: 1.571 ± 0.31
1.047GluTyr: 1.047 ± 0.776
0.0GluXaa: 0.0 ± 0.0
Phe
2.618PheAla: 2.618 ± 0.271
0.0PheCys: 0.0 ± 0.0
1.047PheAsp: 1.047 ± 0.776
1.047PheGlu: 1.047 ± 0.776
1.571PhePhe: 1.571 ± 0.31
4.188PheGly: 4.188 ± 1.63
1.047PheHis: 1.047 ± 0.039
1.571PheIle: 1.571 ± 0.427
1.571PheLys: 1.571 ± 1.164
2.094PheLeu: 2.094 ± 0.078
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
1.047PhePro: 1.047 ± 0.039
0.524PheGln: 0.524 ± 0.349
1.571PheArg: 1.571 ± 0.31
5.236PheSer: 5.236 ± 0.542
2.094PheThr: 2.094 ± 0.078
1.047PheVal: 1.047 ± 0.039
0.524PheTrp: 0.524 ± 0.349
1.047PheTyr: 1.047 ± 0.698
0.0PheXaa: 0.0 ± 0.0
Gly
5.236GlyAla: 5.236 ± 1.28
1.571GlyCys: 1.571 ± 0.427
4.188GlyAsp: 4.188 ± 0.893
4.712GlyGlu: 4.712 ± 2.018
2.618GlyPhe: 2.618 ± 1.008
9.424GlyGly: 9.424 ± 4.037
0.524GlyHis: 0.524 ± 0.388
3.665GlyIle: 3.665 ± 0.232
4.712GlyLys: 4.712 ± 2.756
4.712GlyLeu: 4.712 ± 1.281
2.094GlyMet: 2.094 ± 1.396
3.141GlyAsn: 3.141 ± 0.117
3.665GlyPro: 3.665 ± 0.232
2.094GlyGln: 2.094 ± 0.078
4.188GlyArg: 4.188 ± 3.105
5.759GlySer: 5.759 ± 0.583
7.853GlyThr: 7.853 ± 2.135
5.759GlyVal: 5.759 ± 1.629
1.047GlyTrp: 1.047 ± 0.039
1.571GlyTyr: 1.571 ± 0.427
0.0GlyXaa: 0.0 ± 0.0
His
1.047HisAla: 1.047 ± 0.039
2.094HisCys: 2.094 ± 0.815
0.524HisAsp: 0.524 ± 0.349
2.618HisGlu: 2.618 ± 1.203
0.524HisPhe: 0.524 ± 0.349
0.524HisGly: 0.524 ± 0.349
1.047HisHis: 1.047 ± 0.698
0.524HisIle: 0.524 ± 0.349
1.571HisLys: 1.571 ± 0.31
1.571HisLeu: 1.571 ± 0.427
1.571HisMet: 1.571 ± 0.427
0.524HisAsn: 0.524 ± 0.349
0.0HisPro: 0.0 ± 0.0
1.571HisGln: 1.571 ± 0.31
0.0HisArg: 0.0 ± 0.0
0.524HisSer: 0.524 ± 0.388
2.618HisThr: 2.618 ± 0.466
1.571HisVal: 1.571 ± 0.427
1.047HisTrp: 1.047 ± 0.698
0.524HisTyr: 0.524 ± 0.388
0.0HisXaa: 0.0 ± 0.0
Ile
3.141IleAla: 3.141 ± 0.117
1.047IleCys: 1.047 ± 0.039
3.141IleAsp: 3.141 ± 0.62
0.524IleGlu: 0.524 ± 0.349
1.047IlePhe: 1.047 ± 0.039
4.712IleGly: 4.712 ± 1.281
1.047IleHis: 1.047 ± 0.039
1.571IleIle: 1.571 ± 0.427
0.0IleLys: 0.0 ± 0.0
5.759IleLeu: 5.759 ± 0.583
1.571IleMet: 1.571 ± 0.427
1.571IleAsn: 1.571 ± 0.31
5.236IlePro: 5.236 ± 0.195
3.141IleGln: 3.141 ± 0.117
4.188IleArg: 4.188 ± 0.893
4.188IleSer: 4.188 ± 1.63
1.571IleThr: 1.571 ± 1.164
2.618IleVal: 2.618 ± 0.271
0.524IleTrp: 0.524 ± 0.349
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
1.571LysAla: 1.571 ± 0.31
0.524LysCys: 0.524 ± 0.388
2.094LysAsp: 2.094 ± 1.552
4.188LysGlu: 4.188 ± 2.368
0.0LysPhe: 0.0 ± 0.0
2.094LysGly: 2.094 ± 0.078
0.0LysHis: 0.0 ± 0.0
3.665LysIle: 3.665 ± 0.232
2.618LysLys: 2.618 ± 0.271
5.759LysLeu: 5.759 ± 2.795
0.0LysMet: 0.0 ± 0.0
2.618LysAsn: 2.618 ± 0.466
2.618LysPro: 2.618 ± 0.271
1.571LysGln: 1.571 ± 0.427
3.141LysArg: 3.141 ± 0.117
5.236LysSer: 5.236 ± 0.542
4.188LysThr: 4.188 ± 0.156
1.047LysVal: 1.047 ± 0.039
2.094LysTrp: 2.094 ± 0.815
1.571LysTyr: 1.571 ± 0.31
0.0LysXaa: 0.0 ± 0.0
Leu
7.33LeuAla: 7.33 ± 3.413
0.524LeuCys: 0.524 ± 0.388
9.424LeuAsp: 9.424 ± 0.351
2.618LeuGlu: 2.618 ± 0.466
3.141LeuPhe: 3.141 ± 0.854
5.236LeuGly: 5.236 ± 0.195
2.094LeuHis: 2.094 ± 0.078
2.618LeuIle: 2.618 ± 0.466
5.236LeuLys: 5.236 ± 1.669
4.712LeuLeu: 4.712 ± 0.193
0.524LeuMet: 0.524 ± 0.388
4.188LeuAsn: 4.188 ± 0.581
2.618LeuPro: 2.618 ± 1.203
1.047LeuGln: 1.047 ± 0.776
4.712LeuArg: 4.712 ± 0.193
5.236LeuSer: 5.236 ± 1.28
4.712LeuThr: 4.712 ± 0.544
5.236LeuVal: 5.236 ± 0.542
1.047LeuTrp: 1.047 ± 0.039
6.806LeuTyr: 6.806 ± 2.096
0.0LeuXaa: 0.0 ± 0.0
Met
2.094MetAla: 2.094 ± 0.815
0.0MetCys: 0.0 ± 0.0
1.047MetAsp: 1.047 ± 0.698
0.524MetGlu: 0.524 ± 0.388
0.524MetPhe: 0.524 ± 0.388
3.141MetGly: 3.141 ± 0.117
0.0MetHis: 0.0 ± 0.0
0.524MetIle: 0.524 ± 0.349
2.618MetLys: 2.618 ± 1.008
2.618MetLeu: 2.618 ± 0.466
0.524MetMet: 0.524 ± 0.349
0.0MetAsn: 0.0 ± 0.0
1.571MetPro: 1.571 ± 1.047
1.047MetGln: 1.047 ± 0.039
2.094MetArg: 2.094 ± 0.078
3.141MetSer: 3.141 ± 1.357
1.571MetThr: 1.571 ± 1.047
1.047MetVal: 1.047 ± 0.039
0.524MetTrp: 0.524 ± 0.388
2.094MetTyr: 2.094 ± 0.078
0.0MetXaa: 0.0 ± 0.0
Asn
1.047AsnAla: 1.047 ± 0.698
0.0AsnCys: 0.0 ± 0.0
2.094AsnAsp: 2.094 ± 0.659
2.094AsnGlu: 2.094 ± 0.659
1.047AsnPhe: 1.047 ± 0.039
1.571AsnGly: 1.571 ± 0.427
0.0AsnHis: 0.0 ± 0.0
4.188AsnIle: 4.188 ± 0.581
2.094AsnLys: 2.094 ± 0.815
3.141AsnLeu: 3.141 ± 0.854
0.524AsnMet: 0.524 ± 0.349
1.571AsnAsn: 1.571 ± 1.047
3.141AsnPro: 3.141 ± 0.117
0.0AsnGln: 0.0 ± 0.0
2.618AsnArg: 2.618 ± 0.466
1.047AsnSer: 1.047 ± 0.698
2.618AsnThr: 2.618 ± 1.008
3.665AsnVal: 3.665 ± 2.444
1.047AsnTrp: 1.047 ± 0.776
2.094AsnTyr: 2.094 ± 0.659
0.0AsnXaa: 0.0 ± 0.0
Pro
5.759ProAla: 5.759 ± 0.891
0.0ProCys: 0.0 ± 0.0
2.094ProAsp: 2.094 ± 0.078
3.141ProGlu: 3.141 ± 0.117
1.571ProPhe: 1.571 ± 0.31
5.236ProGly: 5.236 ± 0.195
0.524ProHis: 0.524 ± 0.388
3.141ProIle: 3.141 ± 0.117
3.665ProLys: 3.665 ± 0.505
5.236ProLeu: 5.236 ± 2.017
1.047ProMet: 1.047 ± 0.698
1.047ProAsn: 1.047 ± 0.039
2.618ProPro: 2.618 ± 0.271
3.141ProGln: 3.141 ± 0.62
3.141ProArg: 3.141 ± 0.62
3.141ProSer: 3.141 ± 0.62
4.188ProThr: 4.188 ± 0.581
4.712ProVal: 4.712 ± 2.405
1.047ProTrp: 1.047 ± 0.039
3.665ProTyr: 3.665 ± 0.505
0.0ProXaa: 0.0 ± 0.0
Gln
1.047GlnAla: 1.047 ± 0.698
0.0GlnCys: 0.0 ± 0.0
0.524GlnAsp: 0.524 ± 0.388
2.618GlnGlu: 2.618 ± 1.008
1.047GlnPhe: 1.047 ± 0.039
4.188GlnGly: 4.188 ± 1.318
1.047GlnHis: 1.047 ± 0.039
0.524GlnIle: 0.524 ± 0.388
0.0GlnLys: 0.0 ± 0.0
2.618GlnLeu: 2.618 ± 0.466
1.047GlnMet: 1.047 ± 0.039
1.047GlnAsn: 1.047 ± 0.698
2.094GlnPro: 2.094 ± 0.659
1.047GlnGln: 1.047 ± 0.776
0.524GlnArg: 0.524 ± 0.388
3.141GlnSer: 3.141 ± 0.117
4.188GlnThr: 4.188 ± 0.581
2.094GlnVal: 2.094 ± 0.078
3.141GlnTrp: 3.141 ± 0.62
1.047GlnTyr: 1.047 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
5.759ArgAla: 5.759 ± 1.32
1.047ArgCys: 1.047 ± 0.039
3.665ArgAsp: 3.665 ± 0.232
3.141ArgGlu: 3.141 ± 0.117
2.094ArgPhe: 2.094 ± 0.815
5.759ArgGly: 5.759 ± 2.795
0.524ArgHis: 0.524 ± 0.388
5.236ArgIle: 5.236 ± 1.669
2.618ArgLys: 2.618 ± 0.466
6.283ArgLeu: 6.283 ± 0.971
0.0ArgMet: 0.0 ± 0.0
2.094ArgAsn: 2.094 ± 0.659
1.571ArgPro: 1.571 ± 0.31
3.665ArgGln: 3.665 ± 0.232
2.618ArgArg: 2.618 ± 1.008
3.141ArgSer: 3.141 ± 0.117
2.094ArgThr: 2.094 ± 0.659
5.236ArgVal: 5.236 ± 1.669
1.571ArgTrp: 1.571 ± 1.164
3.665ArgTyr: 3.665 ± 0.969
0.0ArgXaa: 0.0 ± 0.0
Ser
5.759SerAla: 5.759 ± 1.629
1.047SerCys: 1.047 ± 0.776
5.236SerAsp: 5.236 ± 0.542
3.141SerGlu: 3.141 ± 0.117
2.618SerPhe: 2.618 ± 1.008
5.236SerGly: 5.236 ± 2.407
0.0SerHis: 0.0 ± 0.0
1.571SerIle: 1.571 ± 0.427
3.141SerLys: 3.141 ± 1.591
7.853SerLeu: 7.853 ± 0.076
2.618SerMet: 2.618 ± 1.008
2.094SerAsn: 2.094 ± 0.815
9.424SerPro: 9.424 ± 1.861
3.665SerGln: 3.665 ± 1.707
3.665SerArg: 3.665 ± 1.242
6.806SerSer: 6.806 ± 0.852
4.712SerThr: 4.712 ± 0.93
5.759SerVal: 5.759 ± 1.629
2.618SerTrp: 2.618 ± 1.008
4.712SerTyr: 4.712 ± 0.544
0.0SerXaa: 0.0 ± 0.0
Thr
5.759ThrAla: 5.759 ± 2.366
1.047ThrCys: 1.047 ± 0.776
4.712ThrAsp: 4.712 ± 0.193
3.665ThrGlu: 3.665 ± 0.969
2.094ThrPhe: 2.094 ± 0.659
5.759ThrGly: 5.759 ± 0.583
3.141ThrHis: 3.141 ± 1.357
3.141ThrIle: 3.141 ± 0.854
1.047ThrLys: 1.047 ± 0.039
3.141ThrLeu: 3.141 ± 0.62
3.665ThrMet: 3.665 ± 1.242
1.571ThrAsn: 1.571 ± 0.31
3.141ThrPro: 3.141 ± 0.62
2.618ThrGln: 2.618 ± 0.271
7.33ThrArg: 7.33 ± 0.273
5.759ThrSer: 5.759 ± 0.583
3.665ThrThr: 3.665 ± 0.232
3.665ThrVal: 3.665 ± 0.969
3.141ThrTrp: 3.141 ± 0.117
3.665ThrTyr: 3.665 ± 0.969
0.0ThrXaa: 0.0 ± 0.0
Val
7.33ValAla: 7.33 ± 1.202
1.047ValCys: 1.047 ± 0.776
3.141ValAsp: 3.141 ± 1.357
2.094ValGlu: 2.094 ± 0.659
3.141ValPhe: 3.141 ± 0.854
2.618ValGly: 2.618 ± 1.008
1.571ValHis: 1.571 ± 0.31
1.047ValIle: 1.047 ± 0.776
3.141ValLys: 3.141 ± 1.357
3.665ValLeu: 3.665 ± 1.707
2.094ValMet: 2.094 ± 0.078
3.665ValAsn: 3.665 ± 1.707
4.188ValPro: 4.188 ± 2.056
1.571ValGln: 1.571 ± 0.31
6.806ValArg: 6.806 ± 0.852
7.853ValSer: 7.853 ± 1.551
3.141ValThr: 3.141 ± 0.117
3.141ValVal: 3.141 ± 1.357
0.524ValTrp: 0.524 ± 0.349
2.094ValTyr: 2.094 ± 1.552
0.0ValXaa: 0.0 ± 0.0
Trp
1.047TrpAla: 1.047 ± 0.776
1.047TrpCys: 1.047 ± 0.776
2.094TrpAsp: 2.094 ± 0.078
0.524TrpGlu: 0.524 ± 0.349
1.047TrpPhe: 1.047 ± 0.776
1.571TrpGly: 1.571 ± 1.164
0.0TrpHis: 0.0 ± 0.0
2.094TrpIle: 2.094 ± 0.659
1.571TrpLys: 1.571 ± 0.31
3.665TrpLeu: 3.665 ± 1.979
0.524TrpMet: 0.524 ± 0.24
0.524TrpAsn: 0.524 ± 0.349
0.524TrpPro: 0.524 ± 0.349
1.047TrpGln: 1.047 ± 0.698
1.571TrpArg: 1.571 ± 0.427
2.618TrpSer: 2.618 ± 0.271
2.094TrpThr: 2.094 ± 1.396
1.047TrpVal: 1.047 ± 0.039
0.524TrpTrp: 0.524 ± 0.349
2.094TrpTyr: 2.094 ± 0.659
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.571TyrAla: 1.571 ± 0.427
1.047TyrCys: 1.047 ± 0.776
1.047TyrAsp: 1.047 ± 0.039
1.571TyrGlu: 1.571 ± 0.31
1.571TyrPhe: 1.571 ± 0.31
3.141TyrGly: 3.141 ± 1.357
2.618TyrHis: 2.618 ± 1.203
4.188TyrIle: 4.188 ± 0.893
1.047TyrLys: 1.047 ± 0.698
1.571TyrLeu: 1.571 ± 0.427
1.571TyrMet: 1.571 ± 0.427
2.094TyrAsn: 2.094 ± 0.659
1.047TyrPro: 1.047 ± 0.776
1.571TyrGln: 1.571 ± 0.31
2.618TyrArg: 2.618 ± 0.466
5.759TyrSer: 5.759 ± 0.583
5.236TyrThr: 5.236 ± 0.932
1.571TyrVal: 1.571 ± 1.047
1.047TyrTrp: 1.047 ± 0.039
1.571TyrTyr: 1.571 ± 1.047
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1911 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski