Amino acid dipepetide frequency for Camponotus yamaokai virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.711AlaAla: 7.711 ± 3.6
1.186AlaCys: 1.186 ± 0.69
1.186AlaAsp: 1.186 ± 0.193
4.152AlaGlu: 4.152 ± 0.235
2.372AlaPhe: 2.372 ± 0.496
3.559AlaGly: 3.559 ± 0.58
1.186AlaHis: 1.186 ± 0.193
3.559AlaIle: 3.559 ± 0.58
4.152AlaLys: 4.152 ± 1.118
8.304AlaLeu: 8.304 ± 0.47
1.779AlaMet: 1.779 ± 1.034
2.966AlaAsn: 2.966 ± 0.042
3.559AlaPro: 3.559 ± 1.186
1.186AlaGln: 1.186 ± 0.193
1.779AlaArg: 1.779 ± 1.034
4.152AlaSer: 4.152 ± 0.235
4.745AlaThr: 4.745 ± 0.993
2.966AlaVal: 2.966 ± 0.841
2.372AlaTrp: 2.372 ± 0.386
4.152AlaTyr: 4.152 ± 0.648
0.0AlaXaa: 0.0 ± 0.0
Cys
2.372CysAla: 2.372 ± 1.379
0.593CysCys: 0.593 ± 0.538
0.593CysAsp: 0.593 ± 0.538
0.0CysGlu: 0.0 ± 0.0
0.593CysPhe: 0.593 ± 0.345
2.372CysGly: 2.372 ± 1.379
0.0CysHis: 0.0 ± 0.0
1.186CysIle: 1.186 ± 0.69
0.593CysLys: 0.593 ± 0.345
0.593CysLeu: 0.593 ± 0.345
1.779CysMet: 1.779 ± 1.034
1.186CysAsn: 1.186 ± 0.193
1.186CysPro: 1.186 ± 0.193
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.593CysSer: 0.593 ± 0.538
0.593CysThr: 0.593 ± 0.345
1.779CysVal: 1.779 ± 0.152
0.593CysTrp: 0.593 ± 0.538
1.186CysTyr: 1.186 ± 1.076
0.0CysXaa: 0.0 ± 0.0
Asp
4.152AspAla: 4.152 ± 1.531
1.186AspCys: 1.186 ± 0.69
1.779AspAsp: 1.779 ± 0.152
5.338AspGlu: 5.338 ± 1.338
1.186AspPhe: 1.186 ± 0.193
2.966AspGly: 2.966 ± 0.042
1.186AspHis: 1.186 ± 1.076
3.559AspIle: 3.559 ± 0.58
4.152AspLys: 4.152 ± 0.235
5.931AspLeu: 5.931 ± 0.8
0.0AspMet: 0.0 ± 0.0
2.372AspAsn: 2.372 ± 1.379
3.559AspPro: 3.559 ± 2.069
1.186AspGln: 1.186 ± 0.193
0.593AspArg: 0.593 ± 0.345
3.559AspSer: 3.559 ± 1.462
3.559AspThr: 3.559 ± 0.303
4.745AspVal: 4.745 ± 1.656
1.186AspTrp: 1.186 ± 0.193
3.559AspTyr: 3.559 ± 1.462
0.0AspXaa: 0.0 ± 0.0
Glu
2.966GluAla: 2.966 ± 0.924
0.593GluCys: 0.593 ± 0.345
1.186GluAsp: 1.186 ± 0.193
2.966GluGlu: 2.966 ± 0.924
2.372GluPhe: 2.372 ± 1.269
1.779GluGly: 1.779 ± 1.614
2.372GluHis: 2.372 ± 0.496
1.779GluIle: 1.779 ± 1.614
2.372GluLys: 2.372 ± 0.386
5.338GluLeu: 5.338 ± 0.428
1.779GluMet: 1.779 ± 1.225
2.372GluAsn: 2.372 ± 1.379
1.779GluPro: 1.779 ± 0.152
2.966GluGln: 2.966 ± 0.841
4.152GluArg: 4.152 ± 0.235
2.966GluSer: 2.966 ± 0.841
5.931GluThr: 5.931 ± 1.682
2.966GluVal: 2.966 ± 0.924
2.372GluTrp: 2.372 ± 1.269
4.152GluTyr: 4.152 ± 0.235
0.0GluXaa: 0.0 ± 0.0
Phe
0.593PheAla: 0.593 ± 0.345
1.186PheCys: 1.186 ± 0.69
2.966PheAsp: 2.966 ± 0.841
1.779PheGlu: 1.779 ± 0.152
1.186PhePhe: 1.186 ± 0.193
2.966PheGly: 2.966 ± 0.924
0.593PheHis: 0.593 ± 0.345
2.966PheIle: 2.966 ± 0.841
1.779PheLys: 1.779 ± 0.731
4.152PheLeu: 4.152 ± 1.118
0.593PheMet: 0.593 ± 0.538
0.0PheAsn: 0.0 ± 0.0
0.593PhePro: 0.593 ± 0.538
1.186PheGln: 1.186 ± 0.69
1.779PheArg: 1.779 ± 0.731
2.966PheSer: 2.966 ± 0.042
1.779PheThr: 1.779 ± 0.152
1.779PheVal: 1.779 ± 0.152
0.0PheTrp: 0.0 ± 0.0
0.593PheTyr: 0.593 ± 0.345
0.0PheXaa: 0.0 ± 0.0
Gly
4.745GlyAla: 4.745 ± 0.773
0.0GlyCys: 0.0 ± 0.0
2.966GlyAsp: 2.966 ± 0.042
3.559GlyGlu: 3.559 ± 1.462
2.966GlyPhe: 2.966 ± 0.042
6.524GlyGly: 6.524 ± 3.27
1.186GlyHis: 1.186 ± 0.69
2.372GlyIle: 2.372 ± 0.496
3.559GlyLys: 3.559 ± 1.462
7.711GlyLeu: 7.711 ± 0.814
0.0GlyMet: 0.0 ± 0.0
1.779GlyAsn: 1.779 ± 0.152
1.779GlyPro: 1.779 ± 0.152
2.966GlyGln: 2.966 ± 0.841
1.186GlyArg: 1.186 ± 1.076
6.524GlySer: 6.524 ± 0.262
4.152GlyThr: 4.152 ± 1.118
4.152GlyVal: 4.152 ± 0.648
1.779GlyTrp: 1.779 ± 0.152
0.593GlyTyr: 0.593 ± 0.538
0.0GlyXaa: 0.0 ± 0.0
His
2.372HisAla: 2.372 ± 0.386
0.593HisCys: 0.593 ± 0.538
0.593HisAsp: 0.593 ± 0.345
1.186HisGlu: 1.186 ± 0.193
0.0HisPhe: 0.0 ± 0.0
1.186HisGly: 1.186 ± 0.69
0.0HisHis: 0.0 ± 0.0
1.186HisIle: 1.186 ± 0.193
1.779HisLys: 1.779 ± 1.034
1.186HisLeu: 1.186 ± 0.193
1.779HisMet: 1.779 ± 0.731
0.0HisAsn: 0.0 ± 0.0
0.593HisPro: 0.593 ± 0.345
1.186HisGln: 1.186 ± 0.69
0.593HisArg: 0.593 ± 0.345
1.186HisSer: 1.186 ± 1.076
1.779HisThr: 1.779 ± 0.152
1.186HisVal: 1.186 ± 0.193
1.779HisTrp: 1.779 ± 0.152
1.779HisTyr: 1.779 ± 0.731
0.0HisXaa: 0.0 ± 0.0
Ile
4.152IleAla: 4.152 ± 0.648
1.779IleCys: 1.779 ± 0.731
5.931IleAsp: 5.931 ± 1.682
1.779IleGlu: 1.779 ± 0.152
0.593IlePhe: 0.593 ± 0.345
4.152IleGly: 4.152 ± 0.235
0.593IleHis: 0.593 ± 0.345
3.559IleIle: 3.559 ± 2.345
1.186IleLys: 1.186 ± 0.69
4.745IleLeu: 4.745 ± 0.11
2.372IleMet: 2.372 ± 0.386
2.966IleAsn: 2.966 ± 1.724
4.152IlePro: 4.152 ± 0.648
1.779IleGln: 1.779 ± 0.731
4.152IleArg: 4.152 ± 1.531
4.745IleSer: 4.745 ± 2.538
5.338IleThr: 5.338 ± 2.194
1.779IleVal: 1.779 ± 1.614
1.186IleTrp: 1.186 ± 0.193
1.186IleTyr: 1.186 ± 0.193
0.0IleXaa: 0.0 ± 0.0
Lys
2.966LysAla: 2.966 ± 1.807
1.779LysCys: 1.779 ± 0.152
2.372LysAsp: 2.372 ± 1.269
4.745LysGlu: 4.745 ± 2.538
1.779LysPhe: 1.779 ± 0.152
4.745LysGly: 4.745 ± 0.773
1.186LysHis: 1.186 ± 0.193
2.966LysIle: 2.966 ± 0.042
1.779LysLys: 1.779 ± 0.731
5.931LysLeu: 5.931 ± 0.966
3.559LysMet: 3.559 ± 0.58
2.966LysAsn: 2.966 ± 0.841
3.559LysPro: 3.559 ± 0.303
0.593LysGln: 0.593 ± 0.538
4.152LysArg: 4.152 ± 0.648
1.186LysSer: 1.186 ± 0.69
6.524LysThr: 6.524 ± 0.262
4.745LysVal: 4.745 ± 0.11
1.186LysTrp: 1.186 ± 1.076
1.186LysTyr: 1.186 ± 0.69
0.0LysXaa: 0.0 ± 0.0
Leu
5.931LeuAla: 5.931 ± 2.565
1.186LeuCys: 1.186 ± 0.69
7.711LeuAsp: 7.711 ± 0.814
4.745LeuGlu: 4.745 ± 0.11
1.186LeuPhe: 1.186 ± 1.076
1.779LeuGly: 1.779 ± 1.614
1.779LeuHis: 1.779 ± 1.614
4.745LeuIle: 4.745 ± 0.993
7.117LeuLys: 7.117 ± 4.691
4.152LeuLeu: 4.152 ± 1.118
3.559LeuMet: 3.559 ± 0.58
1.186LeuAsn: 1.186 ± 0.69
1.779LeuPro: 1.779 ± 0.731
1.186LeuGln: 1.186 ± 0.69
5.931LeuArg: 5.931 ± 2.732
9.49LeuSer: 9.49 ± 1.546
7.117LeuThr: 7.117 ± 0.606
3.559LeuVal: 3.559 ± 1.186
1.779LeuTrp: 1.779 ± 1.034
4.152LeuTyr: 4.152 ± 1.118
0.0LeuXaa: 0.0 ± 0.0
Met
2.372MetAla: 2.372 ± 0.496
0.0MetCys: 0.0 ± 0.0
1.186MetAsp: 1.186 ± 0.193
2.372MetGlu: 2.372 ± 1.269
0.593MetPhe: 0.593 ± 0.345
2.966MetGly: 2.966 ± 0.042
0.593MetHis: 0.593 ± 0.345
0.593MetIle: 0.593 ± 0.345
1.186MetLys: 1.186 ± 0.193
2.372MetLeu: 2.372 ± 0.386
1.186MetMet: 1.186 ± 0.193
1.779MetAsn: 1.779 ± 0.731
2.372MetPro: 2.372 ± 1.379
1.779MetGln: 1.779 ± 0.152
1.186MetArg: 1.186 ± 1.076
2.372MetSer: 2.372 ± 0.496
3.559MetThr: 3.559 ± 2.069
1.779MetVal: 1.779 ± 0.731
0.593MetTrp: 0.593 ± 0.345
1.779MetTyr: 1.779 ± 0.731
0.0MetXaa: 0.0 ± 0.0
Asn
1.186AsnAla: 1.186 ± 0.193
1.186AsnCys: 1.186 ± 0.193
2.966AsnAsp: 2.966 ± 0.841
3.559AsnGlu: 3.559 ± 0.303
1.779AsnPhe: 1.779 ± 1.034
0.593AsnGly: 0.593 ± 0.345
0.593AsnHis: 0.593 ± 0.345
5.931AsnIle: 5.931 ± 0.8
1.779AsnLys: 1.779 ± 0.152
1.779AsnLeu: 1.779 ± 0.731
0.0AsnMet: 0.0 ± 0.0
2.966AsnAsn: 2.966 ± 1.724
0.593AsnPro: 0.593 ± 0.538
1.779AsnGln: 1.779 ± 0.152
1.779AsnArg: 1.779 ± 0.152
3.559AsnSer: 3.559 ± 1.186
1.779AsnThr: 1.779 ± 1.034
2.966AsnVal: 2.966 ± 0.841
1.186AsnTrp: 1.186 ± 0.69
2.372AsnTyr: 2.372 ± 1.379
0.0AsnXaa: 0.0 ± 0.0
Pro
3.559ProAla: 3.559 ± 0.303
1.186ProCys: 1.186 ± 0.193
2.372ProAsp: 2.372 ± 0.496
2.372ProGlu: 2.372 ± 1.379
0.593ProPhe: 0.593 ± 0.345
1.186ProGly: 1.186 ± 0.69
1.779ProHis: 1.779 ± 0.731
5.931ProIle: 5.931 ± 0.8
2.966ProLys: 2.966 ± 0.042
1.186ProLeu: 1.186 ± 0.69
1.779ProMet: 1.779 ± 1.034
1.186ProAsn: 1.186 ± 0.193
0.593ProPro: 0.593 ± 0.345
1.186ProGln: 1.186 ± 0.193
2.966ProArg: 2.966 ± 0.841
1.186ProSer: 1.186 ± 1.076
5.338ProThr: 5.338 ± 3.103
2.966ProVal: 2.966 ± 1.724
1.186ProTrp: 1.186 ± 0.193
2.966ProTyr: 2.966 ± 0.924
0.0ProXaa: 0.0 ± 0.0
Gln
1.779GlnAla: 1.779 ± 0.152
0.593GlnCys: 0.593 ± 0.345
2.966GlnAsp: 2.966 ± 0.841
1.779GlnGlu: 1.779 ± 0.152
1.186GlnPhe: 1.186 ± 0.193
1.186GlnGly: 1.186 ± 0.69
2.966GlnHis: 2.966 ± 0.841
1.779GlnIle: 1.779 ± 0.731
2.372GlnLys: 2.372 ± 1.379
1.779GlnLeu: 1.779 ± 0.152
0.0GlnMet: 0.0 ± 0.0
0.593GlnAsn: 0.593 ± 0.345
0.0GlnPro: 0.0 ± 0.0
1.186GlnGln: 1.186 ± 0.69
1.186GlnArg: 1.186 ± 0.193
2.966GlnSer: 2.966 ± 0.841
1.779GlnThr: 1.779 ± 0.731
1.779GlnVal: 1.779 ± 0.152
1.186GlnTrp: 1.186 ± 0.69
1.186GlnTyr: 1.186 ± 0.69
0.0GlnXaa: 0.0 ± 0.0
Arg
2.966ArgAla: 2.966 ± 0.042
1.186ArgCys: 1.186 ± 0.69
3.559ArgAsp: 3.559 ± 1.186
5.931ArgGlu: 5.931 ± 0.8
3.559ArgPhe: 3.559 ± 0.58
2.966ArgGly: 2.966 ± 1.807
1.186ArgHis: 1.186 ± 0.193
1.186ArgIle: 1.186 ± 1.076
1.779ArgLys: 1.779 ± 0.152
4.745ArgLeu: 4.745 ± 1.656
1.186ArgMet: 1.186 ± 0.69
1.186ArgAsn: 1.186 ± 0.69
0.593ArgPro: 0.593 ± 0.345
2.966ArgGln: 2.966 ± 0.924
4.152ArgArg: 4.152 ± 0.648
2.966ArgSer: 2.966 ± 0.042
2.372ArgThr: 2.372 ± 1.269
4.745ArgVal: 4.745 ± 0.11
1.779ArgTrp: 1.779 ± 1.614
1.779ArgTyr: 1.779 ± 1.034
0.0ArgXaa: 0.0 ± 0.0
Ser
4.152SerAla: 4.152 ± 0.235
1.779SerCys: 1.779 ± 0.152
5.338SerAsp: 5.338 ± 0.428
2.966SerGlu: 2.966 ± 0.042
1.779SerPhe: 1.779 ± 0.152
4.745SerGly: 4.745 ± 0.773
1.186SerHis: 1.186 ± 0.193
4.152SerIle: 4.152 ± 0.235
6.524SerLys: 6.524 ± 1.504
4.152SerLeu: 4.152 ± 0.235
2.372SerMet: 2.372 ± 0.386
1.779SerAsn: 1.779 ± 0.731
7.711SerPro: 7.711 ± 0.951
1.779SerGln: 1.779 ± 1.034
4.745SerArg: 4.745 ± 0.993
4.745SerSer: 4.745 ± 0.11
3.559SerThr: 3.559 ± 1.186
5.931SerVal: 5.931 ± 0.083
1.779SerTrp: 1.779 ± 1.614
4.152SerTyr: 4.152 ± 1.118
0.0SerXaa: 0.0 ± 0.0
Thr
5.338ThrAla: 5.338 ± 1.338
0.0ThrCys: 0.0 ± 0.0
2.372ThrAsp: 2.372 ± 0.496
2.372ThrGlu: 2.372 ± 0.386
3.559ThrPhe: 3.559 ± 1.186
5.338ThrGly: 5.338 ± 1.338
1.779ThrHis: 1.779 ± 0.152
3.559ThrIle: 3.559 ± 0.58
3.559ThrLys: 3.559 ± 0.58
3.559ThrLeu: 3.559 ± 1.186
3.559ThrMet: 3.559 ± 0.303
2.966ThrAsn: 2.966 ± 1.807
4.152ThrPro: 4.152 ± 0.648
2.372ThrGln: 2.372 ± 1.379
3.559ThrArg: 3.559 ± 0.58
7.117ThrSer: 7.117 ± 0.276
10.676ThrThr: 10.676 ± 0.027
5.338ThrVal: 5.338 ± 2.22
4.745ThrTrp: 4.745 ± 0.11
1.779ThrTyr: 1.779 ± 0.152
0.0ThrXaa: 0.0 ± 0.0
Val
3.559ValAla: 3.559 ± 0.303
1.186ValCys: 1.186 ± 0.69
2.966ValAsp: 2.966 ± 1.807
1.779ValGlu: 1.779 ± 0.152
2.966ValPhe: 2.966 ± 0.924
2.372ValGly: 2.372 ± 0.496
0.593ValHis: 0.593 ± 0.538
2.966ValIle: 2.966 ± 0.924
5.338ValLys: 5.338 ± 2.22
5.931ValLeu: 5.931 ± 1.849
1.186ValMet: 1.186 ± 0.193
4.745ValAsn: 4.745 ± 2.758
4.745ValPro: 4.745 ± 1.876
1.186ValGln: 1.186 ± 0.69
3.559ValArg: 3.559 ± 0.58
7.711ValSer: 7.711 ± 0.951
2.966ValThr: 2.966 ± 0.042
6.524ValVal: 6.524 ± 0.262
0.0ValTrp: 0.0 ± 0.0
3.559ValTyr: 3.559 ± 0.58
0.0ValXaa: 0.0 ± 0.0
Trp
2.372TrpAla: 2.372 ± 1.269
0.0TrpCys: 0.0 ± 0.0
1.779TrpAsp: 1.779 ± 0.152
0.593TrpGlu: 0.593 ± 0.538
0.593TrpPhe: 0.593 ± 0.538
2.372TrpGly: 2.372 ± 1.269
0.0TrpHis: 0.0 ± 0.0
2.372TrpIle: 2.372 ± 1.379
1.779TrpLys: 1.779 ± 0.731
2.372TrpLeu: 2.372 ± 2.152
2.372TrpMet: 2.372 ± 0.238
1.186TrpAsn: 1.186 ± 0.69
0.593TrpPro: 0.593 ± 0.345
0.593TrpGln: 0.593 ± 0.345
1.779TrpArg: 1.779 ± 0.731
1.779TrpSer: 1.779 ± 0.152
1.779TrpThr: 1.779 ± 1.034
1.186TrpVal: 1.186 ± 0.193
0.0TrpTrp: 0.0 ± 0.0
2.372TrpTyr: 2.372 ± 0.386
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.779TyrAla: 1.779 ± 0.731
0.593TyrCys: 0.593 ± 0.538
2.966TyrAsp: 2.966 ± 0.042
1.779TyrGlu: 1.779 ± 0.152
0.593TyrPhe: 0.593 ± 0.345
4.745TyrGly: 4.745 ± 0.11
1.186TyrHis: 1.186 ± 0.69
1.779TyrIle: 1.779 ± 0.731
4.152TyrLys: 4.152 ± 0.235
4.152TyrLeu: 4.152 ± 0.235
1.186TyrMet: 1.186 ± 1.076
4.152TyrAsn: 4.152 ± 0.648
0.593TyrPro: 0.593 ± 0.538
1.186TyrGln: 1.186 ± 0.69
3.559TyrArg: 3.559 ± 1.462
3.559TyrSer: 3.559 ± 0.303
2.372TyrThr: 2.372 ± 1.269
2.966TyrVal: 2.966 ± 0.042
1.186TyrTrp: 1.186 ± 0.193
0.593TyrTyr: 0.593 ± 0.345
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1687 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski