Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_363

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
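The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function name `dipeptide_per_mille` and the toy sequences are made up, and it assumes that dipeptides are counted as overlapping pairs within each protein and that any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2; pairs never span two different proteins.
        for i in range(len(seq) - 1):
            pair = seq[i:i + 2]
            # Assumption: anything outside the 20 standard letters becomes X (Xaa).
            pair = "".join(c if c in AMINO_ACIDS else "X" for c in pair)
            counts[pair] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the Cap3_SP_363 proteome.
toy_proteins = ["MKAAD", "MAAK"]
freqs = dipeptide_per_mille(toy_proteins)

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
uniform_expectation = 1000.0 / len(AMINO_ACIDS) ** 2
```

Under the uniform expectation, a table value above 2.5‰ would count as overrepresented and a value below it as underrepresented; whether the page uses exactly this threshold for its colouring is not stated.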

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
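As a small worked example of this conversion, the sketch below turns a per mille value into a fraction and into an approximate raw count. The helper names are hypothetical. The total of 1667 dipeptides is an assumption: it follows from the 4 proteins and 1671 amino acids reported for this proteome if pairs are counted within each protein (1671 - 4), and it is consistent with the smallest non-zero table value, 0.599‰, corresponding to a single occurrence.

```python
# Assumption: overlapping pairs counted within each of the 4 proteins.
TOTAL_DIPEPTIDES = 1671 - 4


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_count(value, total=TOTAL_DIPEPTIDES):
    """Estimate the raw number of occurrences behind a per mille value."""
    return round(per_mille_to_fraction(value) * total)


ala_ala = 2.994                             # AlaAla value from the table
fraction = per_mille_to_fraction(ala_ala)   # about 0.002994, i.e. 0.2994%
count = per_mille_to_count(ala_ala)         # about 5 occurrences
```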

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.994 ± 2.688
AlaCys: 0.599 ± 0.447
AlaAsp: 5.389 ± 3.004
AlaGlu: 0.599 ± 0.447
AlaPhe: 2.994 ± 1.402
AlaGly: 5.389 ± 3.535
AlaHis: 0.599 ± 0.462
AlaIle: 4.192 ± 0.739
AlaLys: 3.593 ± 1.028
AlaLeu: 4.192 ± 1.379
AlaMet: 1.796 ± 0.318
AlaAsn: 4.192 ± 0.739
AlaPro: 1.796 ± 1.387
AlaGln: 2.994 ± 1.329
AlaArg: 1.198 ± 0.925
AlaSer: 4.79 ± 3.09
AlaThr: 4.192 ± 2.155
AlaVal: 3.593 ± 1.254
AlaTrp: 1.198 ± 0.369
AlaTyr: 2.994 ± 1.38
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.599 ± 0.447
CysPhe: 0.599 ± 0.447
CysGly: 1.198 ± 0.895
CysHis: 0.0 ± 0.0
CysIle: 1.198 ± 0.895
CysLys: 1.198 ± 0.369
CysLeu: 1.198 ± 0.895
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.599 ± 0.447
CysGln: 0.0 ± 0.0
CysArg: 0.599 ± 0.447
CysSer: 0.599 ± 0.447
CysThr: 0.0 ± 0.0
CysVal: 0.599 ± 1.205
CysTrp: 0.0 ± 0.0
CysTyr: 0.599 ± 0.462
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.593 ± 2.078
AspCys: 1.198 ± 0.895
AspAsp: 3.593 ± 1.151
AspGlu: 3.593 ± 2.116
AspPhe: 5.389 ± 1.557
AspGly: 3.593 ± 1.414
AspHis: 1.198 ± 0.925
AspIle: 2.994 ± 1.38
AspLys: 6.587 ± 1.075
AspLeu: 5.389 ± 1.886
AspMet: 1.198 ± 0.534
AspAsn: 4.192 ± 1.408
AspPro: 2.395 ± 0.966
AspGln: 1.796 ± 0.999
AspArg: 4.79 ± 1.797
AspSer: 6.587 ± 2.104
AspThr: 2.994 ± 0.798
AspVal: 6.587 ± 1.188
AspTrp: 1.796 ± 1.407
AspTyr: 5.988 ± 1.99
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.395 ± 1.276
GluCys: 0.599 ± 0.447
GluAsp: 2.994 ± 0.94
GluGlu: 2.395 ± 1.17
GluPhe: 2.994 ± 0.973
GluGly: 3.593 ± 1.019
GluHis: 1.796 ± 1.37
GluIle: 4.192 ± 0.737
GluLys: 2.395 ± 1.087
GluLeu: 3.593 ± 1.362
GluMet: 1.198 ± 1.075
GluAsn: 2.994 ± 0.964
GluPro: 0.0 ± 0.0
GluGln: 0.599 ± 0.538
GluArg: 2.395 ± 1.294
GluSer: 1.198 ± 0.592
GluThr: 1.198 ± 1.225
GluVal: 2.994 ± 0.94
GluTrp: 0.0 ± 0.0
GluTyr: 6.587 ± 1.929
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.796 ± 0.707
PheCys: 0.599 ± 0.447
PheAsp: 9.581 ± 0.49
PheGlu: 1.198 ± 1.21
PhePhe: 4.192 ± 0.737
PheGly: 2.395 ± 0.738
PheHis: 0.599 ± 0.447
PheIle: 2.395 ± 1.087
PheLys: 3.593 ± 1.293
PheLeu: 1.796 ± 0.999
PheMet: 0.0 ± 0.0
PheAsn: 3.593 ± 1.689
PhePro: 0.0 ± 0.0
PheGln: 0.599 ± 0.447
PheArg: 1.796 ± 1.342
PheSer: 5.389 ± 1.924
PheThr: 3.593 ± 2.169
PheVal: 3.593 ± 1.355
PheTrp: 0.599 ± 0.447
PheTyr: 2.395 ± 1.294
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.593 ± 2.552
GlyCys: 0.599 ± 1.205
GlyAsp: 4.192 ± 1.555
GlyGlu: 4.79 ± 0.819
GlyPhe: 4.192 ± 0.737
GlyGly: 4.192 ± 0.739
GlyHis: 1.198 ± 0.925
GlyIle: 7.784 ± 1.158
GlyLys: 4.192 ± 0.757
GlyLeu: 3.593 ± 2.078
GlyMet: 1.796 ± 0.318
GlyAsn: 4.79 ± 1.072
GlyPro: 0.0 ± 0.0
GlyGln: 2.395 ± 1.126
GlyArg: 1.198 ± 0.369
GlySer: 4.79 ± 1.11
GlyThr: 3.593 ± 2.169
GlyVal: 2.395 ± 1.287
GlyTrp: 0.0 ± 0.0
GlyTyr: 4.192 ± 1.838
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.198 ± 0.563
HisCys: 0.0 ± 0.0
HisAsp: 1.198 ± 0.369
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 1.198 ± 0.895
HisIle: 0.599 ± 0.462
HisLys: 0.599 ± 0.447
HisLeu: 3.593 ± 1.414
HisMet: 0.0 ± 0.0
HisAsn: 0.599 ± 0.447
HisPro: 1.198 ± 0.369
HisGln: 1.198 ± 0.369
HisArg: 1.796 ± 1.342
HisSer: 0.599 ± 0.462
HisThr: 0.0 ± 0.0
HisVal: 1.198 ± 1.21
HisTrp: 0.0 ± 0.0
HisTyr: 3.593 ± 2.145
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.395 ± 0.534
IleCys: 0.0 ± 0.0
IleAsp: 5.389 ± 0.854
IleGlu: 3.593 ± 1.151
IlePhe: 1.796 ± 0.318
IleGly: 4.192 ± 1.306
IleHis: 0.0 ± 0.0
IleIle: 2.395 ± 1.198
IleLys: 5.389 ± 1.532
IleLeu: 1.796 ± 1.387
IleMet: 0.599 ± 1.205
IleAsn: 8.383 ± 2.49
IlePro: 2.994 ± 1.584
IleGln: 1.796 ± 1.039
IleArg: 3.593 ± 2.739
IleSer: 5.988 ± 1.246
IleThr: 3.593 ± 0.816
IleVal: 1.796 ± 0.707
IleTrp: 0.0 ± 0.0
IleTyr: 1.796 ± 1.342
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.79 ± 2.275
LysCys: 0.599 ± 0.447
LysAsp: 5.389 ± 1.953
LysGlu: 2.395 ± 0.528
LysPhe: 4.79 ± 1.406
LysGly: 2.994 ± 0.798
LysHis: 1.796 ± 0.677
LysIle: 4.79 ± 1.068
LysLys: 2.994 ± 1.458
LysLeu: 6.587 ± 1.873
LysMet: 3.593 ± 0.702
LysAsn: 2.994 ± 1.518
LysPro: 1.796 ± 0.318
LysGln: 2.395 ± 1.185
LysArg: 1.198 ± 0.369
LysSer: 3.593 ± 1.107
LysThr: 3.593 ± 1.028
LysVal: 4.192 ± 0.739
LysTrp: 0.599 ± 0.462
LysTyr: 4.192 ± 1.848
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.395 ± 1.287
LeuCys: 1.198 ± 0.369
LeuAsp: 4.79 ± 1.605
LeuGlu: 2.994 ± 0.831
LeuPhe: 1.796 ± 0.677
LeuGly: 3.593 ± 0.717
LeuHis: 0.599 ± 0.447
LeuIle: 4.79 ± 3.611
LeuLys: 5.389 ± 0.954
LeuLeu: 7.186 ± 1.08
LeuMet: 0.599 ± 0.416
LeuAsn: 8.383 ± 2.838
LeuPro: 1.198 ± 0.369
LeuGln: 4.79 ± 1.11
LeuArg: 4.192 ± 0.737
LeuSer: 7.784 ± 1.108
LeuThr: 4.192 ± 0.737
LeuVal: 2.395 ± 0.528
LeuTrp: 0.0 ± 0.0
LeuTyr: 4.192 ± 1.095
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.599 ± 0.462
MetCys: 0.0 ± 0.0
MetAsp: 0.599 ± 0.462
MetGlu: 0.599 ± 0.462
MetPhe: 1.796 ± 1.039
MetGly: 1.198 ± 0.563
MetHis: 0.599 ± 0.462
MetIle: 1.198 ± 1.214
MetLys: 0.599 ± 0.462
MetLeu: 1.198 ± 0.369
MetMet: 0.599 ± 0.538
MetAsn: 2.395 ± 0.803
MetPro: 0.599 ± 0.447
MetGln: 1.796 ± 0.318
MetArg: 1.198 ± 1.214
MetSer: 3.593 ± 1.028
MetThr: 0.599 ± 0.462
MetVal: 2.994 ± 0.798
MetTrp: 0.0 ± 0.0
MetTyr: 1.198 ± 0.369
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 7.186 ± 1.862
AsnCys: 0.599 ± 0.447
AsnAsp: 2.994 ± 0.995
AsnGlu: 5.389 ± 1.502
AsnPhe: 3.593 ± 1.956
AsnGly: 5.389 ± 1.453
AsnHis: 0.599 ± 0.538
AsnIle: 4.192 ± 1.798
AsnLys: 3.593 ± 1.687
AsnLeu: 7.784 ± 1.27
AsnMet: 0.0 ± 0.931
AsnAsn: 6.587 ± 3.281
AsnPro: 3.593 ± 1.687
AsnGln: 2.994 ± 0.798
AsnArg: 5.389 ± 0.546
AsnSer: 9.581 ± 1.744
AsnThr: 1.796 ± 0.318
AsnVal: 2.395 ± 1.198
AsnTrp: 0.0 ± 0.0
AsnTyr: 5.389 ± 1.241
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.599 ± 0.538
ProCys: 0.0 ± 0.0
ProAsp: 0.0 ± 0.0
ProGlu: 0.599 ± 0.462
ProPhe: 0.0 ± 0.0
ProGly: 2.395 ± 1.287
ProHis: 0.599 ± 0.447
ProIle: 1.198 ± 0.369
ProLys: 0.0 ± 0.0
ProLeu: 3.593 ± 1.385
ProMet: 3.593 ± 0.717
ProAsn: 1.796 ± 1.387
ProPro: 0.0 ± 0.0
ProGln: 1.198 ± 1.21
ProArg: 0.599 ± 0.462
ProSer: 2.994 ± 3.569
ProThr: 0.599 ± 0.462
ProVal: 1.796 ± 0.901
ProTrp: 0.0 ± 0.0
ProTyr: 2.994 ± 0.431
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.593 ± 2.552
GlnCys: 0.0 ± 0.0
GlnAsp: 2.994 ± 1.38
GlnGlu: 1.796 ± 0.901
GlnPhe: 1.198 ± 0.895
GlnGly: 4.79 ± 1.056
GlnHis: 0.599 ± 0.462
GlnIle: 1.796 ± 0.879
GlnLys: 1.198 ± 1.075
GlnLeu: 2.395 ± 0.528
GlnMet: 0.599 ± 0.462
GlnAsn: 2.994 ± 2.024
GlnPro: 0.599 ± 0.462
GlnGln: 1.796 ± 0.879
GlnArg: 4.79 ± 1.519
GlnSer: 3.593 ± 1.254
GlnThr: 1.796 ± 1.039
GlnVal: 2.395 ± 1.276
GlnTrp: 1.198 ± 1.075
GlnTyr: 1.796 ± 0.707
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.198 ± 0.369
ArgCys: 0.0 ± 0.0
ArgAsp: 4.192 ± 0.911
ArgGlu: 4.192 ± 1.407
ArgPhe: 2.395 ± 0.738
ArgGly: 1.198 ± 0.592
ArgHis: 1.796 ± 1.342
ArgIle: 1.198 ± 0.369
ArgLys: 4.192 ± 1.457
ArgLeu: 4.79 ± 2.301
ArgMet: 0.0 ± 0.0
ArgAsn: 3.593 ± 1.107
ArgPro: 0.599 ± 0.462
ArgGln: 3.593 ± 0.636
ArgArg: 0.599 ± 0.462
ArgSer: 1.796 ± 0.707
ArgThr: 1.198 ± 0.895
ArgVal: 3.593 ± 1.355
ArgTrp: 0.599 ± 0.462
ArgTyr: 5.988 ± 3.036
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.784 ± 1.802
SerCys: 1.796 ± 1.342
SerAsp: 5.389 ± 0.955
SerGlu: 1.796 ± 0.677
SerPhe: 1.796 ± 0.999
SerGly: 4.79 ± 1.727
SerHis: 1.198 ± 0.895
SerIle: 2.994 ± 1.363
SerLys: 6.587 ± 1.188
SerLeu: 6.587 ± 2.051
SerMet: 1.198 ± 0.925
SerAsn: 5.389 ± 0.969
SerPro: 1.796 ± 0.879
SerGln: 2.994 ± 1.402
SerArg: 4.192 ± 1.339
SerSer: 4.79 ± 2.345
SerThr: 5.988 ± 3.229
SerVal: 6.587 ± 2.104
SerTrp: 1.198 ± 0.563
SerTyr: 4.192 ± 1.336
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.79 ± 2.271
ThrCys: 0.599 ± 0.462
ThrAsp: 1.796 ± 0.318
ThrGlu: 4.192 ± 1.096
ThrPhe: 1.796 ± 1.407
ThrGly: 1.796 ± 0.879
ThrHis: 2.395 ± 0.534
ThrIle: 1.796 ± 0.318
ThrLys: 2.994 ± 0.431
ThrLeu: 2.395 ± 1.17
ThrMet: 2.994 ± 1.402
ThrAsn: 4.192 ± 1.137
ThrPro: 1.796 ± 1.407
ThrGln: 1.796 ± 0.707
ThrArg: 2.395 ± 1.136
ThrSer: 4.192 ± 3.237
ThrThr: 2.395 ± 1.287
ThrVal: 1.796 ± 1.157
ThrTrp: 0.599 ± 0.462
ThrTyr: 2.395 ± 0.738
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.192 ± 0.757
ValCys: 0.0 ± 0.0
ValAsp: 9.581 ± 2.384
ValGlu: 1.796 ± 0.677
ValPhe: 2.395 ± 0.738
ValGly: 3.593 ± 1.632
ValHis: 0.599 ± 0.447
ValIle: 2.994 ± 1.515
ValLys: 4.79 ± 1.068
ValLeu: 1.796 ± 0.677
ValMet: 1.198 ± 0.563
ValAsn: 4.192 ± 1.096
ValPro: 2.395 ± 1.198
ValGln: 4.192 ± 0.786
ValArg: 1.796 ± 0.318
ValSer: 2.994 ± 2.296
ValThr: 4.192 ± 3.237
ValVal: 3.593 ± 2.352
ValTrp: 0.599 ± 0.462
ValTyr: 4.192 ± 1.375
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.599 ± 0.447
TrpGlu: 0.599 ± 0.462
TrpPhe: 1.198 ± 0.563
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.599 ± 0.462
TrpMet: 0.599 ± 0.462
TrpAsn: 1.796 ± 1.039
TrpPro: 0.0 ± 0.0
TrpGln: 0.599 ± 0.447
TrpArg: 0.599 ± 0.447
TrpSer: 0.599 ± 1.205
TrpThr: 0.599 ± 0.462
TrpVal: 0.599 ± 0.462
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.198 ± 0.925
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.192 ± 1.273
TyrCys: 0.599 ± 0.447
TyrAsp: 4.79 ± 1.408
TyrGlu: 2.994 ± 1.029
TyrPhe: 4.79 ± 2.841
TyrGly: 7.186 ± 0.364
TyrHis: 1.198 ± 0.369
TyrIle: 4.79 ± 2.174
TyrLys: 5.389 ± 0.853
TyrLeu: 2.395 ± 0.534
TyrMet: 1.198 ± 1.016
TyrAsn: 6.587 ± 1.584
TyrPro: 1.198 ± 0.895
TyrGln: 2.395 ± 1.287
TyrArg: 2.395 ± 0.738
TyrSer: 3.593 ± 0.717
TyrThr: 2.994 ± 0.995
TyrVal: 5.988 ± 1.191
TyrTrp: 1.198 ± 0.895
TyrTyr: 2.994 ± 2.236
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1671 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level
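One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported as the error. The exact procedure is not described here, so this is an assumption rather than the database's actual method; the function names, the use of the population standard deviation, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread (per mille) of one dipeptide frequency over bootstrap replicates."""
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation across replicates.
    return statistics.pstdev(estimates)


# Hypothetical toy proteome, not the Cap3_SP_363 sequences.
toy_proteins = ["MKAAD", "MAAK", "MKKLV", "MAADA"]
error_aa = bootstrap_error(toy_proteins, "AA")
```

With only 4 proteins in the proteome, each replicate can differ considerably from the original set, which would explain why many of the errors in the table are comparable in size to the values themselves.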

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski