Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_407

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red, and underrepresented ones are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
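The calculation described above can be sketched in Python. This is a minimal illustration, not the code used to produce this page: the function names `dipeptide_per_mille` and `classify` are hypothetical, and the choice to count dipeptides only within each protein (never across protein boundaries) is an assumption, since the page does not state how boundaries are handled.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Dipeptides are counted with an overlapping window of width 2 inside
    each sequence (assumption: no pairs across protein boundaries).
    Any residue outside the 20 standard codes is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = AMINO_ACIDS + "X"  # 21 symbols -> 441 possible pairs
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


def classify(per_mille, expected=1000.0 / 400):
    """Compare a value with the uniform expectation of 2.5 per mille."""
    if per_mille > expected:
        return "over"   # shown in red on the page
    if per_mille < expected:
        return "under"  # shown in blue on the page
    return "expected"
```

As a check on the units, a single occurrence among 1590 dipeptides gives 1000/1590 ≈ 0.629‰, which matches the smallest non-zero value in the table; multiplying by 10⁻³ gives the fraction 0.000629.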

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.289AlaAla: 6.289 ± 1.569
0.629AlaCys: 0.629 ± 0.551
1.887AlaAsp: 1.887 ± 0.377
4.403AlaGlu: 4.403 ± 2.616
3.774AlaPhe: 3.774 ± 1.587
1.887AlaGly: 1.887 ± 1.04
1.258AlaHis: 1.258 ± 0.704
0.629AlaIle: 0.629 ± 0.551
2.516AlaLys: 2.516 ± 1.169
3.774AlaLeu: 3.774 ± 2.112
1.258AlaMet: 1.258 ± 0.567
5.031AlaAsn: 5.031 ± 3.678
1.887AlaPro: 1.887 ± 0.741
2.516AlaGln: 2.516 ± 0.871
5.031AlaArg: 5.031 ± 2.139
4.403AlaSer: 4.403 ± 0.997
2.516AlaThr: 2.516 ± 1.584
2.516AlaVal: 2.516 ± 1.925
1.887AlaTrp: 1.887 ± 0.741
3.774AlaTyr: 3.774 ± 2.33
0.0AlaXaa: 0.0 ± 0.0
Cys
1.887CysAla: 1.887 ± 0.969
0.629CysCys: 0.629 ± 0.551
1.258CysAsp: 1.258 ± 0.869
0.0CysGlu: 0.0 ± 0.0
0.629CysPhe: 0.629 ± 0.822
0.629CysGly: 0.629 ± 0.551
0.0CysHis: 0.0 ± 0.0
0.629CysIle: 0.629 ± 0.822
0.629CysLys: 0.629 ± 0.551
0.629CysLeu: 0.629 ± 0.481
0.0CysMet: 0.0 ± 0.0
0.629CysAsn: 0.629 ± 0.551
0.629CysPro: 0.629 ± 0.481
0.629CysGln: 0.629 ± 0.822
0.0CysArg: 0.0 ± 0.0
0.629CysSer: 0.629 ± 0.551
1.887CysThr: 1.887 ± 0.66
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.887CysTyr: 1.887 ± 1.175
0.0CysXaa: 0.0 ± 0.0
Asp
2.516AspAla: 2.516 ± 0.756
0.629AspCys: 0.629 ± 0.736
4.403AspAsp: 4.403 ± 2.155
5.66AspGlu: 5.66 ± 0.908
4.403AspPhe: 4.403 ± 1.302
1.887AspGly: 1.887 ± 0.741
1.258AspHis: 1.258 ± 0.442
5.031AspIle: 5.031 ± 1.414
2.516AspLys: 2.516 ± 0.994
3.145AspLeu: 3.145 ± 1.338
1.258AspMet: 1.258 ± 0.962
4.403AspAsn: 4.403 ± 0.643
3.774AspPro: 3.774 ± 1.803
0.629AspGln: 0.629 ± 0.551
2.516AspArg: 2.516 ± 0.756
8.176AspSer: 8.176 ± 1.72
2.516AspThr: 2.516 ± 0.502
3.145AspVal: 3.145 ± 1.338
0.629AspTrp: 0.629 ± 0.551
3.774AspTyr: 3.774 ± 0.757
0.0AspXaa: 0.0 ± 0.0
Glu
1.258GluAla: 1.258 ± 0.442
0.629GluCys: 0.629 ± 0.579
1.258GluAsp: 1.258 ± 0.442
3.774GluGlu: 3.774 ± 2.821
2.516GluPhe: 2.516 ± 1.229
1.887GluGly: 1.887 ± 1.04
0.0GluHis: 0.0 ± 0.0
1.258GluIle: 1.258 ± 0.869
4.403GluLys: 4.403 ± 1.317
3.774GluLeu: 3.774 ± 2.079
1.258GluMet: 1.258 ± 0.442
1.887GluAsn: 1.887 ± 1.401
0.0GluPro: 0.0 ± 0.0
3.145GluGln: 3.145 ± 1.022
3.145GluArg: 3.145 ± 0.79
5.66GluSer: 5.66 ± 1.952
2.516GluThr: 2.516 ± 1.251
4.403GluVal: 4.403 ± 1.126
2.516GluTrp: 2.516 ± 0.871
4.403GluTyr: 4.403 ± 1.53
0.0GluXaa: 0.0 ± 0.0
Phe
4.403PheAla: 4.403 ± 1.126
1.258PheCys: 1.258 ± 0.869
4.403PheAsp: 4.403 ± 1.627
0.0PheGlu: 0.0 ± 0.0
5.031PhePhe: 5.031 ± 1.353
3.774PheGly: 3.774 ± 1.445
0.629PheHis: 0.629 ± 0.551
1.887PheIle: 1.887 ± 1.139
1.887PheLys: 1.887 ± 1.622
3.774PheLeu: 3.774 ± 0.901
3.145PheMet: 3.145 ± 1.605
1.258PheAsn: 1.258 ± 0.828
1.258PhePro: 1.258 ± 0.567
0.629PheGln: 0.629 ± 0.481
3.774PheArg: 3.774 ± 2.475
6.289PheSer: 6.289 ± 1.226
2.516PheThr: 2.516 ± 0.883
5.031PheVal: 5.031 ± 1.584
0.629PheTrp: 0.629 ± 0.481
3.145PheTyr: 3.145 ± 0.816
0.0PheXaa: 0.0 ± 0.0
Gly
2.516GlyAla: 2.516 ± 1.408
0.629GlyCys: 0.629 ± 0.822
6.289GlyAsp: 6.289 ± 1.203
3.145GlyGlu: 3.145 ± 0.908
2.516GlyPhe: 2.516 ± 1.298
2.516GlyGly: 2.516 ± 0.502
0.0GlyHis: 0.0 ± 0.0
5.66GlyIle: 5.66 ± 2.363
4.403GlyLys: 4.403 ± 1.212
1.887GlyLeu: 1.887 ± 0.969
1.258GlyMet: 1.258 ± 0.962
1.887GlyAsn: 1.887 ± 0.377
1.258GlyPro: 1.258 ± 1.102
5.031GlyGln: 5.031 ± 2.596
1.887GlyArg: 1.887 ± 0.741
5.66GlySer: 5.66 ± 1.41
3.145GlyThr: 3.145 ± 1.627
3.145GlyVal: 3.145 ± 1.7
0.629GlyTrp: 0.629 ± 0.551
1.887GlyTyr: 1.887 ± 1.139
0.0GlyXaa: 0.0 ± 0.0
His
0.629HisAla: 0.629 ± 0.551
0.0HisCys: 0.0 ± 0.0
1.258HisAsp: 1.258 ± 1.102
0.629HisGlu: 0.629 ± 0.551
2.516HisPhe: 2.516 ± 0.947
0.629HisGly: 0.629 ± 0.551
0.629HisHis: 0.629 ± 0.551
0.629HisIle: 0.629 ± 0.551
0.0HisLys: 0.0 ± 0.0
3.145HisLeu: 3.145 ± 0.908
0.0HisMet: 0.0 ± 0.0
0.629HisAsn: 0.629 ± 0.481
1.258HisPro: 1.258 ± 0.828
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.258HisSer: 1.258 ± 0.442
0.629HisThr: 0.629 ± 0.579
0.629HisVal: 0.629 ± 0.551
0.0HisTrp: 0.0 ± 0.0
2.516HisTyr: 2.516 ± 1.395
0.0HisXaa: 0.0 ± 0.0
Ile
3.774IleAla: 3.774 ± 1.506
0.0IleCys: 0.0 ± 0.0
5.031IleAsp: 5.031 ± 0.877
1.887IleGlu: 1.887 ± 0.878
1.887IlePhe: 1.887 ± 0.875
3.145IleGly: 3.145 ± 0.626
1.258IleHis: 1.258 ± 0.704
3.774IleIle: 3.774 ± 1.051
3.145IleLys: 3.145 ± 1.672
3.774IleLeu: 3.774 ± 1.587
1.887IleMet: 1.887 ± 0.722
2.516IleAsn: 2.516 ± 0.871
1.258IlePro: 1.258 ± 0.869
0.629IleGln: 0.629 ± 0.579
1.258IleArg: 1.258 ± 0.442
3.774IleSer: 3.774 ± 0.567
6.918IleThr: 6.918 ± 1.993
2.516IleVal: 2.516 ± 1.484
0.0IleTrp: 0.0 ± 0.0
5.031IleTyr: 5.031 ± 1.117
0.0IleXaa: 0.0 ± 0.0
Lys
1.887LysAla: 1.887 ± 1.04
1.887LysCys: 1.887 ± 0.722
1.887LysAsp: 1.887 ± 0.875
3.774LysGlu: 3.774 ± 1.803
1.887LysPhe: 1.887 ± 1.622
4.403LysGly: 4.403 ± 1.342
1.258LysHis: 1.258 ± 1.102
1.887LysIle: 1.887 ± 0.878
2.516LysLys: 2.516 ± 0.502
6.918LysLeu: 6.918 ± 1.713
3.145LysMet: 3.145 ± 1.642
1.887LysAsn: 1.887 ± 0.722
1.258LysPro: 1.258 ± 0.442
1.887LysGln: 1.887 ± 1.04
3.145LysArg: 3.145 ± 1.542
3.145LysSer: 3.145 ± 1.096
2.516LysThr: 2.516 ± 1.584
3.145LysVal: 3.145 ± 1.121
0.0LysTrp: 0.0 ± 0.0
5.031LysTyr: 5.031 ± 1.558
0.0LysXaa: 0.0 ± 0.0
Leu
5.031LeuAla: 5.031 ± 1.354
1.887LeuCys: 1.887 ± 2.467
3.774LeuAsp: 3.774 ± 1.833
3.145LeuGlu: 3.145 ± 2.256
2.516LeuPhe: 2.516 ± 0.851
8.176LeuGly: 8.176 ± 2.649
1.258LeuHis: 1.258 ± 0.442
3.774LeuIle: 3.774 ± 1.372
5.66LeuLys: 5.66 ± 1.76
6.289LeuLeu: 6.289 ± 1.071
0.629LeuMet: 0.629 ± 0.466
4.403LeuAsn: 4.403 ± 0.963
5.66LeuPro: 5.66 ± 1.482
3.774LeuGln: 3.774 ± 1.654
5.66LeuArg: 5.66 ± 0.782
6.918LeuSer: 6.918 ± 1.315
4.403LeuThr: 4.403 ± 2.163
2.516LeuVal: 2.516 ± 1.679
0.629LeuTrp: 0.629 ± 0.579
5.66LeuTyr: 5.66 ± 3.467
0.0LeuXaa: 0.0 ± 0.0
Met
1.258MetAla: 1.258 ± 0.962
0.0MetCys: 0.0 ± 0.0
0.629MetAsp: 0.629 ± 0.579
0.629MetGlu: 0.629 ± 0.822
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.629MetHis: 0.629 ± 0.822
1.258MetIle: 1.258 ± 0.885
0.0MetLys: 0.0 ± 0.0
1.258MetLeu: 1.258 ± 0.567
0.629MetMet: 0.629 ± 0.481
1.258MetAsn: 1.258 ± 0.567
0.629MetPro: 0.629 ± 0.551
0.0MetGln: 0.0 ± 0.0
3.774MetArg: 3.774 ± 0.755
3.774MetSer: 3.774 ± 1.54
2.516MetThr: 2.516 ± 0.502
0.0MetVal: 0.0 ± 0.0
0.629MetTrp: 0.629 ± 0.551
1.887MetTyr: 1.887 ± 0.9
0.0MetXaa: 0.0 ± 0.0
Asn
3.774AsnAla: 3.774 ± 2.716
0.629AsnCys: 0.629 ± 0.551
3.774AsnAsp: 3.774 ± 0.532
4.403AsnGlu: 4.403 ± 2.856
3.145AsnPhe: 3.145 ± 1.899
4.403AsnGly: 4.403 ± 2.436
0.0AsnHis: 0.0 ± 0.0
5.031AsnIle: 5.031 ± 0.877
1.887AsnLys: 1.887 ± 0.958
4.403AsnLeu: 4.403 ± 0.769
0.629AsnMet: 0.629 ± 0.822
3.774AsnAsn: 3.774 ± 2.179
5.031AsnPro: 5.031 ± 1.133
1.887AsnGln: 1.887 ± 0.878
1.887AsnArg: 1.887 ± 0.878
2.516AsnSer: 2.516 ± 1.169
3.145AsnThr: 3.145 ± 1.711
5.031AsnVal: 5.031 ± 2.003
0.0AsnTrp: 0.0 ± 0.0
1.887AsnTyr: 1.887 ± 0.66
0.0AsnXaa: 0.0 ± 0.0
Pro
0.629ProAla: 0.629 ± 0.481
0.0ProCys: 0.0 ± 0.0
2.516ProAsp: 2.516 ± 0.92
1.887ProGlu: 1.887 ± 0.66
3.145ProPhe: 3.145 ± 1.366
0.629ProGly: 0.629 ± 0.481
2.516ProHis: 2.516 ± 1.672
2.516ProIle: 2.516 ± 1.169
1.887ProLys: 1.887 ± 0.377
6.289ProLeu: 6.289 ± 1.708
0.629ProMet: 0.629 ± 0.551
2.516ProAsn: 2.516 ± 0.659
1.887ProPro: 1.887 ± 1.466
3.145ProGln: 3.145 ± 1.695
2.516ProArg: 2.516 ± 1.672
5.66ProSer: 5.66 ± 1.276
1.887ProThr: 1.887 ± 0.66
1.887ProVal: 1.887 ± 0.66
0.0ProTrp: 0.0 ± 0.0
5.031ProTyr: 5.031 ± 1.767
0.0ProXaa: 0.0 ± 0.0
Gln
1.887GlnAla: 1.887 ± 1.04
0.0GlnCys: 0.0 ± 0.0
3.145GlnAsp: 3.145 ± 1.43
1.887GlnGlu: 1.887 ± 0.878
2.516GlnPhe: 2.516 ± 1.169
3.774GlnGly: 3.774 ± 1.364
0.629GlnHis: 0.629 ± 0.481
2.516GlnIle: 2.516 ± 0.871
0.0GlnLys: 0.0 ± 0.0
5.66GlnLeu: 5.66 ± 1.435
0.629GlnMet: 0.629 ± 0.894
2.516GlnAsn: 2.516 ± 0.659
4.403GlnPro: 4.403 ± 1.951
1.887GlnGln: 1.887 ± 1.04
1.258GlnArg: 1.258 ± 1.157
2.516GlnSer: 2.516 ± 1.395
2.516GlnThr: 2.516 ± 1.584
3.145GlnVal: 3.145 ± 0.79
0.0GlnTrp: 0.0 ± 0.0
1.887GlnTyr: 1.887 ± 0.741
0.0GlnXaa: 0.0 ± 0.0
Arg
1.258ArgAla: 1.258 ± 1.102
1.887ArgCys: 1.887 ± 0.875
3.774ArgAsp: 3.774 ± 2.475
3.145ArgGlu: 3.145 ± 0.583
1.887ArgPhe: 1.887 ± 0.722
1.258ArgGly: 1.258 ± 1.102
0.0ArgHis: 0.0 ± 0.0
3.145ArgIle: 3.145 ± 1.572
4.403ArgLys: 4.403 ± 2.208
3.145ArgLeu: 3.145 ± 0.97
0.0ArgMet: 0.0 ± 0.0
5.031ArgAsn: 5.031 ± 1.004
2.516ArgPro: 2.516 ± 1.395
0.629ArgGln: 0.629 ± 0.579
1.887ArgArg: 1.887 ± 0.878
6.918ArgSer: 6.918 ± 1.008
1.258ArgThr: 1.258 ± 0.567
3.774ArgVal: 3.774 ± 1.609
0.629ArgTrp: 0.629 ± 0.551
1.887ArgTyr: 1.887 ± 0.741
0.0ArgXaa: 0.0 ± 0.0
Ser
3.774SerAla: 3.774 ± 1.701
1.258SerCys: 1.258 ± 0.962
2.516SerAsp: 2.516 ± 1.591
3.774SerGlu: 3.774 ± 0.757
4.403SerPhe: 4.403 ± 1.182
5.66SerGly: 5.66 ± 2.202
1.887SerHis: 1.887 ± 0.875
5.66SerIle: 5.66 ± 1.631
6.289SerLys: 6.289 ± 1.226
12.579SerLeu: 12.579 ± 0.926
0.629SerMet: 0.629 ± 0.551
3.774SerAsn: 3.774 ± 1.25
6.289SerPro: 6.289 ± 1.139
4.403SerGln: 4.403 ± 2.163
5.66SerArg: 5.66 ± 1.433
9.434SerSer: 9.434 ± 1.21
5.031SerThr: 5.031 ± 1.133
5.031SerVal: 5.031 ± 0.912
1.258SerTrp: 1.258 ± 0.442
3.774SerTyr: 3.774 ± 1.325
0.0SerXaa: 0.0 ± 0.0
Thr
6.289ThrAla: 6.289 ± 2.128
0.0ThrCys: 0.0 ± 0.0
3.145ThrAsp: 3.145 ± 1.695
1.887ThrGlu: 1.887 ± 0.722
3.774ThrPhe: 3.774 ± 1.364
1.887ThrGly: 1.887 ± 0.878
0.629ThrHis: 0.629 ± 0.481
4.403ThrIle: 4.403 ± 1.302
1.887ThrLys: 1.887 ± 0.741
3.145ThrLeu: 3.145 ± 0.832
3.145ThrMet: 3.145 ± 1.572
4.403ThrAsn: 4.403 ± 1.32
3.145ThrPro: 3.145 ± 1.354
4.403ThrGln: 4.403 ± 1.322
1.258ThrArg: 1.258 ± 0.795
3.774ThrSer: 3.774 ± 0.755
2.516ThrThr: 2.516 ± 1.298
2.516ThrVal: 2.516 ± 1.298
0.0ThrTrp: 0.0 ± 0.0
4.403ThrTyr: 4.403 ± 0.746
0.0ThrXaa: 0.0 ± 0.0
Val
3.774ValAla: 3.774 ± 0.567
0.629ValCys: 0.629 ± 0.736
7.547ValAsp: 7.547 ± 2.797
2.516ValGlu: 2.516 ± 1.654
3.774ValPhe: 3.774 ± 0.961
3.145ValGly: 3.145 ± 1.04
1.258ValHis: 1.258 ± 1.102
0.629ValIle: 0.629 ± 0.822
5.031ValLys: 5.031 ± 2.237
2.516ValLeu: 2.516 ± 0.92
0.0ValMet: 0.0 ± 0.0
1.887ValAsn: 1.887 ± 1.175
3.145ValPro: 3.145 ± 0.626
1.258ValGln: 1.258 ± 0.442
1.258ValArg: 1.258 ± 0.885
6.918ValSer: 6.918 ± 4.191
2.516ValThr: 2.516 ± 1.925
5.66ValVal: 5.66 ± 2.747
0.629ValTrp: 0.629 ± 0.551
3.145ValTyr: 3.145 ± 1.5
0.0ValXaa: 0.0 ± 0.0
Trp
0.629TrpAla: 0.629 ± 0.579
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.629TrpGlu: 0.629 ± 0.481
0.629TrpPhe: 0.629 ± 0.551
0.0TrpGly: 0.0 ± 0.0
0.629TrpHis: 0.629 ± 0.481
0.629TrpIle: 0.629 ± 0.481
1.258TrpLys: 1.258 ± 1.102
0.629TrpLeu: 0.629 ± 0.551
0.0TrpMet: 0.0 ± 0.0
1.258TrpAsn: 1.258 ± 0.567
0.0TrpPro: 0.0 ± 0.0
1.887TrpGln: 1.887 ± 0.377
0.629TrpArg: 0.629 ± 0.551
0.0TrpSer: 0.0 ± 0.0
1.258TrpThr: 1.258 ± 1.102
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.403TyrAla: 4.403 ± 1.535
0.629TyrCys: 0.629 ± 0.551
3.774TyrAsp: 3.774 ± 2.096
2.516TyrGlu: 2.516 ± 0.756
3.774TyrPhe: 3.774 ± 1.862
5.66TyrGly: 5.66 ± 1.383
1.258TyrHis: 1.258 ± 0.442
2.516TyrIle: 2.516 ± 0.883
3.145TyrLys: 3.145 ± 0.816
5.031TyrLeu: 5.031 ± 2.001
0.0TyrMet: 0.0 ± 0.0
6.289TyrAsn: 6.289 ± 1.356
1.887TyrPro: 1.887 ± 0.958
5.031TyrGln: 5.031 ± 0.952
1.887TyrArg: 1.887 ± 1.654
5.66TyrSer: 5.66 ± 1.496
4.403TyrThr: 4.403 ± 1.53
3.145TyrVal: 3.145 ± 1.179
0.0TyrTrp: 0.0 ± 0.0
6.918TyrTyr: 6.918 ± 2.944
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1591 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
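One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the dipeptide frequency for each resampled set, and report the spread of those estimates. The sketch below follows that reading. It is an assumption about the procedure, not the documented implementation: the helper names are hypothetical, and the use of the population standard deviation as the error is also an assumption.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_replicates=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Each replicate draws len(sequences) proteins with replacement, so the
    resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)  # fixed seed for reproducible estimates
    per_protein = [dipeptide_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of replicates.
    return statistics.pstdev(estimates)
```

With only 5 proteins, many replicates contain the same protein several times, which is consistent with errors that are large relative to the values themselves. A dipeptide absent from every protein always yields 0.0 ± 0.0 under this scheme.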

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski