Amino acid dipepetide frequency for Capybara microvirus Cap3_SP_442

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.968AlaAla: 5.968 ± 5.539
1.326AlaCys: 1.326 ± 1.085
5.305AlaAsp: 5.305 ± 1.428
3.316AlaGlu: 3.316 ± 1.16
2.653AlaPhe: 2.653 ± 1.508
5.305AlaGly: 5.305 ± 3.253
1.989AlaHis: 1.989 ± 0.903
3.979AlaIle: 3.979 ± 1.781
5.968AlaLys: 5.968 ± 4.01
5.305AlaLeu: 5.305 ± 2.011
3.979AlaMet: 3.979 ± 3.401
3.979AlaAsn: 3.979 ± 1.502
2.653AlaPro: 2.653 ± 1.203
3.316AlaGln: 3.316 ± 2.781
3.316AlaArg: 3.316 ± 0.552
10.61AlaSer: 10.61 ± 3.433
3.979AlaThr: 3.979 ± 1.402
3.316AlaVal: 3.316 ± 1.612
0.0AlaTrp: 0.0 ± 0.0
0.663AlaTyr: 0.663 ± 0.542
0.0AlaXaa: 0.0 ± 0.0
Cys
0.663CysAla: 0.663 ± 0.438
1.326CysCys: 1.326 ± 1.085
2.653CysAsp: 2.653 ± 1.786
0.663CysGlu: 0.663 ± 0.542
0.0CysPhe: 0.0 ± 0.0
0.663CysGly: 0.663 ± 0.542
0.0CysHis: 0.0 ± 0.0
0.663CysIle: 0.663 ± 0.542
1.326CysLys: 1.326 ± 0.88
0.663CysLeu: 0.663 ± 0.438
0.0CysMet: 0.0 ± 0.0
0.663CysAsn: 0.663 ± 0.542
0.663CysPro: 0.663 ± 0.438
0.663CysGln: 0.663 ± 0.438
0.663CysArg: 0.663 ± 0.542
1.326CysSer: 1.326 ± 0.88
0.0CysThr: 0.0 ± 0.0
1.326CysVal: 1.326 ± 0.805
0.0CysTrp: 0.0 ± 0.0
2.653CysTyr: 2.653 ± 1.074
0.0CysXaa: 0.0 ± 0.0
Asp
7.958AspAla: 7.958 ± 4.353
0.663AspCys: 0.663 ± 0.542
3.979AspAsp: 3.979 ± 2.08
2.653AspGlu: 2.653 ± 1.61
3.316AspPhe: 3.316 ± 1.448
1.989AspGly: 1.989 ± 0.824
1.326AspHis: 1.326 ± 0.805
3.979AspIle: 3.979 ± 1.59
3.979AspLys: 3.979 ± 2.289
5.305AspLeu: 5.305 ± 1.638
1.989AspMet: 1.989 ± 0.914
5.968AspAsn: 5.968 ± 1.687
3.979AspPro: 3.979 ± 2.106
3.316AspGln: 3.316 ± 0.6
3.316AspArg: 3.316 ± 1.16
4.642AspSer: 4.642 ± 1.551
0.663AspThr: 0.663 ± 0.438
2.653AspVal: 2.653 ± 1.61
0.663AspTrp: 0.663 ± 0.542
1.989AspTyr: 1.989 ± 0.741
0.0AspXaa: 0.0 ± 0.0
Glu
3.979GluAla: 3.979 ± 1.852
0.663GluCys: 0.663 ± 0.809
2.653GluAsp: 2.653 ± 2.452
1.326GluGlu: 1.326 ± 1.027
3.979GluPhe: 3.979 ± 1.628
0.663GluGly: 0.663 ± 0.705
1.326GluHis: 1.326 ± 0.543
1.326GluIle: 1.326 ± 0.877
1.326GluLys: 1.326 ± 1.085
4.642GluLeu: 4.642 ± 1.15
0.663GluMet: 0.663 ± 0.83
3.979GluAsn: 3.979 ± 1.281
1.989GluPro: 1.989 ± 0.839
2.653GluGln: 2.653 ± 1.203
4.642GluArg: 4.642 ± 1.903
1.326GluSer: 1.326 ± 0.543
0.663GluThr: 0.663 ± 0.542
5.305GluVal: 5.305 ± 1.725
0.663GluTrp: 0.663 ± 0.438
3.979GluTyr: 3.979 ± 1.869
0.0GluXaa: 0.0 ± 0.0
Phe
3.979PheAla: 3.979 ± 0.604
0.663PheCys: 0.663 ± 0.83
5.968PheAsp: 5.968 ± 1.973
1.326PheGlu: 1.326 ± 1.085
4.642PhePhe: 4.642 ± 2.834
3.979PheGly: 3.979 ± 0.934
0.663PheHis: 0.663 ± 0.542
3.979PheIle: 3.979 ± 1.473
3.979PheLys: 3.979 ± 1.535
3.979PheLeu: 3.979 ± 1.417
1.989PheMet: 1.989 ± 0.808
2.653PheAsn: 2.653 ± 1.311
0.663PhePro: 0.663 ± 0.438
1.326PheGln: 1.326 ± 0.928
3.316PheArg: 3.316 ± 1.574
4.642PheSer: 4.642 ± 1.292
2.653PheThr: 2.653 ± 1.203
1.989PheVal: 1.989 ± 1.479
0.663PheTrp: 0.663 ± 0.438
3.316PheTyr: 3.316 ± 1.369
0.0PheXaa: 0.0 ± 0.0
Gly
3.316GlyAla: 3.316 ± 1.988
0.663GlyCys: 0.663 ± 0.542
2.653GlyAsp: 2.653 ± 0.788
5.305GlyGlu: 5.305 ± 0.936
1.989GlyPhe: 1.989 ± 0.824
3.316GlyGly: 3.316 ± 2.191
0.663GlyHis: 0.663 ± 0.438
1.989GlyIle: 1.989 ± 1.323
3.316GlyLys: 3.316 ± 1.034
7.294GlyLeu: 7.294 ± 1.388
2.653GlyMet: 2.653 ± 1.377
3.316GlyAsn: 3.316 ± 2.017
0.663GlyPro: 0.663 ± 0.438
3.979GlyGln: 3.979 ± 1.298
1.989GlyArg: 1.989 ± 0.737
5.968GlySer: 5.968 ± 1.681
1.989GlyThr: 1.989 ± 1.323
3.979GlyVal: 3.979 ± 1.546
0.663GlyTrp: 0.663 ± 0.705
3.979GlyTyr: 3.979 ± 1.421
0.0GlyXaa: 0.0 ± 0.0
His
0.663HisAla: 0.663 ± 0.542
0.0HisCys: 0.0 ± 0.0
1.326HisAsp: 1.326 ± 1.027
0.0HisGlu: 0.0 ± 0.0
0.663HisPhe: 0.663 ± 0.438
3.316HisGly: 3.316 ± 1.683
0.0HisHis: 0.0 ± 0.0
3.316HisIle: 3.316 ± 2.033
0.0HisLys: 0.0 ± 0.0
1.326HisLeu: 1.326 ± 0.877
0.663HisMet: 0.663 ± 0.809
0.663HisAsn: 0.663 ± 0.542
0.663HisPro: 0.663 ± 0.542
0.663HisGln: 0.663 ± 0.83
0.663HisArg: 0.663 ± 0.542
0.0HisSer: 0.0 ± 0.0
1.326HisThr: 1.326 ± 0.543
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.326HisTyr: 1.326 ± 0.543
0.0HisXaa: 0.0 ± 0.0
Ile
1.989IleAla: 1.989 ± 2.489
1.989IleCys: 1.989 ± 0.824
2.653IleAsp: 2.653 ± 1.257
1.989IleGlu: 1.989 ± 0.993
0.663IlePhe: 0.663 ± 0.705
5.305IleGly: 5.305 ± 1.706
0.0IleHis: 0.0 ± 0.0
1.989IleIle: 1.989 ± 0.741
3.316IleLys: 3.316 ± 1.657
1.989IleLeu: 1.989 ± 1.627
0.663IleMet: 0.663 ± 0.705
5.968IleAsn: 5.968 ± 1.681
1.989IlePro: 1.989 ± 0.824
0.663IleGln: 0.663 ± 0.438
1.989IleArg: 1.989 ± 1.235
1.326IleSer: 1.326 ± 0.88
3.316IleThr: 3.316 ± 0.885
1.989IleVal: 1.989 ± 0.737
0.663IleTrp: 0.663 ± 0.438
2.653IleTyr: 2.653 ± 1.085
0.0IleXaa: 0.0 ± 0.0
Lys
5.305LysAla: 5.305 ± 4.8
0.663LysCys: 0.663 ± 0.542
3.979LysAsp: 3.979 ± 2.589
1.989LysGlu: 1.989 ± 0.903
1.989LysPhe: 1.989 ± 1.015
2.653LysGly: 2.653 ± 0.718
1.326LysHis: 1.326 ± 1.085
2.653LysIle: 2.653 ± 0.788
5.305LysLys: 5.305 ± 1.212
8.621LysLeu: 8.621 ± 2.465
1.989LysMet: 1.989 ± 1.535
2.653LysAsn: 2.653 ± 2.053
1.326LysPro: 1.326 ± 0.877
5.968LysGln: 5.968 ± 2.456
1.989LysArg: 1.989 ± 1.627
4.642LysSer: 4.642 ± 0.703
1.326LysThr: 1.326 ± 0.543
1.989LysVal: 1.989 ± 0.741
0.0LysTrp: 0.0 ± 0.0
2.653LysTyr: 2.653 ± 0.709
0.0LysXaa: 0.0 ± 0.0
Leu
5.968LeuAla: 5.968 ± 3.256
0.0LeuCys: 0.0 ± 0.0
6.631LeuAsp: 6.631 ± 1.927
5.968LeuGlu: 5.968 ± 3.509
2.653LeuPhe: 2.653 ± 1.61
5.968LeuGly: 5.968 ± 1.773
0.663LeuHis: 0.663 ± 0.542
5.968LeuIle: 5.968 ± 1.851
3.316LeuLys: 3.316 ± 1.514
7.294LeuLeu: 7.294 ± 2.219
0.0LeuMet: 0.0 ± 0.0
5.968LeuAsn: 5.968 ± 2.488
5.305LeuPro: 5.305 ± 1.738
3.979LeuGln: 3.979 ± 1.07
4.642LeuArg: 4.642 ± 1.346
10.61LeuSer: 10.61 ± 1.808
4.642LeuThr: 4.642 ± 1.545
7.294LeuVal: 7.294 ± 1.291
0.663LeuTrp: 0.663 ± 0.542
3.316LeuTyr: 3.316 ± 1.034
0.0LeuXaa: 0.0 ± 0.0
Met
2.653MetAla: 2.653 ± 1.074
0.663MetCys: 0.663 ± 0.83
1.326MetAsp: 1.326 ± 0.731
0.663MetGlu: 0.663 ± 0.542
0.663MetPhe: 0.663 ± 0.809
1.989MetGly: 1.989 ± 0.914
0.0MetHis: 0.0 ± 0.0
0.663MetIle: 0.663 ± 0.83
2.653MetLys: 2.653 ± 2.019
0.663MetLeu: 0.663 ± 0.705
0.663MetMet: 0.663 ± 0.438
2.653MetAsn: 2.653 ± 1.18
1.326MetPro: 1.326 ± 0.877
0.0MetGln: 0.0 ± 0.0
3.979MetArg: 3.979 ± 1.072
1.989MetSer: 1.989 ± 1.015
1.326MetThr: 1.326 ± 1.027
1.989MetVal: 1.989 ± 1.315
0.0MetTrp: 0.0 ± 0.0
0.663MetTyr: 0.663 ± 0.438
0.0MetXaa: 0.0 ± 0.0
Asn
3.979AsnAla: 3.979 ± 2.646
0.663AsnCys: 0.663 ± 0.83
2.653AsnAsp: 2.653 ± 1.311
3.979AsnGlu: 3.979 ± 1.209
4.642AsnPhe: 4.642 ± 2.822
1.326AsnGly: 1.326 ± 1.41
0.0AsnHis: 0.0 ± 0.0
2.653AsnIle: 2.653 ± 1.693
3.979AsnLys: 3.979 ± 1.869
7.958AsnLeu: 7.958 ± 3.171
0.663AsnMet: 0.663 ± 0.809
3.979AsnAsn: 3.979 ± 2.029
5.305AsnPro: 5.305 ± 1.418
1.326AsnGln: 1.326 ± 0.877
2.653AsnArg: 2.653 ± 0.788
5.968AsnSer: 5.968 ± 3.081
3.979AsnThr: 3.979 ± 1.23
2.653AsnVal: 2.653 ± 1.013
0.663AsnTrp: 0.663 ± 0.438
1.989AsnTyr: 1.989 ± 1.101
0.0AsnXaa: 0.0 ± 0.0
Pro
2.653ProAla: 2.653 ± 1.203
0.663ProCys: 0.663 ± 0.542
3.316ProAsp: 3.316 ± 2.191
3.316ProGlu: 3.316 ± 0.993
3.316ProPhe: 3.316 ± 2.191
1.989ProGly: 1.989 ± 1.315
0.663ProHis: 0.663 ± 0.542
1.326ProIle: 1.326 ± 0.543
4.642ProLys: 4.642 ± 1.201
5.305ProLeu: 5.305 ± 0.808
1.989ProMet: 1.989 ± 0.894
0.663ProAsn: 0.663 ± 0.705
0.663ProPro: 0.663 ± 0.438
3.316ProGln: 3.316 ± 1.016
1.989ProArg: 1.989 ± 0.993
2.653ProSer: 2.653 ± 0.99
1.989ProThr: 1.989 ± 0.894
1.989ProVal: 1.989 ± 1.315
0.0ProTrp: 0.0 ± 0.0
0.663ProTyr: 0.663 ± 0.542
0.0ProXaa: 0.0 ± 0.0
Gln
5.305GlnAla: 5.305 ± 1.548
0.663GlnCys: 0.663 ± 0.809
3.316GlnAsp: 3.316 ± 1.038
3.316GlnGlu: 3.316 ± 1.034
0.663GlnPhe: 0.663 ± 0.542
2.653GlnGly: 2.653 ± 1.377
0.0GlnHis: 0.0 ± 0.0
0.663GlnIle: 0.663 ± 0.705
1.326GlnLys: 1.326 ± 0.689
3.979GlnLeu: 3.979 ± 0.576
2.653GlnMet: 2.653 ± 1.349
3.316GlnAsn: 3.316 ± 0.906
1.326GlnPro: 1.326 ± 0.805
2.653GlnGln: 2.653 ± 2.269
3.979GlnArg: 3.979 ± 0.775
6.631GlnSer: 6.631 ± 0.707
0.663GlnThr: 0.663 ± 0.438
3.979GlnVal: 3.979 ± 2.225
0.663GlnTrp: 0.663 ± 0.438
0.663GlnTyr: 0.663 ± 0.809
0.0GlnXaa: 0.0 ± 0.0
Arg
1.326ArgAla: 1.326 ± 0.689
1.326ArgCys: 1.326 ± 0.805
4.642ArgAsp: 4.642 ± 1.027
3.316ArgGlu: 3.316 ± 1.016
5.305ArgPhe: 5.305 ± 1.446
1.326ArgGly: 1.326 ± 0.877
0.663ArgHis: 0.663 ± 0.438
1.326ArgIle: 1.326 ± 0.543
0.0ArgLys: 0.0 ± 0.0
6.631ArgLeu: 6.631 ± 3.01
1.989ArgMet: 1.989 ± 0.824
1.326ArgAsn: 1.326 ± 1.085
3.316ArgPro: 3.316 ± 1.612
1.326ArgGln: 1.326 ± 0.805
1.989ArgArg: 1.989 ± 1.235
6.631ArgSer: 6.631 ± 1.962
0.0ArgThr: 0.0 ± 0.0
3.979ArgVal: 3.979 ± 1.012
0.0ArgTrp: 0.0 ± 0.0
5.968ArgTyr: 5.968 ± 1.527
0.0ArgXaa: 0.0 ± 0.0
Ser
9.284SerAla: 9.284 ± 2.684
1.989SerCys: 1.989 ± 0.737
3.979SerAsp: 3.979 ± 2.432
3.316SerGlu: 3.316 ± 1.329
9.284SerPhe: 9.284 ± 1.647
7.294SerGly: 7.294 ± 2.298
2.653SerHis: 2.653 ± 1.203
1.989SerIle: 1.989 ± 0.824
5.305SerLys: 5.305 ± 1.446
6.631SerLeu: 6.631 ± 2.435
0.0SerMet: 0.0 ± 0.514
3.316SerAsn: 3.316 ± 0.787
3.316SerPro: 3.316 ± 0.993
5.968SerGln: 5.968 ± 2.657
3.979SerArg: 3.979 ± 1.819
7.294SerSer: 7.294 ± 1.215
4.642SerThr: 4.642 ± 1.815
7.958SerVal: 7.958 ± 2.528
0.663SerTrp: 0.663 ± 0.83
1.326SerTyr: 1.326 ± 0.689
0.0SerXaa: 0.0 ± 0.0
Thr
5.968ThrAla: 5.968 ± 1.798
0.663ThrCys: 0.663 ± 0.438
1.989ThrAsp: 1.989 ± 1.917
1.326ThrGlu: 1.326 ± 0.877
1.326ThrPhe: 1.326 ± 0.543
1.989ThrGly: 1.989 ± 0.588
0.663ThrHis: 0.663 ± 0.438
0.663ThrIle: 0.663 ± 0.438
3.979ThrLys: 3.979 ± 1.045
2.653ThrLeu: 2.653 ± 0.718
0.0ThrMet: 0.0 ± 0.0
1.326ThrAsn: 1.326 ± 0.749
3.316ThrPro: 3.316 ± 1.325
1.326ThrGln: 1.326 ± 0.731
2.653ThrArg: 2.653 ± 1.085
5.305ThrSer: 5.305 ± 1.881
1.326ThrThr: 1.326 ± 0.877
0.663ThrVal: 0.663 ± 0.809
0.0ThrTrp: 0.0 ± 0.0
1.989ThrTyr: 1.989 ± 0.824
0.0ThrXaa: 0.0 ± 0.0
Val
3.979ValAla: 3.979 ± 1.209
0.0ValCys: 0.0 ± 0.0
3.316ValAsp: 3.316 ± 1.089
0.663ValGlu: 0.663 ± 0.542
3.979ValPhe: 3.979 ± 2.415
4.642ValGly: 4.642 ± 1.007
1.326ValHis: 1.326 ± 0.543
2.653ValIle: 2.653 ± 0.833
2.653ValLys: 2.653 ± 1.761
7.958ValLeu: 7.958 ± 2.5
1.989ValMet: 1.989 ± 0.894
5.968ValAsn: 5.968 ± 1.847
3.979ValPro: 3.979 ± 1.42
2.653ValGln: 2.653 ± 1.786
1.326ValArg: 1.326 ± 0.877
3.316ValSer: 3.316 ± 0.935
1.989ValThr: 1.989 ± 0.824
2.653ValVal: 2.653 ± 3.319
1.326ValTrp: 1.326 ± 0.543
1.326ValTyr: 1.326 ± 0.805
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.663TrpGlu: 0.663 ± 0.438
1.989TrpPhe: 1.989 ± 0.996
0.663TrpGly: 0.663 ± 0.438
0.663TrpHis: 0.663 ± 0.438
0.0TrpIle: 0.0 ± 0.0
0.663TrpLys: 0.663 ± 0.705
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.326TrpAsn: 1.326 ± 0.543
0.663TrpPro: 0.663 ± 0.438
0.663TrpGln: 0.663 ± 0.542
0.0TrpArg: 0.0 ± 0.0
0.663TrpSer: 0.663 ± 0.542
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.989TyrAla: 1.989 ± 0.894
1.989TyrCys: 1.989 ± 0.824
1.989TyrAsp: 1.989 ± 0.993
2.653TyrGlu: 2.653 ± 0.709
2.653TyrPhe: 2.653 ± 1.085
2.653TyrGly: 2.653 ± 1.18
1.989TyrHis: 1.989 ± 1.287
1.326TyrIle: 1.326 ± 0.543
1.989TyrLys: 1.989 ± 1.315
2.653TyrLeu: 2.653 ± 1.744
1.326TyrMet: 1.326 ± 0.543
1.326TyrAsn: 1.326 ± 0.689
0.0TyrPro: 0.0 ± 0.0
2.653TyrGln: 2.653 ± 0.718
3.316TyrArg: 3.316 ± 1.016
5.305TyrSer: 5.305 ± 1.418
2.653TyrThr: 2.653 ± 1.074
1.989TyrVal: 1.989 ± 0.737
0.663TyrTrp: 0.663 ± 0.438
1.326TyrTyr: 1.326 ± 0.543
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1509 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski