Amino acid dipeptide frequency for Capybara microvirus Cap1_SP_223

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
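The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function names, the one-letter input alphabet, the mapping of every non-standard letter to "X" (Xaa), the counting of overlapping pairs within each protein only, and the use of the uniform 2.5‰ baseline for the over/under classification are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard residues in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")
# Uniform expectation for 400 dipeptides: 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes Xaa ("X").
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Assumption: overlapping pairs, counted within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_per_mille):
    """Classify a frequency against the uniform 2.5 per mille baseline."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"
```

Under this baseline, a table value such as AlaSer (10.844‰) would be classified as "over" and CysPhe (0.775‰) as "under".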

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.648 ± 3.46
AlaCys: 0.775 ± 0.716
AlaAsp: 4.648 ± 1.771
AlaGlu: 3.098 ± 2.135
AlaPhe: 3.873 ± 1.761
AlaGly: 5.422 ± 1.55
AlaHis: 3.098 ± 1.33
AlaIle: 0.0 ± 0.0
AlaLys: 0.0 ± 0.0
AlaLeu: 5.422 ± 0.711
AlaMet: 2.324 ± 1.447
AlaAsn: 6.971 ± 2.027
AlaPro: 4.648 ± 1.016
AlaGln: 2.324 ± 1.132
AlaArg: 3.873 ± 2.109
AlaSer: 10.844 ± 5.131
AlaThr: 2.324 ± 1.194
AlaVal: 3.873 ± 1.43
AlaTrp: 2.324 ± 0.891
AlaTyr: 3.873 ± 2.482
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.549 ± 1.972
CysPhe: 0.775 ± 0.716
CysGly: 1.549 ± 1.432
CysHis: 0.775 ± 0.986
CysIle: 1.549 ± 0.556
CysLys: 0.0 ± 0.0
CysLeu: 4.648 ± 2.342
CysMet: 0.0 ± 0.0
CysAsn: 0.775 ± 0.986
CysPro: 1.549 ± 1.045
CysGln: 0.775 ± 0.716
CysArg: 1.549 ± 0.556
CysSer: 1.549 ± 0.556
CysThr: 0.0 ± 0.0
CysVal: 0.775 ± 0.986
CysTrp: 0.775 ± 0.716
CysTyr: 0.775 ± 0.523
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.098 ± 1.523
AspCys: 0.775 ± 0.523
AspAsp: 5.422 ± 2.376
AspGlu: 2.324 ± 1.778
AspPhe: 3.873 ± 0.88
AspGly: 2.324 ± 1.613
AspHis: 0.775 ± 0.523
AspIle: 6.197 ± 1.682
AspLys: 2.324 ± 1.717
AspLeu: 4.648 ± 1.771
AspMet: 0.775 ± 0.523
AspAsn: 2.324 ± 0.824
AspPro: 1.549 ± 1.021
AspGln: 6.197 ± 1.792
AspArg: 3.098 ± 1.555
AspSer: 5.422 ± 2.076
AspThr: 3.098 ± 1.679
AspVal: 0.775 ± 0.523
AspTrp: 2.324 ± 0.891
AspTyr: 4.648 ± 2.999
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.422 ± 2.227
GluCys: 0.775 ± 0.523
GluAsp: 1.549 ± 1.009
GluGlu: 1.549 ± 1.045
GluPhe: 2.324 ± 1.516
GluGly: 0.775 ± 0.523
GluHis: 0.775 ± 0.523
GluIle: 3.873 ± 1.443
GluLys: 1.549 ± 1.009
GluLeu: 3.873 ± 2.618
GluMet: 1.549 ± 1.167
GluAsn: 1.549 ± 0.605
GluPro: 0.775 ± 0.836
GluGln: 3.098 ± 1.209
GluArg: 3.873 ± 1.823
GluSer: 3.098 ± 1.891
GluThr: 3.098 ± 1.547
GluVal: 5.422 ± 3.299
GluTrp: 0.0 ± 0.0
GluTyr: 3.873 ± 1.443
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.873 ± 1.24
PheCys: 0.775 ± 0.986
PheAsp: 3.098 ± 0.876
PheGlu: 0.775 ± 0.716
PhePhe: 4.648 ± 0.898
PheGly: 4.648 ± 0.898
PheHis: 0.0 ± 0.0
PheIle: 2.324 ± 0.808
PheLys: 2.324 ± 1.171
PheLeu: 4.648 ± 0.813
PheMet: 0.0 ± 0.0
PheAsn: 1.549 ± 1.391
PhePro: 0.0 ± 0.0
PheGln: 0.775 ± 0.523
PheArg: 4.648 ± 1.631
PheSer: 6.197 ± 1.228
PheThr: 3.873 ± 0.859
PheVal: 3.873 ± 2.614
PheTrp: 0.775 ± 0.523
PheTyr: 1.549 ± 0.847
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.098 ± 2.154
GlyCys: 2.324 ± 1.104
GlyAsp: 6.197 ± 2.085
GlyGlu: 3.098 ± 1.165
GlyPhe: 2.324 ± 1.171
GlyGly: 1.549 ± 0.605
GlyHis: 0.775 ± 0.523
GlyIle: 3.098 ± 2.207
GlyLys: 4.648 ± 1.631
GlyLeu: 5.422 ± 1.122
GlyMet: 1.549 ± 0.895
GlyAsn: 2.324 ± 0.891
GlyPro: 3.873 ± 0.476
GlyGln: 3.098 ± 0.876
GlyArg: 1.549 ± 0.556
GlySer: 6.971 ± 1.028
GlyThr: 3.873 ± 0.88
GlyVal: 4.648 ± 1.349
GlyTrp: 0.775 ± 0.716
GlyTyr: 3.873 ± 2.013
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.775 ± 0.523
HisGlu: 0.775 ± 0.986
HisPhe: 0.775 ± 0.523
HisGly: 3.098 ± 2.091
HisHis: 0.0 ± 0.0
HisIle: 0.775 ± 0.523
HisLys: 0.0 ± 0.0
HisLeu: 1.549 ± 0.96
HisMet: 0.0 ± 0.0
HisAsn: 0.775 ± 0.523
HisPro: 1.549 ± 1.275
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 2.324 ± 0.571
HisThr: 0.775 ± 0.716
HisVal: 0.775 ± 0.523
HisTrp: 0.0 ± 0.0
HisTyr: 2.324 ± 0.808
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.775 ± 0.696
IleCys: 0.775 ± 0.986
IleAsp: 3.098 ± 2.408
IleGlu: 2.324 ± 1.171
IlePhe: 3.098 ± 1.547
IleGly: 5.422 ± 1.888
IleHis: 0.0 ± 0.0
IleIle: 2.324 ± 0.803
IleLys: 1.549 ± 0.605
IleLeu: 3.098 ± 1.145
IleMet: 0.775 ± 0.523
IleAsn: 2.324 ± 2.148
IlePro: 2.324 ± 0.808
IleGln: 2.324 ± 1.568
IleArg: 5.422 ± 1.288
IleSer: 5.422 ± 1.423
IleThr: 4.648 ± 1.178
IleVal: 2.324 ± 0.824
IleTrp: 3.098 ± 0.526
IleTyr: 1.549 ± 0.847
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.422 ± 2.163
LysCys: 0.775 ± 0.716
LysAsp: 1.549 ± 1.009
LysGlu: 2.324 ± 1.778
LysPhe: 0.775 ± 0.523
LysGly: 2.324 ± 1.171
LysHis: 0.0 ± 0.0
LysIle: 2.324 ± 2.148
LysLys: 0.775 ± 0.716
LysLeu: 2.324 ± 1.544
LysMet: 0.775 ± 0.523
LysAsn: 3.098 ± 1.165
LysPro: 1.549 ± 0.556
LysGln: 1.549 ± 1.009
LysArg: 2.324 ± 1.175
LysSer: 2.324 ± 0.891
LysThr: 2.324 ± 0.808
LysVal: 3.098 ± 1.242
LysTrp: 0.0 ± 0.0
LysTyr: 3.098 ± 2.565
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.197 ± 1.148
LeuCys: 0.0 ± 0.0
LeuAsp: 3.098 ± 1.291
LeuGlu: 4.648 ± 1.631
LeuPhe: 2.324 ± 0.943
LeuGly: 5.422 ± 1.463
LeuHis: 0.775 ± 0.523
LeuIle: 2.324 ± 0.571
LeuLys: 4.648 ± 1.31
LeuLeu: 3.873 ± 1.201
LeuMet: 0.0 ± 0.0
LeuAsn: 4.648 ± 1.771
LeuPro: 2.324 ± 1.132
LeuGln: 3.873 ± 0.933
LeuArg: 4.648 ± 2.302
LeuSer: 9.295 ± 2.309
LeuThr: 2.324 ± 1.544
LeuVal: 6.197 ± 0.622
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.098 ± 1.145
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.549 ± 1.009
MetCys: 0.775 ± 0.523
MetAsp: 1.549 ± 0.847
MetGlu: 0.0 ± 0.0
MetPhe: 0.775 ± 0.523
MetGly: 2.324 ± 1.568
MetHis: 0.775 ± 0.523
MetIle: 0.775 ± 0.986
MetLys: 1.549 ± 0.556
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 3.873 ± 2.013
MetGln: 0.775 ± 0.523
MetArg: 0.0 ± 0.0
MetSer: 3.098 ± 0.526
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.775 ± 0.696
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.422 ± 3.167
AsnCys: 0.775 ± 0.716
AsnAsp: 3.098 ± 1.057
AsnGlu: 4.648 ± 1.439
AsnPhe: 1.549 ± 0.605
AsnGly: 3.873 ± 0.887
AsnHis: 1.549 ± 1.101
AsnIle: 0.775 ± 0.523
AsnLys: 2.324 ± 0.571
AsnLeu: 2.324 ± 0.891
AsnMet: 0.775 ± 0.523
AsnAsn: 4.648 ± 1.814
AsnPro: 0.775 ± 0.696
AsnGln: 0.775 ± 0.836
AsnArg: 5.422 ± 1.465
AsnSer: 3.873 ± 0.933
AsnThr: 4.648 ± 3.111
AsnVal: 5.422 ± 2.149
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.775 ± 0.523
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.098 ± 1.242
ProCys: 1.549 ± 1.275
ProAsp: 2.324 ± 1.104
ProGlu: 3.098 ± 0.904
ProPhe: 3.873 ± 1.443
ProGly: 3.098 ± 1.242
ProHis: 1.549 ± 0.556
ProIle: 1.549 ± 0.847
ProLys: 2.324 ± 0.824
ProLeu: 2.324 ± 0.571
ProMet: 2.324 ± 1.568
ProAsn: 2.324 ± 1.568
ProPro: 1.549 ± 0.556
ProGln: 1.549 ± 1.021
ProArg: 1.549 ± 1.067
ProSer: 1.549 ± 0.605
ProThr: 1.549 ± 1.045
ProVal: 5.422 ± 1.58
ProTrp: 0.0 ± 0.0
ProTyr: 1.549 ± 0.605
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.648 ± 2.388
GlnCys: 0.0 ± 0.0
GlnAsp: 2.324 ± 0.571
GlnGlu: 3.098 ± 2.135
GlnPhe: 0.775 ± 0.523
GlnGly: 3.098 ± 1.145
GlnHis: 0.0 ± 0.0
GlnIle: 3.098 ± 1.291
GlnLys: 0.775 ± 0.523
GlnLeu: 3.098 ± 1.33
GlnMet: 0.775 ± 0.523
GlnAsn: 5.422 ± 1.761
GlnPro: 0.775 ± 0.523
GlnGln: 6.971 ± 1.359
GlnArg: 3.873 ± 0.88
GlnSer: 3.098 ± 1.209
GlnThr: 2.324 ± 1.132
GlnVal: 2.324 ± 0.891
GlnTrp: 1.549 ± 0.556
GlnTyr: 1.549 ± 1.045
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.648 ± 0.809
ArgCys: 3.098 ± 1.859
ArgAsp: 2.324 ± 1.198
ArgGlu: 1.549 ± 0.605
ArgPhe: 3.873 ± 1.496
ArgGly: 1.549 ± 0.605
ArgHis: 0.0 ± 0.0
ArgIle: 3.873 ± 0.88
ArgLys: 5.422 ± 2.828
ArgLeu: 2.324 ± 1.171
ArgMet: 2.324 ± 0.891
ArgAsn: 0.775 ± 0.696
ArgPro: 1.549 ± 1.432
ArgGln: 3.873 ± 0.887
ArgArg: 4.648 ± 2.075
ArgSer: 4.648 ± 2.342
ArgThr: 2.324 ± 1.198
ArgVal: 3.873 ± 0.476
ArgTrp: 0.775 ± 0.523
ArgTyr: 4.648 ± 1.668
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.295 ± 4.01
SerCys: 2.324 ± 1.104
SerAsp: 5.422 ± 1.928
SerGlu: 5.422 ± 1.465
SerPhe: 6.971 ± 1.265
SerGly: 5.422 ± 1.686
SerHis: 1.549 ± 0.556
SerIle: 7.746 ± 1.136
SerLys: 5.422 ± 2.79
SerLeu: 6.971 ± 1.547
SerMet: 0.0 ± 0.0
SerAsn: 4.648 ± 2.388
SerPro: 4.648 ± 1.514
SerGln: 3.873 ± 0.933
SerArg: 6.971 ± 2.961
SerSer: 10.844 ± 3.33
SerThr: 5.422 ± 1.276
SerVal: 6.971 ± 2.26
SerTrp: 0.775 ± 0.696
SerTyr: 3.098 ± 1.145
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.197 ± 2.612
ThrCys: 1.549 ± 1.432
ThrAsp: 3.098 ± 1.145
ThrGlu: 0.0 ± 0.0
ThrPhe: 4.648 ± 1.698
ThrGly: 5.422 ± 1.122
ThrHis: 0.775 ± 0.716
ThrIle: 3.098 ± 1.242
ThrLys: 0.775 ± 0.523
ThrLeu: 3.098 ± 0.526
ThrMet: 0.0 ± 0.0
ThrAsn: 1.549 ± 0.605
ThrPro: 2.324 ± 1.568
ThrGln: 1.549 ± 0.605
ThrArg: 0.0 ± 0.0
ThrSer: 7.746 ± 1.605
ThrThr: 1.549 ± 1.045
ThrVal: 4.648 ± 1.674
ThrTrp: 0.775 ± 0.986
ThrTyr: 0.775 ± 0.986
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.873 ± 0.476
ValCys: 0.775 ± 0.523
ValAsp: 6.197 ± 1.578
ValGlu: 3.873 ± 1.496
ValPhe: 0.775 ± 0.716
ValGly: 6.197 ± 1.878
ValHis: 0.0 ± 0.0
ValIle: 1.549 ± 0.847
ValLys: 2.324 ± 0.808
ValLeu: 4.648 ± 1.641
ValMet: 2.324 ± 1.132
ValAsn: 2.324 ± 1.288
ValPro: 5.422 ± 1.173
ValGln: 0.775 ± 0.523
ValArg: 0.775 ± 0.523
ValSer: 11.619 ± 2.291
ValThr: 5.422 ± 1.122
ValVal: 3.098 ± 0.728
ValTrp: 0.775 ± 0.696
ValTyr: 2.324 ± 0.808
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.775 ± 0.523
TrpCys: 0.0 ± 0.0
TrpAsp: 0.775 ± 0.696
TrpGlu: 1.549 ± 0.605
TrpPhe: 0.775 ± 0.523
TrpGly: 0.0 ± 0.0
TrpHis: 2.324 ± 0.808
TrpIle: 0.775 ± 0.523
TrpLys: 0.0 ± 0.0
TrpLeu: 1.549 ± 1.101
TrpMet: 0.0 ± 0.0
TrpAsn: 0.775 ± 0.523
TrpPro: 0.0 ± 0.0
TrpGln: 2.324 ± 1.516
TrpArg: 0.775 ± 0.716
TrpSer: 1.549 ± 0.556
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.775 ± 0.523
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.324 ± 0.808
TyrCys: 1.549 ± 0.556
TyrAsp: 5.422 ± 2.149
TyrGlu: 3.098 ± 1.891
TyrPhe: 1.549 ± 0.556
TyrGly: 2.324 ± 1.778
TyrHis: 0.775 ± 0.716
TyrIle: 4.648 ± 2.45
TyrLys: 0.0 ± 0.0
TyrLeu: 3.873 ± 1.24
TyrMet: 1.549 ± 1.045
TyrAsn: 4.648 ± 1.559
TyrPro: 3.098 ± 1.291
TyrGln: 3.098 ± 1.145
TyrArg: 3.098 ± 0.876
TyrSer: 2.324 ± 0.803
TyrThr: 0.0 ± 0.0
TyrVal: 1.549 ± 0.556
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.775 ± 0.716
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1292 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level
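A protein-level bootstrap of this kind might look like the sketch below. The exact procedure used by the database is not described here, so the details are assumptions: whole proteins are resampled with replacement, dipeptide frequencies are recomputed for each of the 100 replicates, and the sample standard deviation across replicates is reported as the error. The function names and the fixed seed are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Frequencies (per mille) of overlapping adjacent pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Estimate a per-dipeptide error by resampling whole proteins.

    Assumption: the error is the sample standard deviation of the
    frequency across replicates; n_rounds must be at least 2.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        resampled = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_per_mille(resampled))
    # A dipeptide missing from a replicate counts as frequency 0 there.
    pairs = set().union(*replicates)
    return {
        pair: statistics.stdev([rep.get(pair, 0.0) for rep in replicates])
        for pair in pairs
    }
```

With only 5 proteins, as in this proteome, replicates can differ substantially from one another, which is consistent with errors that are large relative to the values they accompany.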

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski