Amino acid dipeptide frequency for Capuchin monkey hepatitis B virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
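The calculation described above can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: it assumes one-letter sequences, counts overlapping pairs within each protein only, and maps any non-standard letter to X (Xaa). The names `dipeptide_permille`, `classify`, and `EXPECTED_PERMILLE`, as well as the toy sequences, are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
EXPECTED_PERMILLE = 1000.0 / 400    # uniform expectation: 2.5 per mille = 0.25 %


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq > EXPECTED_PERMILLE:
        return "over"
    if freq < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Toy input (made up for illustration, not viral sequences).
freqs = dipeptide_permille(["AAAC", "LL"])
print(freqs["AA"], classify(freqs["AA"]))
print(freqs["CA"], classify(freqs["CA"]))
```

With real data, the input would be the protein sequences of the proteome; whether pairs across protein boundaries are counted is a design choice that this sketch resolves by excluding them.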

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
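As a worked conversion, taking the AlaAla entry of the table (4.771‰) and the uniform expectation of one in 400 dipeptides:

```latex
4.771\ \text{per mille} = 4.771 \times 10^{-3} = 0.004771 = 0.4771\,\%
\qquad
\frac{1}{400} = 0.0025 = 2.5\ \text{per mille} = 0.25\,\%
```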

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.771 ± 2.681
AlaCys: 2.863 ± 0.493
AlaAsp: 2.385 ± 1.128
AlaGlu: 1.431 ± 0.721
AlaPhe: 5.248 ± 1.0
AlaGly: 1.431 ± 0.454
AlaHis: 0.954 ± 0.399
AlaIle: 1.431 ± 0.952
AlaLys: 1.908 ± 1.27
AlaLeu: 5.725 ± 2.325
AlaMet: 0.954 ± 0.675
AlaAsn: 1.431 ± 0.573
AlaPro: 5.725 ± 2.156
AlaGln: 1.431 ± 0.573
AlaArg: 8.111 ± 2.327
AlaSer: 5.725 ± 2.382
AlaThr: 2.863 ± 0.549
AlaVal: 2.863 ± 0.493
AlaTrp: 2.385 ± 0.421
AlaTyr: 0.954 ± 0.749
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.431 ± 1.108
CysCys: 3.34 ± 1.718
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.431 ± 0.952
CysGly: 1.908 ± 0.956
CysHis: 0.0 ± 0.0
CysIle: 1.908 ± 0.997
CysLys: 1.431 ± 0.806
CysLeu: 6.679 ± 2.511
CysMet: 0.954 ± 0.469
CysAsn: 0.477 ± 0.317
CysPro: 6.202 ± 3.23
CysGln: 1.908 ± 0.956
CysArg: 0.954 ± 0.572
CysSer: 4.294 ± 0.724
CysThr: 5.248 ± 1.903
CysVal: 0.0 ± 0.0
CysTrp: 1.431 ± 0.681
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.954 ± 0.635
AspCys: 0.0 ± 0.0
AspAsp: 0.954 ± 0.635
AspGlu: 0.477 ± 0.317
AspPhe: 1.908 ± 0.933
AspGly: 1.908 ± 0.997
AspHis: 0.477 ± 0.317
AspIle: 0.954 ± 0.508
AspLys: 0.477 ± 0.317
AspLeu: 4.294 ± 1.807
AspMet: 0.0 ± 0.0
AspAsn: 1.431 ± 0.681
AspPro: 4.771 ± 2.076
AspGln: 1.908 ± 0.837
AspArg: 0.0 ± 0.0
AspSer: 2.385 ± 0.421
AspThr: 1.431 ± 0.498
AspVal: 0.954 ± 0.572
AspTrp: 2.385 ± 0.837
AspTyr: 1.908 ± 0.559
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.477 ± 0.504
GluCys: 0.0 ± 0.0
GluAsp: 0.954 ± 0.635
GluGlu: 1.908 ± 1.144
GluPhe: 2.385 ± 0.92
GluGly: 0.954 ± 0.635
GluHis: 2.385 ± 1.159
GluIle: 0.0 ± 0.0
GluLys: 0.954 ± 0.635
GluLeu: 3.34 ± 1.497
GluMet: 0.0 ± 0.0
GluAsn: 2.385 ± 1.369
GluPro: 0.477 ± 0.58
GluGln: 0.0 ± 0.0
GluArg: 0.477 ± 0.58
GluSer: 1.908 ± 0.933
GluThr: 3.34 ± 0.835
GluVal: 0.477 ± 0.437
GluTrp: 0.954 ± 0.572
GluTyr: 1.431 ± 0.681
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.248 ± 1.019
PheCys: 1.431 ± 0.806
PheAsp: 0.0 ± 0.0
PheGlu: 0.477 ± 0.58
PhePhe: 6.202 ± 2.402
PheGly: 2.385 ± 1.397
PheHis: 1.431 ± 0.961
PheIle: 2.863 ± 1.612
PheLys: 1.431 ± 0.57
PheLeu: 11.45 ± 3.495
PheMet: 0.477 ± 0.317
PheAsn: 0.0 ± 0.0
PhePro: 5.725 ± 0.678
PheGln: 0.954 ± 0.642
PheArg: 2.385 ± 1.228
PheSer: 5.725 ± 1.531
PheThr: 2.863 ± 1.442
PheVal: 4.771 ± 0.949
PheTrp: 1.431 ± 0.806
PheTyr: 0.954 ± 0.635
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.34 ± 0.936
GlyCys: 0.954 ± 0.572
GlyAsp: 2.385 ± 0.633
GlyGlu: 3.34 ± 0.39
GlyPhe: 2.863 ± 0.658
GlyGly: 2.385 ± 0.871
GlyHis: 0.477 ± 0.317
GlyIle: 2.385 ± 0.633
GlyLys: 0.477 ± 0.317
GlyLeu: 9.065 ± 1.11
GlyMet: 1.908 ± 0.922
GlyAsn: 1.431 ± 0.806
GlyPro: 7.156 ± 1.648
GlyGln: 1.431 ± 0.57
GlyArg: 3.817 ± 0.873
GlySer: 6.202 ± 0.5
GlyThr: 2.863 ± 0.493
GlyVal: 1.908 ± 0.742
GlyTrp: 1.908 ± 0.566
GlyTyr: 2.385 ± 0.421
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.954 ± 0.635
HisCys: 1.431 ± 0.681
HisAsp: 0.954 ± 0.635
HisGlu: 0.477 ± 0.437
HisPhe: 1.431 ± 0.721
HisGly: 1.431 ± 0.573
HisHis: 3.34 ± 1.817
HisIle: 1.908 ± 1.27
HisLys: 0.954 ± 0.684
HisLeu: 3.817 ± 1.866
HisMet: 0.0 ± 0.0
HisAsn: 0.954 ± 0.635
HisPro: 2.863 ± 1.035
HisGln: 1.431 ± 0.573
HisArg: 0.954 ± 0.635
HisSer: 2.863 ± 0.493
HisThr: 1.908 ± 0.933
HisVal: 0.477 ± 0.504
HisTrp: 0.0 ± 0.0
HisTyr: 0.954 ± 0.635
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.908 ± 0.559
IleCys: 0.477 ± 0.317
IleAsp: 1.431 ± 0.674
IleGlu: 0.0 ± 0.0
IlePhe: 3.34 ± 1.718
IleGly: 0.477 ± 0.317
IleHis: 1.431 ± 0.952
IleIle: 2.385 ± 0.645
IleLys: 1.431 ± 0.952
IleLeu: 4.771 ± 1.289
IleMet: 0.954 ± 0.708
IleAsn: 0.477 ± 0.437
IlePro: 8.111 ± 2.964
IleGln: 0.477 ± 0.317
IleArg: 1.908 ± 1.016
IleSer: 1.908 ± 1.283
IleThr: 2.863 ± 0.658
IleVal: 1.431 ± 0.952
IleTrp: 2.863 ± 1.612
IleTyr: 0.477 ± 0.317
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.908 ± 0.771
LysCys: 0.0 ± 0.0
LysAsp: 1.431 ± 0.99
LysGlu: 0.477 ± 0.504
LysPhe: 0.477 ± 0.317
LysGly: 0.954 ± 0.635
LysHis: 0.477 ± 0.317
LysIle: 2.863 ± 0.428
LysLys: 0.0 ± 0.0
LysLeu: 2.863 ± 1.519
LysMet: 0.0 ± 0.0
LysAsn: 0.477 ± 0.317
LysPro: 2.385 ± 0.421
LysGln: 1.431 ± 0.681
LysArg: 1.908 ± 1.27
LysSer: 1.908 ± 1.27
LysThr: 1.908 ± 0.559
LysVal: 2.385 ± 1.228
LysTrp: 0.0 ± 0.0
LysTyr: 0.954 ± 0.635
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.725 ± 0.589
LeuCys: 3.817 ± 1.99
LeuAsp: 4.771 ± 1.635
LeuGlu: 4.294 ± 2.042
LeuPhe: 4.771 ± 1.139
LeuGly: 10.019 ± 0.932
LeuHis: 3.817 ± 2.54
LeuIle: 1.908 ± 0.922
LeuLys: 2.863 ± 1.16
LeuLeu: 21.947 ± 4.372
LeuMet: 1.431 ± 0.681
LeuAsn: 4.294 ± 0.494
LeuPro: 11.927 ± 0.429
LeuGln: 6.679 ± 1.082
LeuArg: 6.202 ± 1.271
LeuSer: 11.45 ± 0.999
LeuThr: 4.771 ± 0.856
LeuVal: 8.588 ± 0.478
LeuTrp: 5.725 ± 1.876
LeuTyr: 6.202 ± 0.51
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.908 ± 1.151
MetCys: 1.431 ± 0.806
MetAsp: 0.954 ± 0.508
MetGlu: 1.431 ± 0.806
MetPhe: 0.0 ± 0.0
MetGly: 1.908 ± 0.566
MetHis: 0.477 ± 0.504
MetIle: 1.431 ± 0.806
MetLys: 0.0 ± 0.0
MetLeu: 1.908 ± 0.559
MetMet: 0.0 ± 0.0
MetAsn: 0.477 ± 0.58
MetPro: 1.431 ± 0.952
MetGln: 0.0 ± 0.0
MetArg: 0.477 ± 0.317
MetSer: 0.0 ± 0.0
MetThr: 0.954 ± 0.642
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 1.431 ± 0.806
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.954 ± 0.508
AsnCys: 1.431 ± 0.806
AsnAsp: 0.477 ± 0.504
AsnGlu: 0.477 ± 0.317
AsnPhe: 2.385 ± 1.369
AsnGly: 0.0 ± 0.0
AsnHis: 0.954 ± 0.635
AsnIle: 2.863 ± 1.109
AsnLys: 0.0 ± 0.0
AsnLeu: 3.34 ± 0.39
AsnMet: 1.431 ± 0.787
AsnAsn: 0.954 ± 0.635
AsnPro: 3.817 ± 1.5
AsnGln: 0.477 ± 0.437
AsnArg: 1.908 ± 0.62
AsnSer: 2.385 ± 0.795
AsnThr: 0.477 ± 0.317
AsnVal: 0.0 ± 0.0
AsnTrp: 0.954 ± 0.635
AsnTyr: 0.477 ± 0.504
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 10.019 ± 1.088
ProCys: 5.248 ± 1.544
ProAsp: 0.477 ± 0.317
ProGlu: 2.385 ± 1.159
ProPhe: 4.771 ± 0.949
ProGly: 7.156 ± 3.161
ProHis: 3.34 ± 1.022
ProIle: 5.725 ± 0.847
ProLys: 1.431 ± 0.573
ProLeu: 13.359 ± 1.527
ProMet: 1.431 ± 0.686
ProAsn: 1.431 ± 0.573
ProPro: 3.34 ± 1.09
ProGln: 4.771 ± 1.675
ProArg: 3.34 ± 0.862
ProSer: 10.019 ± 1.737
ProThr: 9.065 ± 4.253
ProVal: 7.156 ± 1.226
ProTrp: 1.431 ± 0.573
ProTyr: 0.954 ± 0.508
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.863 ± 0.857
GlnCys: 0.954 ± 0.508
GlnAsp: 2.385 ± 0.633
GlnGlu: 0.477 ± 0.317
GlnPhe: 1.908 ± 1.27
GlnGly: 2.863 ± 0.658
GlnHis: 0.477 ± 0.317
GlnIle: 0.0 ± 0.0
GlnLys: 0.954 ± 0.642
GlnLeu: 4.294 ± 1.099
GlnMet: 0.0 ± 0.0
GlnAsn: 0.954 ± 0.399
GlnPro: 0.477 ± 0.58
GlnGln: 1.431 ± 0.721
GlnArg: 2.385 ± 1.587
GlnSer: 7.634 ± 0.908
GlnThr: 2.385 ± 0.645
GlnVal: 1.431 ± 0.573
GlnTrp: 1.908 ± 0.997
GlnTyr: 0.477 ± 0.317
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.954 ± 0.635
ArgCys: 1.431 ± 0.952
ArgAsp: 3.34 ± 1.049
ArgGlu: 2.385 ± 1.453
ArgPhe: 5.725 ± 1.94
ArgGly: 4.294 ± 0.937
ArgHis: 0.954 ± 1.161
ArgIle: 1.431 ± 0.952
ArgLys: 1.431 ± 0.952
ArgLeu: 4.771 ± 2.457
ArgMet: 0.954 ± 0.572
ArgAsn: 2.385 ± 0.837
ArgPro: 5.725 ± 2.32
ArgGln: 1.431 ± 0.961
ArgArg: 7.634 ± 3.594
ArgSer: 2.385 ± 1.159
ArgThr: 3.817 ± 1.483
ArgVal: 2.863 ± 0.724
ArgTrp: 1.908 ± 0.559
ArgTyr: 0.954 ± 0.635
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.679 ± 2.232
SerCys: 5.248 ± 1.91
SerAsp: 1.431 ± 0.681
SerGlu: 1.431 ± 0.952
SerPhe: 2.863 ± 1.362
SerGly: 4.294 ± 1.306
SerHis: 3.817 ± 0.708
SerIle: 0.954 ± 0.572
SerLys: 0.954 ± 0.508
SerLeu: 13.359 ± 2.26
SerMet: 1.431 ± 0.806
SerAsn: 1.908 ± 0.563
SerPro: 11.927 ± 2.277
SerGln: 4.294 ± 1.655
SerArg: 5.725 ± 2.592
SerSer: 15.744 ± 2.127
SerThr: 3.817 ± 0.535
SerVal: 4.771 ± 0.677
SerTrp: 4.294 ± 1.096
SerTyr: 0.477 ± 0.317
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.248 ± 0.856
ThrCys: 6.202 ± 2.957
ThrAsp: 2.385 ± 0.421
ThrGlu: 0.0 ± 0.0
ThrPhe: 0.477 ± 0.504
ThrGly: 5.725 ± 0.848
ThrHis: 1.908 ± 0.837
ThrIle: 4.294 ± 2.411
ThrLys: 4.771 ± 0.842
ThrLeu: 3.34 ± 0.936
ThrMet: 0.0 ± 0.0
ThrAsn: 0.477 ± 0.317
ThrPro: 7.634 ± 1.906
ThrGln: 0.477 ± 0.317
ThrArg: 1.908 ± 1.27
ThrSer: 8.111 ± 1.802
ThrThr: 12.405 ± 5.05
ThrVal: 2.863 ± 2.445
ThrTrp: 1.431 ± 0.454
ThrTyr: 0.954 ± 0.635
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.908 ± 1.27
ValCys: 2.863 ± 0.549
ValAsp: 0.954 ± 0.635
ValGlu: 1.908 ± 0.961
ValPhe: 4.771 ± 0.949
ValGly: 4.294 ± 0.494
ValHis: 0.954 ± 0.635
ValIle: 1.908 ± 0.559
ValLys: 0.0 ± 0.0
ValLeu: 5.725 ± 1.029
ValMet: 0.477 ± 0.317
ValAsn: 1.431 ± 0.961
ValPro: 3.34 ± 1.319
ValGln: 3.817 ± 1.022
ValArg: 3.817 ± 1.36
ValSer: 3.817 ± 1.866
ValThr: 2.385 ± 0.804
ValVal: 2.863 ± 1.362
ValTrp: 0.477 ± 0.504
ValTyr: 2.385 ± 0.421
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.863 ± 1.612
TrpCys: 0.0 ± 0.0
TrpAsp: 1.908 ± 1.082
TrpGlu: 1.431 ± 1.108
TrpPhe: 3.34 ± 1.657
TrpGly: 3.817 ± 0.568
TrpHis: 0.0 ± 0.0
TrpIle: 0.954 ± 0.508
TrpLys: 1.908 ± 1.27
TrpLeu: 4.771 ± 0.856
TrpMet: 2.863 ± 1.612
TrpAsn: 1.431 ± 0.966
TrpPro: 0.954 ± 0.399
TrpGln: 0.0 ± 0.0
TrpArg: 0.477 ± 0.317
TrpSer: 0.0 ± 0.0
TrpThr: 2.863 ± 0.493
TrpVal: 1.908 ± 0.922
TrpTrp: 0.477 ± 0.317
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.477 ± 0.317
TyrCys: 0.477 ± 0.317
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.0 ± 0.0
TyrPhe: 1.908 ± 0.45
TyrGly: 0.477 ± 0.317
TyrHis: 1.431 ± 0.952
TyrIle: 1.431 ± 0.806
TyrLys: 1.431 ± 0.681
TyrLeu: 2.385 ± 0.92
TyrMet: 0.477 ± 0.317
TyrAsn: 0.954 ± 0.572
TyrPro: 2.863 ± 0.493
TyrGln: 1.908 ± 0.559
TyrArg: 2.385 ± 1.248
TyrSer: 0.954 ± 0.635
TyrThr: 2.385 ± 0.421
TyrVal: 2.385 ± 1.216
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2097 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
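A protein-level bootstrap of this kind might look roughly like the sketch below. It is an illustration only: the source does not say which statistic is reported as the error, so the sample standard deviation over the replicates is assumed here, and the function names, the seed parameter, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter codes, e.g. 'LL')."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein only.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the sample standard deviation of the
    frequency across n_rounds bootstrap replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # One replicate: draw as many proteins as the proteome has.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy proteome (made up for illustration).
toy = ["MLLAS", "ALLPL", "MSSTT", "GPLLA"]
print(pair_permille(toy, "LL"), bootstrap_error(toy, "LL"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, which is why a small proteome such as this one (6 proteins) can show errors that are large relative to the frequencies themselves.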

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski