Amino acid dipeptide frequency for Cycloclasticus sp. (strain P1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each one would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
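The counting described above can be sketched as follows. This is a minimal illustration, not the code used to build the database: the function name `dipeptide_per_mille`, the one-letter alphabet, the toy sequences, and the choice to count overlapping pairs within each protein only are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: pairs are counted inside each protein, never across proteins.
        for a, b in zip(seq, seq[1:]):
            # Assumption: any non-standard or unknown residue is folded into X (Xaa).
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Number of ordered pairs over 20 residues, and the uniform expectation.
n_pairs = len(list(product(STANDARD, repeat=2)))  # 20 * 20
uniform_per_mille = 1000.0 / n_pairs              # 0.25% expressed in per mille

toy = ["MALLA", "MLLKX"]  # hypothetical toy sequences, not real proteins
freqs = dipeptide_per_mille(toy)
```

With the two toy sequences there are eight overlapping pairs, two of which are LL, so `freqs["LL"]` is 250 per mille, far above the uniform expectation of 2.5 per mille.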

All values are given in per mille (‰); to convert them to fractions, multiply by 10⁻³ (for example, 7.201‰ = 0.007201).
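As a small worked example of the unit conversion, the sketch below turns a few per mille values from the table into fractions and percentages and compares them with the uniform expectation of 0.25%. The helper names are hypothetical, and using exactly 2.5‰ as the over/under threshold is an assumption based on the description above, not a documented rule of the page.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 0.25% expressed in per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 7.201, "LeuLeu": 10.962, "CysCys": 0.147, "TrpTrp": 0.145}

for name, value in examples.items():
    print(name, per_mille_to_fraction(value), per_mille_to_percent(value), classify(value))
```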

For more information, see the sequence space article on Wikipedia.

Rows: first amino acid of the dipeptide; columns: second amino acid. Values in ‰, given as frequency ± error.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 7.201 ± 0.122 | 1.025 ± 0.042 | 4.865 ± 0.101 | 5.273 ± 0.098 | 3.346 ± 0.071 | 6.164 ± 0.11 | 1.72 ± 0.053 | 6.012 ± 0.108 | 4.841 ± 0.1 | 9.403 ± 0.139 | 2.376 ± 0.058 | 3.536 ± 0.067 | 2.597 ± 0.059 | 3.252 ± 0.076 | 3.511 ± 0.078 | 5.505 ± 0.08 | 4.348 ± 0.084 | 5.893 ± 0.097 | 0.936 ± 0.039 | 2.388 ± 0.064 | 0.0 ± 0.0
Cys | 0.781 ± 0.038 | 0.147 ± 0.016 | 0.597 ± 0.032 | 0.609 ± 0.034 | 0.471 ± 0.027 | 0.897 ± 0.041 | 0.317 ± 0.02 | 0.641 ± 0.028 | 0.44 ± 0.024 | 1.039 ± 0.038 | 0.228 ± 0.022 | 0.306 ± 0.019 | 0.493 ± 0.028 | 0.457 ± 0.026 | 0.435 ± 0.027 | 0.671 ± 0.035 | 0.516 ± 0.026 | 0.682 ± 0.035 | 0.12 ± 0.015 | 0.32 ± 0.023 | 0.0 ± 0.0
Asp | 4.45 ± 0.086 | 0.556 ± 0.028 | 3.342 ± 0.073 | 4.332 ± 0.099 | 2.501 ± 0.063 | 3.712 ± 0.082 | 1.132 ± 0.044 | 4.506 ± 0.089 | 3.275 ± 0.066 | 5.396 ± 0.092 | 1.452 ± 0.054 | 2.458 ± 0.054 | 2.11 ± 0.059 | 2.038 ± 0.048 | 2.344 ± 0.061 | 3.269 ± 0.07 | 2.835 ± 0.065 | 4.109 ± 0.078 | 0.805 ± 0.035 | 1.851 ± 0.056 | 0.0 ± 0.0
Glu | 5.345 ± 0.109 | 0.485 ± 0.027 | 2.921 ± 0.068 | 4.014 ± 0.108 | 2.328 ± 0.056 | 4.035 ± 0.089 | 1.572 ± 0.054 | 3.845 ± 0.081 | 4.553 ± 0.1 | 6.829 ± 0.099 | 1.738 ± 0.052 | 2.664 ± 0.065 | 2.075 ± 0.051 | 3.667 ± 0.078 | 3.22 ± 0.069 | 3.561 ± 0.075 | 3.396 ± 0.082 | 4.313 ± 0.082 | 0.683 ± 0.032 | 1.667 ± 0.049 | 0.0 ± 0.0
Phe | 3.222 ± 0.07 | 0.509 ± 0.026 | 2.642 ± 0.05 | 2.511 ± 0.058 | 1.973 ± 0.059 | 2.931 ± 0.072 | 0.805 ± 0.037 | 3.011 ± 0.068 | 2.263 ± 0.057 | 3.679 ± 0.08 | 1.023 ± 0.04 | 2.03 ± 0.056 | 1.484 ± 0.045 | 1.27 ± 0.047 | 1.337 ± 0.046 | 3.315 ± 0.07 | 2.091 ± 0.054 | 2.59 ± 0.058 | 0.465 ± 0.026 | 1.407 ± 0.046 | 0.0 ± 0.0
Gly | 5.327 ± 0.111 | 0.863 ± 0.043 | 3.741 ± 0.078 | 4.422 ± 0.074 | 3.18 ± 0.069 | 4.946 ± 0.108 | 1.792 ± 0.054 | 4.784 ± 0.083 | 3.87 ± 0.086 | 7.582 ± 0.116 | 1.974 ± 0.048 | 2.351 ± 0.06 | 1.918 ± 0.06 | 2.856 ± 0.069 | 3.04 ± 0.068 | 4.127 ± 0.079 | 3.43 ± 0.089 | 5.657 ± 0.1 | 0.995 ± 0.039 | 2.318 ± 0.057 | 0.0 ± 0.0
His | 1.822 ± 0.053 | 0.307 ± 0.022 | 1.123 ± 0.043 | 1.295 ± 0.043 | 1.058 ± 0.04 | 1.589 ± 0.053 | 0.713 ± 0.032 | 1.436 ± 0.045 | 1.157 ± 0.045 | 2.3 ± 0.062 | 0.541 ± 0.03 | 0.849 ± 0.039 | 1.182 ± 0.046 | 1.129 ± 0.043 | 1.059 ± 0.043 | 1.253 ± 0.05 | 1.119 ± 0.04 | 1.475 ± 0.045 | 0.349 ± 0.024 | 0.863 ± 0.035 | 0.0 ± 0.0
Ile | 6.117 ± 0.093 | 0.697 ± 0.028 | 4.57 ± 0.094 | 4.866 ± 0.093 | 2.438 ± 0.069 | 4.915 ± 0.09 | 1.376 ± 0.045 | 4.344 ± 0.106 | 4.246 ± 0.086 | 5.566 ± 0.097 | 1.453 ± 0.047 | 3.578 ± 0.077 | 2.829 ± 0.071 | 2.529 ± 0.063 | 2.931 ± 0.064 | 4.587 ± 0.087 | 3.81 ± 0.086 | 4.33 ± 0.08 | 0.574 ± 0.029 | 1.718 ± 0.054 | 0.0 ± 0.0
Lys | 5.133 ± 0.088 | 0.334 ± 0.026 | 2.995 ± 0.072 | 3.817 ± 0.09 | 1.5 ± 0.047 | 3.883 ± 0.086 | 1.361 ± 0.046 | 3.544 ± 0.076 | 4.328 ± 0.091 | 5.436 ± 0.093 | 1.482 ± 0.045 | 2.864 ± 0.064 | 2.684 ± 0.073 | 2.998 ± 0.067 | 2.909 ± 0.062 | 3.3 ± 0.071 | 3.558 ± 0.079 | 3.832 ± 0.075 | 0.535 ± 0.03 | 1.28 ± 0.045 | 0.0 ± 0.0
Leu | 9.503 ± 0.126 | 0.991 ± 0.036 | 6.122 ± 0.101 | 6.115 ± 0.105 | 4.12 ± 0.082 | 7.097 ± 0.128 | 2.086 ± 0.057 | 6.811 ± 0.117 | 6.17 ± 0.099 | 10.962 ± 0.174 | 2.763 ± 0.06 | 4.834 ± 0.084 | 4.617 ± 0.083 | 3.712 ± 0.083 | 4.437 ± 0.084 | 8.174 ± 0.124 | 6.11 ± 0.101 | 6.677 ± 0.098 | 0.999 ± 0.041 | 2.61 ± 0.057 | 0.0 ± 0.0
Met | 2.421 ± 0.059 | 0.222 ± 0.019 | 1.415 ± 0.049 | 1.292 ± 0.051 | 0.865 ± 0.037 | 1.941 ± 0.047 | 0.528 ± 0.03 | 1.485 ± 0.048 | 1.499 ± 0.047 | 2.752 ± 0.07 | 0.759 ± 0.035 | 1.168 ± 0.045 | 1.27 ± 0.048 | 1.038 ± 0.046 | 1.116 ± 0.036 | 2.075 ± 0.061 | 1.503 ± 0.047 | 1.769 ± 0.048 | 0.182 ± 0.017 | 0.461 ± 0.027 | 0.0 ± 0.0
Asn | 3.512 ± 0.071 | 0.4 ± 0.024 | 2.445 ± 0.062 | 2.811 ± 0.061 | 1.499 ± 0.043 | 2.907 ± 0.065 | 0.9 ± 0.037 | 3.313 ± 0.073 | 2.945 ± 0.061 | 3.842 ± 0.08 | 0.998 ± 0.033 | 2.11 ± 0.058 | 2.105 ± 0.051 | 1.819 ± 0.061 | 1.955 ± 0.061 | 2.382 ± 0.065 | 2.452 ± 0.053 | 2.688 ± 0.067 | 0.58 ± 0.031 | 1.25 ± 0.042 | 0.0 ± 0.0
Pro | 3.271 ± 0.081 | 0.337 ± 0.023 | 2.495 ± 0.058 | 2.898 ± 0.061 | 1.72 ± 0.057 | 2.459 ± 0.057 | 0.812 ± 0.034 | 2.791 ± 0.071 | 1.97 ± 0.059 | 4.022 ± 0.086 | 1.016 ± 0.036 | 1.833 ± 0.054 | 1.222 ± 0.04 | 1.277 ± 0.039 | 1.405 ± 0.046 | 2.624 ± 0.057 | 2.283 ± 0.06 | 3.19 ± 0.068 | 0.457 ± 0.028 | 1.22 ± 0.041 | 0.0 ± 0.0
Gln | 4.126 ± 0.087 | 0.407 ± 0.024 | 1.653 ± 0.047 | 2.188 ± 0.058 | 1.587 ± 0.044 | 2.414 ± 0.061 | 1.351 ± 0.047 | 2.425 ± 0.062 | 2.075 ± 0.054 | 4.997 ± 0.106 | 1.013 ± 0.04 | 1.512 ± 0.051 | 1.484 ± 0.047 | 3.011 ± 0.087 | 2.374 ± 0.066 | 2.53 ± 0.066 | 2.295 ± 0.06 | 2.843 ± 0.069 | 0.64 ± 0.032 | 1.28 ± 0.039 | 0.0 ± 0.0
Arg | 3.331 ± 0.073 | 0.465 ± 0.027 | 2.428 ± 0.064 | 2.817 ± 0.065 | 2.199 ± 0.064 | 2.687 ± 0.07 | 1.084 ± 0.039 | 2.956 ± 0.062 | 2.349 ± 0.055 | 5.08 ± 0.107 | 1.176 ± 0.039 | 1.635 ± 0.043 | 1.625 ± 0.045 | 2.189 ± 0.057 | 2.263 ± 0.069 | 2.638 ± 0.061 | 2.016 ± 0.056 | 3.218 ± 0.077 | 0.567 ± 0.028 | 1.661 ± 0.049 | 0.0 ± 0.0
Ser | 5.308 ± 0.096 | 0.683 ± 0.032 | 3.544 ± 0.07 | 3.746 ± 0.072 | 2.863 ± 0.067 | 4.883 ± 0.097 | 1.408 ± 0.042 | 4.697 ± 0.077 | 3.391 ± 0.083 | 7.278 ± 0.123 | 1.758 ± 0.048 | 2.703 ± 0.061 | 2.46 ± 0.054 | 2.369 ± 0.061 | 2.791 ± 0.062 | 4.415 ± 0.076 | 3.691 ± 0.073 | 4.623 ± 0.106 | 0.795 ± 0.037 | 1.869 ± 0.049 | 0.0 ± 0.0
Thr | 4.455 ± 0.084 | 0.49 ± 0.027 | 3.026 ± 0.078 | 3.116 ± 0.066 | 2.152 ± 0.061 | 4.374 ± 0.095 | 1.326 ± 0.046 | 3.525 ± 0.065 | 2.575 ± 0.056 | 6.547 ± 0.119 | 1.222 ± 0.043 | 2.001 ± 0.059 | 2.738 ± 0.068 | 2.185 ± 0.061 | 2.244 ± 0.058 | 3.271 ± 0.073 | 3.001 ± 0.085 | 3.961 ± 0.098 | 0.559 ± 0.025 | 1.411 ± 0.048 | 0.0 ± 0.0
Val | 5.852 ± 0.103 | 0.77 ± 0.034 | 4.325 ± 0.087 | 4.624 ± 0.099 | 2.935 ± 0.068 | 4.643 ± 0.101 | 1.323 ± 0.05 | 5.024 ± 0.101 | 3.721 ± 0.087 | 7.35 ± 0.108 | 1.861 ± 0.055 | 3.06 ± 0.069 | 2.682 ± 0.064 | 2.103 ± 0.054 | 2.9 ± 0.065 | 4.956 ± 0.089 | 3.822 ± 0.093 | 5.069 ± 0.097 | 0.672 ± 0.029 | 1.9 ± 0.051 | 0.0 ± 0.0
Trp | 0.882 ± 0.037 | 0.143 ± 0.014 | 0.581 ± 0.03 | 0.592 ± 0.026 | 0.566 ± 0.032 | 0.686 ± 0.032 | 0.277 ± 0.018 | 0.604 ± 0.035 | 0.574 ± 0.032 | 1.568 ± 0.051 | 0.305 ± 0.02 | 0.412 ± 0.024 | 0.447 ± 0.025 | 0.686 ± 0.03 | 0.549 ± 0.028 | 0.768 ± 0.036 | 0.447 ± 0.028 | 0.929 ± 0.037 | 0.145 ± 0.017 | 0.282 ± 0.021 | 0.0 ± 0.0
Tyr | 2.135 ± 0.055 | 0.335 ± 0.025 | 1.639 ± 0.051 | 1.575 ± 0.053 | 1.257 ± 0.043 | 2.012 ± 0.056 | 0.738 ± 0.033 | 1.72 ± 0.048 | 1.534 ± 0.047 | 3.283 ± 0.07 | 0.623 ± 0.027 | 1.009 ± 0.036 | 1.277 ± 0.043 | 1.618 ± 0.053 | 1.562 ± 0.046 | 1.855 ± 0.057 | 1.454 ± 0.052 | 1.746 ± 0.05 | 0.384 ± 0.022 | 0.946 ± 0.038 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 2247 proteins (715765 amino acids)
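To give a feel for the sample size behind these frequencies, the sketch below estimates how many adjacent residue pairs the proteome contains and roughly how many occurrences a given per mille value corresponds to. It assumes that dipeptides are counted as overlapping pairs within each protein (so a protein of length n contributes n - 1 pairs) and that no protein is empty; the page does not state this, so the numbers are approximate.

```python
N_PROTEINS = 2247    # from the statistics line above
N_RESIDUES = 715765  # from the statistics line above

# Assumption: each protein of length n contributes n - 1 overlapping pairs.
n_pairs_total = N_RESIDUES - N_PROTEINS


def approx_count(per_mille, total=n_pairs_total):
    """Approximate number of occurrences behind a per mille frequency."""
    return per_mille * 1e-3 * total


# AlaAla is listed at 7.201 per mille in the table.
ala_ala = approx_count(7.201)
print(n_pairs_total, round(ala_ala))
```

Under these assumptions a single dipeptide such as AlaAla is backed by several thousand observations, which is consistent with the small error estimates reported in the table.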

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
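A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is taken as the error. The function names are hypothetical, and using the sample standard deviation as the reported error is an assumption, since the note does not say which statistic is used.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    # Assumption: the error is the sample standard deviation of the replicates.
    return statistics.stdev(estimates)


toy = ["MALLA", "MLLKX", "AAGLL", "GGAVK"]  # hypothetical toy sequences
error = bootstrap_error(toy, "LL")
```

Resampling proteins rather than individual residue pairs keeps the pairs within one protein together, so the estimate reflects variation between proteins.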

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski