Amino acid dipepetide frequency for Roseicitreum antarcticum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.947AlaAla: 18.947 ± 0.192
1.165AlaCys: 1.165 ± 0.036
7.758AlaAsp: 7.758 ± 0.101
7.683AlaGlu: 7.683 ± 0.108
4.347AlaPhe: 4.347 ± 0.064
11.497AlaGly: 11.497 ± 0.117
2.747AlaHis: 2.747 ± 0.048
5.861AlaIle: 5.861 ± 0.072
3.154AlaLys: 3.154 ± 0.064
15.144AlaLeu: 15.144 ± 0.148
4.17AlaMet: 4.17 ± 0.062
2.618AlaAsn: 2.618 ± 0.051
6.773AlaPro: 6.773 ± 0.106
5.644AlaGln: 5.644 ± 0.084
10.215AlaArg: 10.215 ± 0.112
5.795AlaSer: 5.795 ± 0.072
6.766AlaThr: 6.766 ± 0.086
9.125AlaVal: 9.125 ± 0.104
1.485AlaTrp: 1.485 ± 0.04
2.439AlaTyr: 2.439 ± 0.05
0.001AlaXaa: 0.001 ± 0.001
Cys
1.156CysAla: 1.156 ± 0.037
0.102CysCys: 0.102 ± 0.009
0.627CysAsp: 0.627 ± 0.026
0.376CysGlu: 0.376 ± 0.017
0.318CysPhe: 0.318 ± 0.016
0.92CysGly: 0.92 ± 0.031
0.237CysHis: 0.237 ± 0.014
0.397CysIle: 0.397 ± 0.02
0.198CysLys: 0.198 ± 0.013
0.824CysLeu: 0.824 ± 0.028
0.19CysMet: 0.19 ± 0.012
0.212CysAsn: 0.212 ± 0.012
0.511CysPro: 0.511 ± 0.023
0.221CysGln: 0.221 ± 0.013
0.554CysArg: 0.554 ± 0.023
0.393CysSer: 0.393 ± 0.022
0.459CysThr: 0.459 ± 0.02
0.624CysVal: 0.624 ± 0.025
0.138CysTrp: 0.138 ± 0.01
0.184CysTyr: 0.184 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
8.598AspAla: 8.598 ± 0.106
0.531AspCys: 0.531 ± 0.022
3.374AspAsp: 3.374 ± 0.057
2.898AspGlu: 2.898 ± 0.057
2.114AspPhe: 2.114 ± 0.049
5.499AspGly: 5.499 ± 0.071
1.412AspHis: 1.412 ± 0.035
2.99AspIle: 2.99 ± 0.054
1.296AspLys: 1.296 ± 0.037
6.565AspLeu: 6.565 ± 0.068
1.733AspMet: 1.733 ± 0.04
1.253AspAsn: 1.253 ± 0.03
3.675AspPro: 3.675 ± 0.062
1.998AspGln: 1.998 ± 0.044
4.41AspArg: 4.41 ± 0.07
2.332AspSer: 2.332 ± 0.043
3.439AspThr: 3.439 ± 0.055
4.193AspVal: 4.193 ± 0.055
1.224AspTrp: 1.224 ± 0.029
1.387AspTyr: 1.387 ± 0.041
0.0AspXaa: 0.0 ± 0.0
Glu
7.017GluAla: 7.017 ± 0.103
0.335GluCys: 0.335 ± 0.016
2.798GluAsp: 2.798 ± 0.048
2.46GluGlu: 2.46 ± 0.05
1.647GluPhe: 1.647 ± 0.036
4.254GluGly: 4.254 ± 0.066
0.999GluHis: 0.999 ± 0.031
3.017GluIle: 3.017 ± 0.059
1.531GluLys: 1.531 ± 0.046
4.42GluLeu: 4.42 ± 0.061
1.623GluMet: 1.623 ± 0.04
1.445GluAsn: 1.445 ± 0.035
2.154GluPro: 2.154 ± 0.055
1.678GluGln: 1.678 ± 0.038
3.858GluArg: 3.858 ± 0.072
1.935GluSer: 1.935 ± 0.043
3.333GluThr: 3.333 ± 0.057
4.077GluVal: 4.077 ± 0.057
0.635GluTrp: 0.635 ± 0.025
0.981GluTyr: 0.981 ± 0.03
0.001GluXaa: 0.001 ± 0.001
Phe
4.493PheAla: 4.493 ± 0.069
0.389PheCys: 0.389 ± 0.017
2.853PheAsp: 2.853 ± 0.053
1.874PheGlu: 1.874 ± 0.043
1.35PhePhe: 1.35 ± 0.036
3.687PheGly: 3.687 ± 0.057
0.75PheHis: 0.75 ± 0.027
1.675PheIle: 1.675 ± 0.036
0.72PheLys: 0.72 ± 0.024
3.175PheLeu: 3.175 ± 0.059
0.861PheMet: 0.861 ± 0.031
1.006PheAsn: 1.006 ± 0.028
1.537PhePro: 1.537 ± 0.036
0.98PheGln: 0.98 ± 0.029
2.208PheArg: 2.208 ± 0.049
2.009PheSer: 2.009 ± 0.046
2.045PheThr: 2.045 ± 0.039
2.578PheVal: 2.578 ± 0.054
0.583PheTrp: 0.583 ± 0.025
0.868PheTyr: 0.868 ± 0.032
0.0PheXaa: 0.0 ± 0.0
Gly
11.194GlyAla: 11.194 ± 0.118
0.848GlyCys: 0.848 ± 0.027
4.657GlyAsp: 4.657 ± 0.072
3.714GlyGlu: 3.714 ± 0.056
3.64GlyPhe: 3.64 ± 0.06
7.569GlyGly: 7.569 ± 0.093
1.953GlyHis: 1.953 ± 0.041
4.47GlyIle: 4.47 ± 0.069
2.535GlyLys: 2.535 ± 0.058
9.394GlyLeu: 9.394 ± 0.085
2.791GlyMet: 2.791 ± 0.05
2.03GlyAsn: 2.03 ± 0.046
3.942GlyPro: 3.942 ± 0.067
3.465GlyGln: 3.465 ± 0.053
5.977GlyArg: 5.977 ± 0.074
4.098GlySer: 4.098 ± 0.057
4.972GlyThr: 4.972 ± 0.061
6.723GlyVal: 6.723 ± 0.08
1.558GlyTrp: 1.558 ± 0.038
2.211GlyTyr: 2.211 ± 0.044
0.0GlyXaa: 0.0 ± 0.0
His
2.687HisAla: 2.687 ± 0.047
0.257HisCys: 0.257 ± 0.014
1.48HisAsp: 1.48 ± 0.036
0.972HisGlu: 0.972 ± 0.029
0.807HisPhe: 0.807 ± 0.03
1.925HisGly: 1.925 ± 0.047
0.575HisHis: 0.575 ± 0.026
1.038HisIle: 1.038 ± 0.026
0.47HisLys: 0.47 ± 0.023
2.279HisLeu: 2.279 ± 0.046
0.539HisMet: 0.539 ± 0.023
0.459HisAsn: 0.459 ± 0.017
1.458HisPro: 1.458 ± 0.036
0.639HisGln: 0.639 ± 0.023
1.424HisArg: 1.424 ± 0.034
0.976HisSer: 0.976 ± 0.029
0.912HisThr: 0.912 ± 0.03
1.488HisVal: 1.488 ± 0.036
0.347HisTrp: 0.347 ± 0.017
0.499HisTyr: 0.499 ± 0.02
0.0HisXaa: 0.0 ± 0.0
Ile
7.184IleAla: 7.184 ± 0.096
0.583IleCys: 0.583 ± 0.024
3.363IleAsp: 3.363 ± 0.05
2.919IleGlu: 2.919 ± 0.059
1.666IlePhe: 1.666 ± 0.038
4.715IleGly: 4.715 ± 0.064
0.963IleHis: 0.963 ± 0.029
2.315IleIle: 2.315 ± 0.06
1.176IleLys: 1.176 ± 0.037
4.461IleLeu: 4.461 ± 0.069
1.115IleMet: 1.115 ± 0.033
1.327IleAsn: 1.327 ± 0.04
2.408IlePro: 2.408 ± 0.047
1.095IleGln: 1.095 ± 0.035
3.205IleArg: 3.205 ± 0.046
2.86IleSer: 2.86 ± 0.052
2.964IleThr: 2.964 ± 0.045
3.781IleVal: 3.781 ± 0.067
0.745IleTrp: 0.745 ± 0.022
1.104IleTyr: 1.104 ± 0.031
0.0IleXaa: 0.0 ± 0.0
Lys
3.222LysAla: 3.222 ± 0.07
0.144LysCys: 0.144 ± 0.011
1.298LysAsp: 1.298 ± 0.036
1.053LysGlu: 1.053 ± 0.033
0.77LysPhe: 0.77 ± 0.025
2.124LysGly: 2.124 ± 0.043
0.548LysHis: 0.548 ± 0.023
1.328LysIle: 1.328 ± 0.033
0.87LysLys: 0.87 ± 0.03
2.513LysLeu: 2.513 ± 0.053
0.736LysMet: 0.736 ± 0.026
0.588LysAsn: 0.588 ± 0.026
1.514LysPro: 1.514 ± 0.043
0.723LysGln: 0.723 ± 0.025
1.882LysArg: 1.882 ± 0.043
1.507LysSer: 1.507 ± 0.038
1.676LysThr: 1.676 ± 0.041
1.821LysVal: 1.821 ± 0.05
0.286LysTrp: 0.286 ± 0.015
0.517LysTyr: 0.517 ± 0.021
0.0LysXaa: 0.0 ± 0.0
Leu
13.596LeuAla: 13.596 ± 0.123
0.929LeuCys: 0.929 ± 0.029
6.086LeuAsp: 6.086 ± 0.076
4.854LeuGlu: 4.854 ± 0.06
3.342LeuPhe: 3.342 ± 0.064
8.455LeuGly: 8.455 ± 0.102
2.167LeuHis: 2.167 ± 0.048
5.418LeuIle: 5.418 ± 0.078
2.486LeuLys: 2.486 ± 0.053
8.956LeuLeu: 8.956 ± 0.115
2.809LeuMet: 2.809 ± 0.052
2.635LeuAsn: 2.635 ± 0.046
5.892LeuPro: 5.892 ± 0.077
2.829LeuGln: 2.829 ± 0.052
7.967LeuArg: 7.967 ± 0.091
6.586LeuSer: 6.586 ± 0.081
6.447LeuThr: 6.447 ± 0.078
6.668LeuVal: 6.668 ± 0.074
1.354LeuTrp: 1.354 ± 0.034
1.826LeuTyr: 1.826 ± 0.048
0.0LeuXaa: 0.0 ± 0.0
Met
3.779MetAla: 3.779 ± 0.06
0.168MetCys: 0.168 ± 0.01
1.407MetAsp: 1.407 ± 0.036
1.231MetGlu: 1.231 ± 0.033
0.83MetPhe: 0.83 ± 0.033
2.444MetGly: 2.444 ± 0.049
0.506MetHis: 0.506 ± 0.02
1.537MetIle: 1.537 ± 0.04
0.859MetLys: 0.859 ± 0.026
2.812MetLeu: 2.812 ± 0.051
0.807MetMet: 0.807 ± 0.03
0.834MetAsn: 0.834 ± 0.026
1.668MetPro: 1.668 ± 0.038
1.116MetGln: 1.116 ± 0.034
2.078MetArg: 2.078 ± 0.034
1.706MetSer: 1.706 ± 0.035
2.185MetThr: 2.185 ± 0.041
2.022MetVal: 2.022 ± 0.048
0.233MetTrp: 0.233 ± 0.016
0.35MetTyr: 0.35 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
3.17AsnAla: 3.17 ± 0.054
0.207AsnCys: 0.207 ± 0.014
1.338AsnAsp: 1.338 ± 0.034
0.973AsnGlu: 0.973 ± 0.027
0.881AsnPhe: 0.881 ± 0.026
2.072AsnGly: 2.072 ± 0.042
0.489AsnHis: 0.489 ± 0.02
1.386AsnIle: 1.386 ± 0.036
0.566AsnLys: 0.566 ± 0.022
2.432AsnLeu: 2.432 ± 0.042
0.648AsnMet: 0.648 ± 0.024
0.643AsnAsn: 0.643 ± 0.027
1.818AsnPro: 1.818 ± 0.043
0.705AsnGln: 0.705 ± 0.024
1.693AsnArg: 1.693 ± 0.035
1.069AsnSer: 1.069 ± 0.031
1.344AsnThr: 1.344 ± 0.033
1.704AsnVal: 1.704 ± 0.04
0.418AsnTrp: 0.418 ± 0.023
0.582AsnTyr: 0.582 ± 0.025
0.001AsnXaa: 0.001 ± 0.001
Pro
6.827ProAla: 6.827 ± 0.106
0.387ProCys: 0.387 ± 0.018
4.368ProAsp: 4.368 ± 0.061
3.635ProGlu: 3.635 ± 0.057
1.924ProPhe: 1.924 ± 0.041
5.136ProGly: 5.136 ± 0.071
1.172ProHis: 1.172 ± 0.031
2.064ProIle: 2.064 ± 0.04
1.347ProLys: 1.347 ± 0.034
4.947ProLeu: 4.947 ± 0.075
1.431ProMet: 1.431 ± 0.035
1.189ProAsn: 1.189 ± 0.031
2.731ProPro: 2.731 ± 0.057
1.933ProGln: 1.933 ± 0.044
3.223ProArg: 3.223 ± 0.05
2.531ProSer: 2.531 ± 0.043
2.66ProThr: 2.66 ± 0.049
4.459ProVal: 4.459 ± 0.057
0.672ProTrp: 0.672 ± 0.024
1.07ProTyr: 1.07 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
4.563GlnAla: 4.563 ± 0.069
0.235GlnCys: 0.235 ± 0.017
1.787GlnAsp: 1.787 ± 0.038
1.443GlnGlu: 1.443 ± 0.04
1.151GlnPhe: 1.151 ± 0.034
2.854GlnGly: 2.854 ± 0.047
0.688GlnHis: 0.688 ± 0.025
2.22GlnIle: 2.22 ± 0.043
0.918GlnLys: 0.918 ± 0.03
2.82GlnLeu: 2.82 ± 0.049
1.17GlnMet: 1.17 ± 0.031
0.895GlnAsn: 0.895 ± 0.028
1.89GlnPro: 1.89 ± 0.046
1.147GlnGln: 1.147 ± 0.036
2.471GlnArg: 2.471 ± 0.049
1.888GlnSer: 1.888 ± 0.038
2.038GlnThr: 2.038 ± 0.04
2.608GlnVal: 2.608 ± 0.046
0.414GlnTrp: 0.414 ± 0.017
0.618GlnTyr: 0.618 ± 0.022
0.0GlnXaa: 0.0 ± 0.0
Arg
9.742ArgAla: 9.742 ± 0.107
0.545ArgCys: 0.545 ± 0.024
4.746ArgAsp: 4.746 ± 0.069
3.359ArgGlu: 3.359 ± 0.055
2.65ArgPhe: 2.65 ± 0.045
5.121ArgGly: 5.121 ± 0.058
1.615ArgHis: 1.615 ± 0.04
3.801ArgIle: 3.801 ± 0.054
2.039ArgLys: 2.039 ± 0.04
7.698ArgLeu: 7.698 ± 0.102
2.191ArgMet: 2.191 ± 0.043
1.836ArgAsn: 1.836 ± 0.039
3.722ArgPro: 3.722 ± 0.055
2.482ArgGln: 2.482 ± 0.05
5.425ArgArg: 5.425 ± 0.078
3.201ArgSer: 3.201 ± 0.059
3.222ArgThr: 3.222 ± 0.047
5.076ArgVal: 5.076 ± 0.08
1.04ArgTrp: 1.04 ± 0.034
1.556ArgTyr: 1.556 ± 0.038
0.0ArgXaa: 0.0 ± 0.0
Ser
6.176SerAla: 6.176 ± 0.077
0.395SerCys: 0.395 ± 0.021
3.369SerAsp: 3.369 ± 0.053
2.482SerGlu: 2.482 ± 0.048
2.012SerPhe: 2.012 ± 0.048
5.43SerGly: 5.43 ± 0.064
1.066SerHis: 1.066 ± 0.034
2.359SerIle: 2.359 ± 0.048
1.224SerLys: 1.224 ± 0.033
4.731SerLeu: 4.731 ± 0.07
1.376SerMet: 1.376 ± 0.03
1.194SerAsn: 1.194 ± 0.037
2.547SerPro: 2.547 ± 0.045
1.562SerGln: 1.562 ± 0.038
3.232SerArg: 3.232 ± 0.057
2.319SerSer: 2.319 ± 0.044
2.492SerThr: 2.492 ± 0.049
4.021SerVal: 4.021 ± 0.058
0.622SerTrp: 0.622 ± 0.025
1.169SerTyr: 1.169 ± 0.033
0.0SerXaa: 0.0 ± 0.0
Thr
7.058ThrAla: 7.058 ± 0.078
0.465ThrCys: 0.465 ± 0.021
3.408ThrAsp: 3.408 ± 0.057
2.774ThrGlu: 2.774 ± 0.051
1.947ThrPhe: 1.947 ± 0.041
5.736ThrGly: 5.736 ± 0.075
1.198ThrHis: 1.198 ± 0.03
2.546ThrIle: 2.546 ± 0.049
1.253ThrLys: 1.253 ± 0.035
6.421ThrLeu: 6.421 ± 0.074
1.318ThrMet: 1.318 ± 0.035
1.231ThrAsn: 1.231 ± 0.033
3.978ThrPro: 3.978 ± 0.058
1.811ThrGln: 1.811 ± 0.036
3.947ThrArg: 3.947 ± 0.055
2.616ThrSer: 2.616 ± 0.039
3.186ThrThr: 3.186 ± 0.055
4.162ThrVal: 4.162 ± 0.047
0.651ThrTrp: 0.651 ± 0.023
1.217ThrTyr: 1.217 ± 0.034
0.0ThrXaa: 0.0 ± 0.0
Val
9.779ValAla: 9.779 ± 0.109
0.601ValCys: 0.601 ± 0.021
3.977ValAsp: 3.977 ± 0.06
3.88ValGlu: 3.88 ± 0.059
2.89ValPhe: 2.89 ± 0.053
5.37ValGly: 5.37 ± 0.075
1.39ValHis: 1.39 ± 0.033
4.019ValIle: 4.019 ± 0.065
1.675ValLys: 1.675 ± 0.042
7.799ValLeu: 7.799 ± 0.085
2.196ValMet: 2.196 ± 0.044
1.792ValAsn: 1.792 ± 0.041
3.739ValPro: 3.739 ± 0.054
2.397ValGln: 2.397 ± 0.043
4.567ValArg: 4.567 ± 0.068
4.17ValSer: 4.17 ± 0.066
4.822ValThr: 4.822 ± 0.061
5.814ValVal: 5.814 ± 0.081
0.991ValTrp: 0.991 ± 0.031
1.391ValTyr: 1.391 ± 0.036
0.0ValXaa: 0.0 ± 0.0
Trp
1.561TrpAla: 1.561 ± 0.037
0.131TrpCys: 0.131 ± 0.01
0.783TrpAsp: 0.783 ± 0.026
0.599TrpGlu: 0.599 ± 0.022
0.545TrpPhe: 0.545 ± 0.021
0.986TrpGly: 0.986 ± 0.029
0.297TrpHis: 0.297 ± 0.016
0.637TrpIle: 0.637 ± 0.022
0.344TrpLys: 0.344 ± 0.017
1.697TrpLeu: 1.697 ± 0.041
0.399TrpMet: 0.399 ± 0.021
0.408TrpAsn: 0.408 ± 0.021
0.734TrpPro: 0.734 ± 0.026
0.688TrpGln: 0.688 ± 0.027
1.182TrpArg: 1.182 ± 0.032
0.737TrpSer: 0.737 ± 0.026
0.831TrpThr: 0.831 ± 0.029
0.935TrpVal: 0.935 ± 0.031
0.229TrpTrp: 0.229 ± 0.014
0.248TrpTyr: 0.248 ± 0.013
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.582TyrAla: 2.582 ± 0.044
0.224TyrCys: 0.224 ± 0.013
1.445TyrAsp: 1.445 ± 0.035
1.02TyrGlu: 1.02 ± 0.031
0.813TyrPhe: 0.813 ± 0.027
1.948TyrGly: 1.948 ± 0.044
0.46TyrHis: 0.46 ± 0.02
0.871TyrIle: 0.871 ± 0.029
0.459TyrLys: 0.459 ± 0.02
2.214TyrLeu: 2.214 ± 0.052
0.468TyrMet: 0.468 ± 0.02
0.544TyrAsn: 0.544 ± 0.02
1.014TyrPro: 1.014 ± 0.031
0.706TyrGln: 0.706 ± 0.027
1.544TyrArg: 1.544 ± 0.042
1.048TyrSer: 1.048 ± 0.03
1.131TyrThr: 1.131 ± 0.029
1.382TyrVal: 1.382 ± 0.037
0.344TyrTrp: 0.344 ± 0.016
0.468TyrTyr: 0.468 ± 0.02
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.001XaaAsp: 0.001 ± 0.001
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.001XaaHis: 0.001 ± 0.001
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.001XaaTrp: 0.001 ± 0.001
0.0XaaTyr: 0.0 ± 0.0
0.006XaaXaa: 0.006 ± 0.004
Statistics based on 4002 proteins (1234146 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski