Amino acid dipeptide frequency for Oryzomicrobium terrae

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
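As an illustration of what such a calculation could look like, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein and pooled over the whole proteome, and that any non-standard one-letter code is mapped to "X" (Xaa). The function name `dipeptide_frequencies` and the toy sequences are hypothetical; the page itself does not describe the exact counting procedure.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are pooled over all sequences before normalising.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs: positions (0,1), (1,2), ...; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those that were never observed.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Toy input (not real proteome data): pairs are MA, AA, AL, LK and AA, AG.
toy = ["MAALK", "AAG"]
freqs = dipeptide_frequencies(toy)
```

In this toy input "AA" accounts for two of the six pairs, so its frequency is one third of the total, expressed in per mille.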

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
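The uniform expectation and the over/under-representation rule can be sketched in a few lines of Python. This assumes the comparison threshold is simply the uniform value of 1/400; the page does not state whether any tolerance is applied, and the `classify` helper is hypothetical. The observed values are copied from the table for this proteome.

```python
# Uniform expectation: every ordered pair of the 20 standard residues equally likely.
N_STANDARD = 20
expected_permille = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25%

# A few observed values from the table (per mille).
observed = {"AlaAla": 19.545, "LeuLeu": 12.077, "CysCys": 0.131, "TrpTrp": 0.267}


def classify(value, expected=expected_permille):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if value > expected:
        return "over"   # would be shown in red
    if value < expected:
        return "under"  # would be shown in blue
    return "as expected"


labels = {dipeptide: classify(value) for dipeptide, value in observed.items()}
```

With these inputs AlaAla and LeuLeu fall above the 2.5‰ threshold, while CysCys and TrpTrp fall below it.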

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; for example, 19.545‰ corresponds to 0.019545 (about 1.95%).

For more information, see the sequence space article on Wikipedia.

Residue order (group heading = first residue, entry = second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 19.545 ± 0.244
AlaCys: 1.264 ± 0.04
AlaAsp: 7.093 ± 0.102
AlaGlu: 7.789 ± 0.102
AlaPhe: 4.047 ± 0.069
AlaGly: 11.269 ± 0.134
AlaHis: 2.432 ± 0.053
AlaIle: 5.264 ± 0.075
AlaLys: 4.506 ± 0.103
AlaLeu: 15.189 ± 0.163
AlaMet: 2.967 ± 0.062
AlaAsn: 3.147 ± 0.064
AlaPro: 6.979 ± 0.109
AlaGln: 4.983 ± 0.079
AlaArg: 9.036 ± 0.126
AlaSer: 6.179 ± 0.093
AlaThr: 6.516 ± 0.085
AlaVal: 9.303 ± 0.103
AlaTrp: 1.824 ± 0.047
AlaTyr: 2.75 ± 0.058
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.07 ± 0.037
CysCys: 0.131 ± 0.014
CysAsp: 0.5 ± 0.021
CysGlu: 0.396 ± 0.022
CysPhe: 0.338 ± 0.018
CysGly: 0.936 ± 0.032
CysHis: 0.325 ± 0.021
CysIle: 0.359 ± 0.019
CysLys: 0.223 ± 0.013
CysLeu: 0.923 ± 0.027
CysMet: 0.167 ± 0.011
CysAsn: 0.231 ± 0.015
CysPro: 0.56 ± 0.027
CysGln: 0.286 ± 0.016
CysArg: 0.657 ± 0.026
CysSer: 0.451 ± 0.026
CysThr: 0.418 ± 0.02
CysVal: 0.574 ± 0.02
CysTrp: 0.107 ± 0.01
CysTyr: 0.216 ± 0.016
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.494 ± 0.08
AspCys: 0.463 ± 0.023
AspAsp: 2.89 ± 0.065
AspGlu: 3.307 ± 0.064
AspPhe: 2.169 ± 0.048
AspGly: 4.748 ± 0.077
AspHis: 1.068 ± 0.034
AspIle: 2.408 ± 0.051
AspLys: 1.828 ± 0.048
AspLeu: 5.824 ± 0.094
AspMet: 1.023 ± 0.03
AspAsn: 1.242 ± 0.035
AspPro: 3.065 ± 0.061
AspGln: 1.742 ± 0.044
AspArg: 3.296 ± 0.07
AspSer: 2.31 ± 0.053
AspThr: 2.616 ± 0.054
AspVal: 3.725 ± 0.07
AspTrp: 0.869 ± 0.027
AspTyr: 1.566 ± 0.036
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.816 ± 0.1
GluCys: 0.381 ± 0.02
GluAsp: 2.236 ± 0.043
GluGlu: 3.096 ± 0.063
GluPhe: 1.788 ± 0.033
GluGly: 4.078 ± 0.066
GluHis: 1.159 ± 0.031
GluIle: 2.922 ± 0.048
GluLys: 2.362 ± 0.058
GluLeu: 5.794 ± 0.084
GluMet: 1.276 ± 0.039
GluAsn: 1.488 ± 0.035
GluPro: 2.213 ± 0.049
GluGln: 2.43 ± 0.056
GluArg: 4.768 ± 0.074
GluSer: 2.53 ± 0.054
GluThr: 2.899 ± 0.058
GluVal: 4.387 ± 0.073
GluTrp: 0.744 ± 0.027
GluTyr: 1.067 ± 0.033
GluXaa: 0.0 ± 0.0

Phe
PheAla: 4.657 ± 0.076
PheCys: 0.401 ± 0.021
PheAsp: 2.264 ± 0.05
PheGlu: 1.809 ± 0.045
PhePhe: 1.496 ± 0.051
PheGly: 3.133 ± 0.064
PheHis: 0.765 ± 0.027
PheIle: 1.605 ± 0.044
PheLys: 1.151 ± 0.034
PheLeu: 3.457 ± 0.058
PheMet: 0.668 ± 0.024
PheAsn: 1.05 ± 0.034
PhePro: 1.715 ± 0.037
PheGln: 1.025 ± 0.032
PheArg: 1.994 ± 0.042
PheSer: 2.158 ± 0.049
PheThr: 1.881 ± 0.043
PheVal: 2.737 ± 0.056
PheTrp: 0.51 ± 0.026
PheTyr: 0.934 ± 0.034
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 9.191 ± 0.119
GlyCys: 0.916 ± 0.029
GlyAsp: 4.315 ± 0.081
GlyGlu: 5.086 ± 0.072
GlyPhe: 3.302 ± 0.057
GlyGly: 7.115 ± 0.115
GlyHis: 1.881 ± 0.044
GlyIle: 4.087 ± 0.071
GlyLys: 3.578 ± 0.077
GlyLeu: 9.505 ± 0.123
GlyMet: 2.113 ± 0.043
GlyAsn: 2.29 ± 0.058
GlyPro: 3.025 ± 0.055
GlyGln: 3.309 ± 0.061
GlyArg: 5.881 ± 0.084
GlySer: 4.34 ± 0.076
GlyThr: 4.244 ± 0.081
GlyVal: 6.437 ± 0.082
GlyTrp: 1.326 ± 0.039
GlyTyr: 2.292 ± 0.044
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.391 ± 0.049
HisCys: 0.249 ± 0.016
HisAsp: 1.166 ± 0.035
HisGlu: 0.937 ± 0.035
HisPhe: 0.876 ± 0.032
HisGly: 2.092 ± 0.048
HisHis: 0.67 ± 0.032
HisIle: 0.86 ± 0.032
HisLys: 0.562 ± 0.028
HisLeu: 2.444 ± 0.052
HisMet: 0.404 ± 0.018
HisAsn: 0.539 ± 0.024
HisPro: 1.596 ± 0.045
HisGln: 0.795 ± 0.027
HisArg: 1.494 ± 0.041
HisSer: 0.966 ± 0.032
HisThr: 0.941 ± 0.031
HisVal: 1.263 ± 0.038
HisTrp: 0.315 ± 0.02
HisTyr: 0.666 ± 0.029
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.118 ± 0.081
IleCys: 0.36 ± 0.02
IleAsp: 2.831 ± 0.054
IleGlu: 2.669 ± 0.055
IlePhe: 1.417 ± 0.039
IleGly: 4.032 ± 0.064
IleHis: 0.896 ± 0.028
IleIle: 1.633 ± 0.041
IleLys: 1.648 ± 0.047
IleLeu: 3.973 ± 0.075
IleMet: 0.72 ± 0.029
IleAsn: 1.45 ± 0.04
IlePro: 2.252 ± 0.048
IleGln: 1.319 ± 0.036
IleArg: 2.764 ± 0.058
IleSer: 2.408 ± 0.055
IleThr: 2.408 ± 0.056
IleVal: 3.355 ± 0.064
IleTrp: 0.396 ± 0.02
IleTyr: 0.956 ± 0.03
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.167 ± 0.093
LysCys: 0.159 ± 0.013
LysAsp: 1.688 ± 0.045
LysGlu: 1.937 ± 0.056
LysPhe: 0.909 ± 0.031
LysGly: 2.621 ± 0.057
LysHis: 0.637 ± 0.027
LysIle: 1.617 ± 0.05
LysLys: 1.576 ± 0.055
LysLeu: 3.501 ± 0.064
LysMet: 0.791 ± 0.03
LysAsn: 1.034 ± 0.033
LysPro: 2.048 ± 0.058
LysGln: 1.237 ± 0.036
LysArg: 2.397 ± 0.052
LysSer: 1.83 ± 0.044
LysThr: 1.903 ± 0.043
LysVal: 2.822 ± 0.063
LysTrp: 0.324 ± 0.018
LysTyr: 0.706 ± 0.031
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 17.401 ± 0.17
LeuCys: 0.998 ± 0.03
LeuAsp: 6.103 ± 0.09
LeuGlu: 5.846 ± 0.081
LeuPhe: 3.863 ± 0.079
LeuGly: 8.931 ± 0.127
LeuHis: 2.286 ± 0.051
LeuIle: 4.61 ± 0.079
LeuLys: 3.928 ± 0.071
LeuLeu: 12.077 ± 0.187
LeuMet: 2.161 ± 0.043
LeuAsn: 2.845 ± 0.064
LeuPro: 6.886 ± 0.086
LeuGln: 3.423 ± 0.059
LeuArg: 7.164 ± 0.097
LeuSer: 5.932 ± 0.083
LeuThr: 6.095 ± 0.08
LeuVal: 8.18 ± 0.097
LeuTrp: 1.333 ± 0.042
LeuTyr: 2.159 ± 0.049
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.788 ± 0.05
MetCys: 0.133 ± 0.01
MetAsp: 0.991 ± 0.029
MetGlu: 1.027 ± 0.032
MetPhe: 0.611 ± 0.026
MetGly: 1.585 ± 0.039
MetHis: 0.429 ± 0.019
MetIle: 0.895 ± 0.028
MetLys: 0.922 ± 0.033
MetLeu: 2.137 ± 0.047
MetMet: 0.43 ± 0.024
MetAsn: 0.697 ± 0.027
MetPro: 1.315 ± 0.039
MetGln: 0.732 ± 0.026
MetArg: 1.264 ± 0.045
MetSer: 1.41 ± 0.042
MetThr: 1.401 ± 0.042
MetVal: 1.57 ± 0.044
MetTrp: 0.163 ± 0.012
MetTyr: 0.326 ± 0.018
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.17 ± 0.058
AsnCys: 0.267 ± 0.015
AsnAsp: 1.39 ± 0.04
AsnGlu: 1.164 ± 0.033
AsnPhe: 0.955 ± 0.028
AsnGly: 2.358 ± 0.059
AsnHis: 0.508 ± 0.024
AsnIle: 1.173 ± 0.037
AsnLys: 0.841 ± 0.031
AsnLeu: 3.008 ± 0.058
AsnMet: 0.487 ± 0.02
AsnAsn: 0.761 ± 0.036
AsnPro: 1.982 ± 0.045
AsnGln: 0.958 ± 0.033
AsnArg: 1.913 ± 0.044
AsnSer: 1.177 ± 0.042
AsnThr: 1.368 ± 0.039
AsnVal: 1.772 ± 0.045
AsnTrp: 0.36 ± 0.02
AsnTyr: 0.643 ± 0.029
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 7.939 ± 0.127
ProCys: 0.409 ± 0.021
ProAsp: 3.257 ± 0.062
ProGlu: 3.658 ± 0.068
ProPhe: 1.856 ± 0.042
ProGly: 4.949 ± 0.08
ProHis: 1.094 ± 0.037
ProIle: 1.949 ± 0.047
ProLys: 1.49 ± 0.045
ProLeu: 6.002 ± 0.087
ProMet: 1.023 ± 0.031
ProAsn: 1.344 ± 0.036
ProPro: 3.2 ± 0.085
ProGln: 1.829 ± 0.043
ProArg: 3.318 ± 0.074
ProSer: 2.69 ± 0.054
ProThr: 2.558 ± 0.055
ProVal: 4.437 ± 0.068
ProTrp: 0.74 ± 0.028
ProTyr: 1.209 ± 0.034
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 5.204 ± 0.09
GlnCys: 0.234 ± 0.015
GlnAsp: 1.574 ± 0.039
GlnGlu: 1.905 ± 0.044
GlnPhe: 1.097 ± 0.033
GlnGly: 3.058 ± 0.05
GlnHis: 0.768 ± 0.028
GlnIle: 1.584 ± 0.043
GlnLys: 1.15 ± 0.036
GlnLeu: 3.842 ± 0.068
GlnMet: 0.805 ± 0.03
GlnAsn: 0.829 ± 0.029
GlnPro: 1.976 ± 0.038
GlnGln: 1.568 ± 0.049
GlnArg: 2.767 ± 0.056
GlnSer: 1.767 ± 0.05
GlnThr: 1.74 ± 0.042
GlnVal: 3.132 ± 0.062
GlnTrp: 0.483 ± 0.024
GlnTyr: 0.719 ± 0.03
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 7.299 ± 0.107
ArgCys: 0.543 ± 0.023
ArgAsp: 3.764 ± 0.063
ArgGlu: 4.165 ± 0.07
ArgPhe: 2.908 ± 0.061
ArgGly: 4.688 ± 0.082
ArgHis: 2.039 ± 0.049
ArgIle: 3.558 ± 0.06
ArgLys: 1.769 ± 0.042
ArgLeu: 8.943 ± 0.135
ArgMet: 1.481 ± 0.034
ArgAsn: 1.854 ± 0.04
ArgPro: 3.498 ± 0.077
ArgGln: 3.26 ± 0.07
ArgArg: 5.419 ± 0.094
ArgSer: 3.034 ± 0.051
ArgThr: 2.898 ± 0.058
ArgVal: 4.637 ± 0.067
ArgTrp: 1.036 ± 0.028
ArgTyr: 2.007 ± 0.046
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.296 ± 0.09
SerCys: 0.441 ± 0.021
SerAsp: 2.57 ± 0.053
SerGlu: 2.439 ± 0.053
SerPhe: 1.754 ± 0.038
SerGly: 5.108 ± 0.081
SerHis: 1.126 ± 0.032
SerIle: 2.209 ± 0.048
SerLys: 1.392 ± 0.037
SerLeu: 5.92 ± 0.086
SerMet: 1.001 ± 0.034
SerAsn: 1.338 ± 0.048
SerPro: 3.108 ± 0.062
SerGln: 1.688 ± 0.04
SerArg: 3.545 ± 0.057
SerSer: 2.824 ± 0.076
SerThr: 2.581 ± 0.059
SerVal: 3.44 ± 0.062
SerTrp: 0.592 ± 0.023
SerTyr: 1.124 ± 0.036
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.177 ± 0.081
ThrCys: 0.401 ± 0.02
ThrAsp: 2.333 ± 0.051
ThrGlu: 2.124 ± 0.045
ThrPhe: 1.787 ± 0.046
ThrGly: 4.654 ± 0.076
ThrHis: 0.989 ± 0.029
ThrIle: 2.087 ± 0.052
ThrLys: 1.298 ± 0.038
ThrLeu: 6.892 ± 0.093
ThrMet: 0.879 ± 0.032
ThrAsn: 1.202 ± 0.039
ThrPro: 3.739 ± 0.055
ThrGln: 1.687 ± 0.044
ThrArg: 3.311 ± 0.059
ThrSer: 2.488 ± 0.062
ThrThr: 2.778 ± 0.075
ThrVal: 4.461 ± 0.07
ThrTrp: 0.665 ± 0.026
ThrTyr: 1.151 ± 0.043
ThrXaa: 0.0 ± 0.0

Val
ValAla: 10.516 ± 0.11
ValCys: 0.711 ± 0.024
ValAsp: 3.912 ± 0.063
ValGlu: 4.454 ± 0.07
ValPhe: 2.769 ± 0.059
ValGly: 5.846 ± 0.079
ValHis: 1.323 ± 0.037
ValIle: 3.467 ± 0.071
ValLys: 2.532 ± 0.063
ValLeu: 8.112 ± 0.101
ValMet: 1.713 ± 0.047
ValAsn: 1.923 ± 0.044
ValPro: 3.908 ± 0.073
ValGln: 2.244 ± 0.047
ValArg: 4.794 ± 0.068
ValSer: 4.164 ± 0.076
ValThr: 4.14 ± 0.076
ValVal: 6.732 ± 0.096
ValTrp: 0.931 ± 0.03
ValTyr: 1.394 ± 0.036
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.166 ± 0.036
TrpCys: 0.131 ± 0.011
TrpAsp: 0.555 ± 0.022
TrpGlu: 0.617 ± 0.026
TrpPhe: 0.513 ± 0.023
TrpGly: 0.942 ± 0.034
TrpHis: 0.337 ± 0.019
TrpIle: 0.57 ± 0.023
TrpLys: 0.473 ± 0.021
TrpLeu: 2.102 ± 0.055
TrpMet: 0.311 ± 0.016
TrpAsn: 0.395 ± 0.023
TrpPro: 0.626 ± 0.025
TrpGln: 0.763 ± 0.033
TrpArg: 1.123 ± 0.036
TrpSer: 0.657 ± 0.028
TrpThr: 0.557 ± 0.026
TrpVal: 0.963 ± 0.035
TrpTrp: 0.267 ± 0.016
TrpTyr: 0.272 ± 0.016
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.68 ± 0.055
TyrCys: 0.274 ± 0.017
TyrAsp: 1.221 ± 0.034
TyrGlu: 1.008 ± 0.035
TyrPhe: 0.955 ± 0.029
TyrGly: 2.089 ± 0.052
TyrHis: 0.495 ± 0.023
TyrIle: 0.746 ± 0.028
TyrLys: 0.675 ± 0.027
TyrLeu: 2.575 ± 0.057
TyrMet: 0.407 ± 0.019
TyrAsn: 0.595 ± 0.026
TyrPro: 1.258 ± 0.034
TyrGln: 0.927 ± 0.035
TyrArg: 1.895 ± 0.038
TyrSer: 1.182 ± 0.038
TyrThr: 1.159 ± 0.034
TyrVal: 1.658 ± 0.04
TyrTrp: 0.357 ± 0.02
TyrTyr: 0.652 ± 0.026
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3119 proteins (1042354 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
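To make the note above concrete, the following Python sketch shows one way a protein-level bootstrap could be set up: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. It assumes that the reported error is the standard deviation across replicates, which the page does not state; the helper names `pair_counts` and `bootstrap_error`, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Resampling unit is the whole protein; using the standard deviation as the
    "error" is an assumption of this sketch.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (not real proteome data).
proteins = ["MAALKAA", "GAAG", "MKLV"]
mean_aa, err_aa = bootstrap_error(proteins, "AA")
```

Resampling whole proteins rather than individual residue pairs keeps the pairs within a protein together, so the estimate reflects protein-to-protein variation in composition.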

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski