Amino acid dipepetide frequency for Leucobacter luti

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
21.31AlaAla: 21.31 ± 0.204
0.747AlaCys: 0.747 ± 0.028
7.32AlaAsp: 7.32 ± 0.091
9.154AlaGlu: 9.154 ± 0.112
3.771AlaPhe: 3.771 ± 0.059
12.317AlaGly: 12.317 ± 0.124
2.695AlaHis: 2.695 ± 0.068
6.106AlaIle: 6.106 ± 0.071
2.837AlaLys: 2.837 ± 0.078
14.324AlaLeu: 14.324 ± 0.162
2.603AlaMet: 2.603 ± 0.056
2.557AlaAsn: 2.557 ± 0.055
6.867AlaPro: 6.867 ± 0.114
4.69AlaGln: 4.69 ± 0.073
8.423AlaArg: 8.423 ± 0.108
7.322AlaSer: 7.322 ± 0.09
7.524AlaThr: 7.524 ± 0.09
10.61AlaVal: 10.61 ± 0.103
1.703AlaTrp: 1.703 ± 0.039
2.279AlaTyr: 2.279 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.829CysAla: 0.829 ± 0.031
0.06CysCys: 0.06 ± 0.008
0.357CysAsp: 0.357 ± 0.019
0.327CysGlu: 0.327 ± 0.017
0.172CysPhe: 0.172 ± 0.012
0.645CysGly: 0.645 ± 0.026
0.114CysHis: 0.114 ± 0.01
0.19CysIle: 0.19 ± 0.013
0.069CysLys: 0.069 ± 0.008
0.432CysLeu: 0.432 ± 0.022
0.097CysMet: 0.097 ± 0.008
0.112CysAsn: 0.112 ± 0.012
0.272CysPro: 0.272 ± 0.019
0.13CysGln: 0.13 ± 0.01
0.303CysArg: 0.303 ± 0.019
0.367CysSer: 0.367 ± 0.017
0.414CysThr: 0.414 ± 0.019
0.48CysVal: 0.48 ± 0.022
0.069CysTrp: 0.069 ± 0.009
0.116CysTyr: 0.116 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
7.595AspAla: 7.595 ± 0.095
0.262AspCys: 0.262 ± 0.015
2.951AspAsp: 2.951 ± 0.072
3.755AspGlu: 3.755 ± 0.073
1.727AspPhe: 1.727 ± 0.04
5.084AspGly: 5.084 ± 0.09
1.115AspHis: 1.115 ± 0.029
2.126AspIle: 2.126 ± 0.041
0.815AspLys: 0.815 ± 0.034
5.402AspLeu: 5.402 ± 0.069
0.812AspMet: 0.812 ± 0.031
0.854AspAsn: 0.854 ± 0.031
4.508AspPro: 4.508 ± 0.113
1.621AspGln: 1.621 ± 0.043
4.211AspArg: 4.211 ± 0.08
2.817AspSer: 2.817 ± 0.058
3.235AspThr: 3.235 ± 0.068
4.315AspVal: 4.315 ± 0.062
0.815AspTrp: 0.815 ± 0.031
1.158AspTyr: 1.158 ± 0.032
0.0AspXaa: 0.0 ± 0.0
Glu
7.386GluAla: 7.386 ± 0.087
0.297GluCys: 0.297 ± 0.017
2.627GluAsp: 2.627 ± 0.059
3.075GluGlu: 3.075 ± 0.062
2.086GluPhe: 2.086 ± 0.043
3.736GluGly: 3.736 ± 0.068
1.689GluHis: 1.689 ± 0.042
3.055GluIle: 3.055 ± 0.058
1.284GluLys: 1.284 ± 0.04
7.795GluLeu: 7.795 ± 0.098
1.141GluMet: 1.141 ± 0.034
1.322GluAsn: 1.322 ± 0.038
2.998GluPro: 2.998 ± 0.058
2.72GluGln: 2.72 ± 0.049
5.521GluArg: 5.521 ± 0.102
3.314GluSer: 3.314 ± 0.055
3.39GluThr: 3.39 ± 0.06
4.491GluVal: 4.491 ± 0.074
0.909GluTrp: 0.909 ± 0.028
1.234GluTyr: 1.234 ± 0.03
0.0GluXaa: 0.0 ± 0.0
Phe
4.381PheAla: 4.381 ± 0.066
0.19PheCys: 0.19 ± 0.015
2.078PheAsp: 2.078 ± 0.044
1.88PheGlu: 1.88 ± 0.041
1.099PhePhe: 1.099 ± 0.038
3.458PheGly: 3.458 ± 0.057
0.499PheHis: 0.499 ± 0.024
1.353PheIle: 1.353 ± 0.038
0.464PheLys: 0.464 ± 0.019
2.746PheLeu: 2.746 ± 0.049
0.51PheMet: 0.51 ± 0.023
0.66PheAsn: 0.66 ± 0.027
1.609PhePro: 1.609 ± 0.038
0.833PheGln: 0.833 ± 0.032
1.718PheArg: 1.718 ± 0.036
1.978PheSer: 1.978 ± 0.041
2.414PheThr: 2.414 ± 0.049
2.678PheVal: 2.678 ± 0.054
0.49PheTrp: 0.49 ± 0.02
0.58PheTyr: 0.58 ± 0.025
0.0PheXaa: 0.0 ± 0.0
Gly
11.314GlyAla: 11.314 ± 0.123
0.603GlyCys: 0.603 ± 0.022
4.584GlyAsp: 4.584 ± 0.066
5.196GlyGlu: 5.196 ± 0.069
3.231GlyPhe: 3.231 ± 0.058
7.731GlyGly: 7.731 ± 0.142
1.665GlyHis: 1.665 ± 0.037
5.123GlyIle: 5.123 ± 0.075
2.301GlyLys: 2.301 ± 0.055
8.704GlyLeu: 8.704 ± 0.092
1.893GlyMet: 1.893 ± 0.047
1.913GlyAsn: 1.913 ± 0.051
3.709GlyPro: 3.709 ± 0.065
2.521GlyGln: 2.521 ± 0.051
5.527GlyArg: 5.527 ± 0.085
5.861GlySer: 5.861 ± 0.08
5.987GlyThr: 5.987 ± 0.126
7.795GlyVal: 7.795 ± 0.089
1.527GlyTrp: 1.527 ± 0.043
2.144GlyTyr: 2.144 ± 0.044
0.0GlyXaa: 0.0 ± 0.0
His
2.592HisAla: 2.592 ± 0.052
0.121HisCys: 0.121 ± 0.009
1.19HisAsp: 1.19 ± 0.033
1.297HisGlu: 1.297 ± 0.033
0.582HisPhe: 0.582 ± 0.019
1.927HisGly: 1.927 ± 0.046
0.514HisHis: 0.514 ± 0.023
0.762HisIle: 0.762 ± 0.027
0.284HisLys: 0.284 ± 0.016
2.018HisLeu: 2.018 ± 0.048
0.352HisMet: 0.352 ± 0.018
0.376HisAsn: 0.376 ± 0.02
1.448HisPro: 1.448 ± 0.039
0.59HisGln: 0.59 ± 0.024
1.554HisArg: 1.554 ± 0.035
1.146HisSer: 1.146 ± 0.032
1.292HisThr: 1.292 ± 0.034
1.465HisVal: 1.465 ± 0.038
0.294HisTrp: 0.294 ± 0.015
0.409HisTyr: 0.409 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
7.541IleAla: 7.541 ± 0.085
0.3IleCys: 0.3 ± 0.017
3.233IleAsp: 3.233 ± 0.055
3.094IleGlu: 3.094 ± 0.058
1.209IlePhe: 1.209 ± 0.038
4.759IleGly: 4.759 ± 0.078
0.701IleHis: 0.701 ± 0.026
2.034IleIle: 2.034 ± 0.054
0.743IleLys: 0.743 ± 0.029
3.987IleLeu: 3.987 ± 0.073
0.702IleMet: 0.702 ± 0.028
0.984IleAsn: 0.984 ± 0.03
2.674IlePro: 2.674 ± 0.046
1.058IleGln: 1.058 ± 0.031
2.739IleArg: 2.739 ± 0.055
2.833IleSer: 2.833 ± 0.049
3.224IleThr: 3.224 ± 0.063
4.536IleVal: 4.536 ± 0.071
0.532IleTrp: 0.532 ± 0.023
0.706IleTyr: 0.706 ± 0.026
0.0IleXaa: 0.0 ± 0.0
Lys
2.082LysAla: 2.082 ± 0.057
0.089LysCys: 0.089 ± 0.009
1.063LysAsp: 1.063 ± 0.036
0.893LysGlu: 0.893 ± 0.033
0.596LysPhe: 0.596 ± 0.024
1.297LysGly: 1.297 ± 0.045
0.479LysHis: 0.479 ± 0.022
1.115LysIle: 1.115 ± 0.039
0.762LysLys: 0.762 ± 0.038
2.195LysLeu: 2.195 ± 0.053
0.407LysMet: 0.407 ± 0.021
0.589LysAsn: 0.589 ± 0.026
1.283LysPro: 1.283 ± 0.042
0.819LysGln: 0.819 ± 0.034
1.739LysArg: 1.739 ± 0.043
1.281LysSer: 1.281 ± 0.042
1.343LysThr: 1.343 ± 0.039
1.468LysVal: 1.468 ± 0.042
0.251LysTrp: 0.251 ± 0.016
0.473LysTyr: 0.473 ± 0.022
0.0LysXaa: 0.0 ± 0.0
Leu
14.632LeuAla: 14.632 ± 0.148
0.554LeuCys: 0.554 ± 0.021
6.021LeuAsp: 6.021 ± 0.077
5.107LeuGlu: 5.107 ± 0.073
2.92LeuPhe: 2.92 ± 0.065
9.697LeuGly: 9.697 ± 0.115
1.866LeuHis: 1.866 ± 0.042
4.998LeuIle: 4.998 ± 0.085
1.816LeuLys: 1.816 ± 0.049
10.455LeuLeu: 10.455 ± 0.158
1.667LeuMet: 1.667 ± 0.044
2.1LeuAsn: 2.1 ± 0.047
5.489LeuPro: 5.489 ± 0.075
2.7LeuGln: 2.7 ± 0.045
7.387LeuArg: 7.387 ± 0.108
6.516LeuSer: 6.516 ± 0.087
6.81LeuThr: 6.81 ± 0.075
8.436LeuVal: 8.436 ± 0.111
1.205LeuTrp: 1.205 ± 0.029
1.531LeuTyr: 1.531 ± 0.036
0.0LeuXaa: 0.0 ± 0.0
Met
1.872MetAla: 1.872 ± 0.043
0.107MetCys: 0.107 ± 0.01
0.744MetAsp: 0.744 ± 0.026
0.67MetGlu: 0.67 ± 0.025
0.551MetPhe: 0.551 ± 0.024
1.363MetGly: 1.363 ± 0.036
0.4MetHis: 0.4 ± 0.017
0.941MetIle: 0.941 ± 0.031
0.432MetLys: 0.432 ± 0.02
2.038MetLeu: 2.038 ± 0.049
0.318MetMet: 0.318 ± 0.018
0.584MetAsn: 0.584 ± 0.024
1.029MetPro: 1.029 ± 0.035
0.644MetGln: 0.644 ± 0.026
1.375MetArg: 1.375 ± 0.038
1.591MetSer: 1.591 ± 0.036
1.606MetThr: 1.606 ± 0.033
1.278MetVal: 1.278 ± 0.038
0.203MetTrp: 0.203 ± 0.013
0.326MetTyr: 0.326 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
2.714AsnAla: 2.714 ± 0.049
0.135AsnCys: 0.135 ± 0.012
1.1AsnAsp: 1.1 ± 0.033
1.138AsnGlu: 1.138 ± 0.033
0.688AsnPhe: 0.688 ± 0.022
1.992AsnGly: 1.992 ± 0.043
0.392AsnHis: 0.392 ± 0.019
0.917AsnIle: 0.917 ± 0.029
0.389AsnLys: 0.389 ± 0.021
1.957AsnLeu: 1.957 ± 0.041
0.397AsnMet: 0.397 ± 0.019
0.506AsnAsn: 0.506 ± 0.022
1.647AsnPro: 1.647 ± 0.042
0.632AsnGln: 0.632 ± 0.022
1.438AsnArg: 1.438 ± 0.039
1.227AsnSer: 1.227 ± 0.04
1.396AsnThr: 1.396 ± 0.048
1.737AsnVal: 1.737 ± 0.047
0.367AsnTrp: 0.367 ± 0.02
0.508AsnTyr: 0.508 ± 0.024
0.0AsnXaa: 0.0 ± 0.0
Pro
7.353ProAla: 7.353 ± 0.105
0.192ProCys: 0.192 ± 0.012
3.399ProAsp: 3.399 ± 0.061
4.576ProGlu: 4.576 ± 0.073
1.645ProPhe: 1.645 ± 0.033
5.443ProGly: 5.443 ± 0.127
1.214ProHis: 1.214 ± 0.035
2.302ProIle: 2.302 ± 0.047
1.202ProLys: 1.202 ± 0.042
4.839ProLeu: 4.839 ± 0.081
0.877ProMet: 0.877 ± 0.028
1.245ProAsn: 1.245 ± 0.037
2.203ProPro: 2.203 ± 0.062
1.758ProGln: 1.758 ± 0.037
3.214ProArg: 3.214 ± 0.056
3.159ProSer: 3.159 ± 0.064
3.261ProThr: 3.261 ± 0.066
4.774ProVal: 4.774 ± 0.071
0.762ProTrp: 0.762 ± 0.025
0.977ProTyr: 0.977 ± 0.032
0.0ProXaa: 0.0 ± 0.0
Gln
3.77GlnAla: 3.77 ± 0.061
0.149GlnCys: 0.149 ± 0.013
1.381GlnAsp: 1.381 ± 0.037
1.633GlnGlu: 1.633 ± 0.046
0.951GlnPhe: 0.951 ± 0.028
2.237GlnGly: 2.237 ± 0.048
0.817GlnHis: 0.817 ± 0.025
1.666GlnIle: 1.666 ± 0.038
0.665GlnLys: 0.665 ± 0.029
3.879GlnLeu: 3.879 ± 0.059
0.617GlnMet: 0.617 ± 0.024
0.769GlnAsn: 0.769 ± 0.028
1.749GlnPro: 1.749 ± 0.047
1.478GlnGln: 1.478 ± 0.039
2.806GlnArg: 2.806 ± 0.057
1.754GlnSer: 1.754 ± 0.042
1.592GlnThr: 1.592 ± 0.037
2.282GlnVal: 2.282 ± 0.044
0.438GlnTrp: 0.438 ± 0.021
0.637GlnTyr: 0.637 ± 0.023
0.0GlnXaa: 0.0 ± 0.0
Arg
8.94ArgAla: 8.94 ± 0.116
0.314ArgCys: 0.314 ± 0.018
3.921ArgAsp: 3.921 ± 0.071
4.762ArgGlu: 4.762 ± 0.081
2.398ArgPhe: 2.398 ± 0.048
5.492ArgGly: 5.492 ± 0.088
1.385ArgHis: 1.385 ± 0.041
3.722ArgIle: 3.722 ± 0.065
1.517ArgLys: 1.517 ± 0.037
6.526ArgLeu: 6.526 ± 0.096
1.493ArgMet: 1.493 ± 0.042
1.438ArgAsn: 1.438 ± 0.034
3.259ArgPro: 3.259 ± 0.063
1.953ArgGln: 1.953 ± 0.048
5.41ArgArg: 5.41 ± 0.09
4.169ArgSer: 4.169 ± 0.072
4.288ArgThr: 4.288 ± 0.066
5.955ArgVal: 5.955 ± 0.085
1.057ArgTrp: 1.057 ± 0.029
1.446ArgTyr: 1.446 ± 0.035
0.0ArgXaa: 0.0 ± 0.0
Ser
7.772SerAla: 7.772 ± 0.111
0.378SerCys: 0.378 ± 0.019
3.369SerAsp: 3.369 ± 0.058
3.721SerGlu: 3.721 ± 0.067
2.111SerPhe: 2.111 ± 0.05
6.651SerGly: 6.651 ± 0.102
1.141SerHis: 1.141 ± 0.031
2.621SerIle: 2.621 ± 0.05
1.254SerLys: 1.254 ± 0.034
5.383SerLeu: 5.383 ± 0.08
1.181SerMet: 1.181 ± 0.033
1.293SerAsn: 1.293 ± 0.036
3.157SerPro: 3.157 ± 0.064
1.683SerGln: 1.683 ± 0.044
3.909SerArg: 3.909 ± 0.064
3.668SerSer: 3.668 ± 0.071
3.652SerThr: 3.652 ± 0.065
4.724SerVal: 4.724 ± 0.064
0.982SerTrp: 0.982 ± 0.029
1.293SerTyr: 1.293 ± 0.039
0.0SerXaa: 0.0 ± 0.0
Thr
8.334ThrAla: 8.334 ± 0.103
0.283ThrCys: 0.283 ± 0.015
3.424ThrAsp: 3.424 ± 0.092
3.639ThrGlu: 3.639 ± 0.065
1.778ThrPhe: 1.778 ± 0.04
6.217ThrGly: 6.217 ± 0.093
1.309ThrHis: 1.309 ± 0.035
3.014ThrIle: 3.014 ± 0.057
1.242ThrLys: 1.242 ± 0.039
6.396ThrLeu: 6.396 ± 0.077
1.006ThrMet: 1.006 ± 0.03
1.341ThrAsn: 1.341 ± 0.041
4.347ThrPro: 4.347 ± 0.064
1.872ThrGln: 1.872 ± 0.04
3.946ThrArg: 3.946 ± 0.064
3.539ThrSer: 3.539 ± 0.051
3.71ThrThr: 3.71 ± 0.081
5.755ThrVal: 5.755 ± 0.089
0.807ThrTrp: 0.807 ± 0.03
0.986ThrTyr: 0.986 ± 0.032
0.0ThrXaa: 0.0 ± 0.0
Val
10.863ValAla: 10.863 ± 0.122
0.541ValCys: 0.541 ± 0.026
4.48ValAsp: 4.48 ± 0.068
4.384ValGlu: 4.384 ± 0.077
2.876ValPhe: 2.876 ± 0.053
6.356ValGly: 6.356 ± 0.081
1.614ValHis: 1.614 ± 0.041
4.194ValIle: 4.194 ± 0.066
1.537ValLys: 1.537 ± 0.042
9.169ValLeu: 9.169 ± 0.125
1.418ValMet: 1.418 ± 0.04
1.824ValAsn: 1.824 ± 0.042
4.506ValPro: 4.506 ± 0.073
2.446ValGln: 2.446 ± 0.045
5.484ValArg: 5.484 ± 0.077
5.389ValSer: 5.389 ± 0.065
5.865ValThr: 5.865 ± 0.082
7.194ValVal: 7.194 ± 0.093
1.104ValTrp: 1.104 ± 0.032
1.48ValTyr: 1.48 ± 0.036
0.0ValXaa: 0.0 ± 0.0
Trp
1.561TrpAla: 1.561 ± 0.043
0.106TrpCys: 0.106 ± 0.009
0.738TrpAsp: 0.738 ± 0.024
0.694TrpGlu: 0.694 ± 0.027
0.557TrpPhe: 0.557 ± 0.022
1.114TrpGly: 1.114 ± 0.038
0.303TrpHis: 0.303 ± 0.018
0.7TrpIle: 0.7 ± 0.027
0.313TrpLys: 0.313 ± 0.016
1.627TrpLeu: 1.627 ± 0.043
0.294TrpMet: 0.294 ± 0.016
0.401TrpAsn: 0.401 ± 0.023
0.659TrpPro: 0.659 ± 0.024
0.529TrpGln: 0.529 ± 0.02
1.117TrpArg: 1.117 ± 0.036
0.869TrpSer: 0.869 ± 0.025
0.769TrpThr: 0.769 ± 0.032
1.208TrpVal: 1.208 ± 0.035
0.313TrpTrp: 0.313 ± 0.018
0.255TrpTyr: 0.255 ± 0.014
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.317TyrAla: 2.317 ± 0.041
0.128TyrCys: 0.128 ± 0.01
1.2TyrAsp: 1.2 ± 0.036
1.075TyrGlu: 1.075 ± 0.034
0.668TyrPhe: 0.668 ± 0.024
1.815TyrGly: 1.815 ± 0.04
0.298TyrHis: 0.298 ± 0.019
0.651TyrIle: 0.651 ± 0.029
0.3TyrLys: 0.3 ± 0.018
2.035TyrLeu: 2.035 ± 0.045
0.286TyrMet: 0.286 ± 0.017
0.41TyrAsn: 0.41 ± 0.021
0.982TyrPro: 0.982 ± 0.027
0.594TyrGln: 0.594 ± 0.023
1.626TyrArg: 1.626 ± 0.042
1.14TyrSer: 1.14 ± 0.036
1.176TyrThr: 1.176 ± 0.036
1.544TyrVal: 1.544 ± 0.037
0.298TyrTrp: 0.298 ± 0.017
0.421TyrTyr: 0.421 ± 0.023
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3417 proteins (1145008 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski