Amino acid dipepetide frequency for Microvirga ossetica

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.321AlaAla: 15.321 ± 0.118
1.073AlaCys: 1.073 ± 0.021
6.119AlaAsp: 6.119 ± 0.056
7.252AlaGlu: 7.252 ± 0.064
4.459AlaPhe: 4.459 ± 0.048
9.683AlaGly: 9.683 ± 0.071
2.291AlaHis: 2.291 ± 0.031
6.431AlaIle: 6.431 ± 0.054
4.064AlaLys: 4.064 ± 0.053
12.895AlaLeu: 12.895 ± 0.093
3.297AlaMet: 3.297 ± 0.035
2.762AlaAsn: 2.762 ± 0.034
5.09AlaPro: 5.09 ± 0.054
4.07AlaGln: 4.07 ± 0.045
8.203AlaArg: 8.203 ± 0.07
6.513AlaSer: 6.513 ± 0.053
5.77AlaThr: 5.77 ± 0.05
8.689AlaVal: 8.689 ± 0.068
1.514AlaTrp: 1.514 ± 0.026
2.648AlaTyr: 2.648 ± 0.031
0.0AlaXaa: 0.0 ± 0.0
Cys
0.876CysAla: 0.876 ± 0.018
0.136CysCys: 0.136 ± 0.009
0.488CysAsp: 0.488 ± 0.014
0.445CysGlu: 0.445 ± 0.014
0.3CysPhe: 0.3 ± 0.011
0.822CysGly: 0.822 ± 0.02
0.235CysHis: 0.235 ± 0.01
0.398CysIle: 0.398 ± 0.015
0.194CysLys: 0.194 ± 0.009
0.853CysLeu: 0.853 ± 0.022
0.161CysMet: 0.161 ± 0.009
0.198CysAsn: 0.198 ± 0.008
0.459CysPro: 0.459 ± 0.015
0.24CysGln: 0.24 ± 0.01
0.71CysArg: 0.71 ± 0.017
0.469CysSer: 0.469 ± 0.014
0.434CysThr: 0.434 ± 0.014
0.562CysVal: 0.562 ± 0.016
0.115CysTrp: 0.115 ± 0.007
0.181CysTyr: 0.181 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
6.265AspAla: 6.265 ± 0.052
0.414AspCys: 0.414 ± 0.013
2.98AspAsp: 2.98 ± 0.047
3.696AspGlu: 3.696 ± 0.044
2.011AspPhe: 2.011 ± 0.03
4.969AspGly: 4.969 ± 0.066
1.221AspHis: 1.221 ± 0.023
2.812AspIle: 2.812 ± 0.043
1.719AspLys: 1.719 ± 0.032
6.143AspLeu: 6.143 ± 0.056
1.163AspMet: 1.163 ± 0.021
1.252AspAsn: 1.252 ± 0.025
3.64AspPro: 3.64 ± 0.044
1.826AspGln: 1.826 ± 0.03
4.438AspArg: 4.438 ± 0.045
2.087AspSer: 2.087 ± 0.032
2.595AspThr: 2.595 ± 0.043
4.169AspVal: 4.169 ± 0.042
0.882AspTrp: 0.882 ± 0.019
1.343AspTyr: 1.343 ± 0.026
0.0AspXaa: 0.0 ± 0.0
Glu
7.953GluAla: 7.953 ± 0.073
0.373GluCys: 0.373 ± 0.013
2.801GluAsp: 2.801 ± 0.034
3.31GluGlu: 3.31 ± 0.049
1.784GluPhe: 1.784 ± 0.028
4.325GluGly: 4.325 ± 0.046
1.3GluHis: 1.3 ± 0.021
3.512GluIle: 3.512 ± 0.041
2.073GluLys: 2.073 ± 0.031
5.272GluLeu: 5.272 ± 0.054
1.477GluMet: 1.477 ± 0.027
1.48GluAsn: 1.48 ± 0.023
2.932GluPro: 2.932 ± 0.035
2.23GluGln: 2.23 ± 0.033
5.598GluArg: 5.598 ± 0.058
2.466GluSer: 2.466 ± 0.032
3.499GluThr: 3.499 ± 0.041
4.021GluVal: 4.021 ± 0.046
0.727GluTrp: 0.727 ± 0.018
0.973GluTyr: 0.973 ± 0.023
0.0GluXaa: 0.0 ± 0.0
Phe
4.241PheAla: 4.241 ± 0.043
0.358PheCys: 0.358 ± 0.011
2.466PheAsp: 2.466 ± 0.029
2.134PheGlu: 2.134 ± 0.031
1.361PhePhe: 1.361 ± 0.029
3.544PheGly: 3.544 ± 0.044
0.718PheHis: 0.718 ± 0.019
1.738PheIle: 1.738 ± 0.027
1.097PheLys: 1.097 ± 0.023
3.421PheLeu: 3.421 ± 0.039
0.806PheMet: 0.806 ± 0.019
0.995PheAsn: 0.995 ± 0.022
1.601PhePro: 1.601 ± 0.025
1.122PheGln: 1.122 ± 0.025
2.396PheArg: 2.396 ± 0.033
2.166PheSer: 2.166 ± 0.03
2.016PheThr: 2.016 ± 0.03
2.881PheVal: 2.881 ± 0.032
0.561PheTrp: 0.561 ± 0.017
0.913PheTyr: 0.913 ± 0.018
0.0PheXaa: 0.0 ± 0.0
Gly
8.452GlyAla: 8.452 ± 0.072
0.781GlyCys: 0.781 ± 0.019
4.274GlyAsp: 4.274 ± 0.058
4.595GlyGlu: 4.595 ± 0.052
3.529GlyPhe: 3.529 ± 0.037
6.999GlyGly: 6.999 ± 0.084
1.915GlyHis: 1.915 ± 0.031
4.69GlyIle: 4.69 ± 0.043
3.023GlyLys: 3.023 ± 0.039
8.86GlyLeu: 8.86 ± 0.075
2.089GlyMet: 2.089 ± 0.031
2.311GlyAsn: 2.311 ± 0.055
3.61GlyPro: 3.61 ± 0.042
3.067GlyGln: 3.067 ± 0.035
6.248GlyArg: 6.248 ± 0.056
5.088GlySer: 5.088 ± 0.043
4.729GlyThr: 4.729 ± 0.064
5.637GlyVal: 5.637 ± 0.052
1.4GlyTrp: 1.4 ± 0.025
2.275GlyTyr: 2.275 ± 0.026
0.0GlyXaa: 0.0 ± 0.0
His
2.305HisAla: 2.305 ± 0.033
0.23HisCys: 0.23 ± 0.01
1.26HisAsp: 1.26 ± 0.022
1.193HisGlu: 1.193 ± 0.022
0.829HisPhe: 0.829 ± 0.018
1.923HisGly: 1.923 ± 0.032
0.6HisHis: 0.6 ± 0.018
0.929HisIle: 0.929 ± 0.021
0.531HisLys: 0.531 ± 0.015
2.28HisLeu: 2.28 ± 0.034
0.481HisMet: 0.481 ± 0.014
0.472HisAsn: 0.472 ± 0.013
1.432HisPro: 1.432 ± 0.024
0.641HisGln: 0.641 ± 0.017
1.731HisArg: 1.731 ± 0.028
1.008HisSer: 1.008 ± 0.02
0.855HisThr: 0.855 ± 0.02
1.536HisVal: 1.536 ± 0.027
0.36HisTrp: 0.36 ± 0.013
0.531HisTyr: 0.531 ± 0.013
0.0HisXaa: 0.0 ± 0.0
Ile
6.984IleAla: 6.984 ± 0.055
0.507IleCys: 0.507 ± 0.016
3.44IleAsp: 3.44 ± 0.042
3.629IleGlu: 3.629 ± 0.047
1.638IlePhe: 1.638 ± 0.026
5.06IleGly: 5.06 ± 0.046
0.99IleHis: 0.99 ± 0.022
2.191IleIle: 2.191 ± 0.034
1.463IleLys: 1.463 ± 0.024
4.947IleLeu: 4.947 ± 0.055
1.026IleMet: 1.026 ± 0.019
1.276IleAsn: 1.276 ± 0.026
2.49IlePro: 2.49 ± 0.038
1.437IleGln: 1.437 ± 0.024
3.606IleArg: 3.606 ± 0.04
2.756IleSer: 2.756 ± 0.035
2.718IleThr: 2.718 ± 0.035
4.567IleVal: 4.567 ± 0.044
0.618IleTrp: 0.618 ± 0.013
1.079IleTyr: 1.079 ± 0.021
0.0IleXaa: 0.0 ± 0.0
Lys
4.478LysAla: 4.478 ± 0.053
0.173LysCys: 0.173 ± 0.009
1.899LysAsp: 1.899 ± 0.034
1.711LysGlu: 1.711 ± 0.033
0.825LysPhe: 0.825 ± 0.019
2.781LysGly: 2.781 ± 0.038
0.679LysHis: 0.679 ± 0.015
1.637LysIle: 1.637 ± 0.03
1.195LysLys: 1.195 ± 0.03
3.227LysLeu: 3.227 ± 0.04
0.668LysMet: 0.668 ± 0.019
0.857LysAsn: 0.857 ± 0.022
2.169LysPro: 2.169 ± 0.031
1.046LysGln: 1.046 ± 0.019
2.501LysArg: 2.501 ± 0.039
1.833LysSer: 1.833 ± 0.027
1.913LysThr: 1.913 ± 0.031
2.532LysVal: 2.532 ± 0.042
0.367LysTrp: 0.367 ± 0.013
0.569LysTyr: 0.569 ± 0.019
0.0LysXaa: 0.0 ± 0.0
Leu
12.973LeuAla: 12.973 ± 0.083
0.878LeuCys: 0.878 ± 0.02
5.973LeuAsp: 5.973 ± 0.047
5.424LeuGlu: 5.424 ± 0.056
3.566LeuPhe: 3.566 ± 0.046
8.293LeuGly: 8.293 ± 0.066
1.976LeuHis: 1.976 ± 0.026
5.112LeuIle: 5.112 ± 0.053
3.85LeuLys: 3.85 ± 0.046
9.662LeuLeu: 9.662 ± 0.103
2.305LeuMet: 2.305 ± 0.035
2.685LeuAsn: 2.685 ± 0.035
5.545LeuPro: 5.545 ± 0.05
3.052LeuGln: 3.052 ± 0.037
7.232LeuArg: 7.232 ± 0.062
6.744LeuSer: 6.744 ± 0.051
5.661LeuThr: 5.661 ± 0.054
7.929LeuVal: 7.929 ± 0.071
1.253LeuTrp: 1.253 ± 0.024
2.094LeuTyr: 2.094 ± 0.035
0.0LeuXaa: 0.0 ± 0.0
Met
2.957MetAla: 2.957 ± 0.036
0.131MetCys: 0.131 ± 0.007
1.072MetAsp: 1.072 ± 0.022
1.067MetGlu: 1.067 ± 0.02
0.652MetPhe: 0.652 ± 0.017
1.66MetGly: 1.66 ± 0.029
0.439MetHis: 0.439 ± 0.013
1.318MetIle: 1.318 ± 0.025
0.991MetLys: 0.991 ± 0.021
2.357MetLeu: 2.357 ± 0.034
0.625MetMet: 0.625 ± 0.017
0.78MetAsn: 0.78 ± 0.018
1.475MetPro: 1.475 ± 0.023
0.776MetGln: 0.776 ± 0.02
1.868MetArg: 1.868 ± 0.026
1.587MetSer: 1.587 ± 0.026
1.819MetThr: 1.819 ± 0.026
1.561MetVal: 1.561 ± 0.025
0.199MetTrp: 0.199 ± 0.009
0.291MetTyr: 0.291 ± 0.011
0.0MetXaa: 0.0 ± 0.0
Asn
3.018AsnAla: 3.018 ± 0.032
0.196AsnCys: 0.196 ± 0.009
1.559AsnAsp: 1.559 ± 0.038
1.31AsnGlu: 1.31 ± 0.025
0.862AsnPhe: 0.862 ± 0.02
2.412AsnGly: 2.412 ± 0.043
0.487AsnHis: 0.487 ± 0.015
1.245AsnIle: 1.245 ± 0.028
0.723AsnLys: 0.723 ± 0.019
2.742AsnLeu: 2.742 ± 0.032
0.548AsnMet: 0.548 ± 0.017
0.701AsnAsn: 0.701 ± 0.019
1.859AsnPro: 1.859 ± 0.031
0.807AsnGln: 0.807 ± 0.019
1.852AsnArg: 1.852 ± 0.027
1.279AsnSer: 1.279 ± 0.022
1.277AsnThr: 1.277 ± 0.025
1.954AsnVal: 1.954 ± 0.033
0.395AsnTrp: 0.395 ± 0.013
0.627AsnTyr: 0.627 ± 0.016
0.0AsnXaa: 0.0 ± 0.0
Pro
5.857ProAla: 5.857 ± 0.058
0.33ProCys: 0.33 ± 0.012
3.635ProAsp: 3.635 ± 0.044
3.728ProGlu: 3.728 ± 0.047
2.001ProPhe: 2.001 ± 0.028
4.458ProGly: 4.458 ± 0.042
1.15ProHis: 1.15 ± 0.021
2.453ProIle: 2.453 ± 0.033
1.86ProLys: 1.86 ± 0.034
4.76ProLeu: 4.76 ± 0.043
1.201ProMet: 1.201 ± 0.022
1.499ProAsn: 1.499 ± 0.026
2.728ProPro: 2.728 ± 0.04
1.813ProGln: 1.813 ± 0.03
2.995ProArg: 2.995 ± 0.041
3.294ProSer: 3.294 ± 0.035
2.687ProThr: 2.687 ± 0.036
4.254ProVal: 4.254 ± 0.045
0.757ProTrp: 0.757 ± 0.019
1.213ProTyr: 1.213 ± 0.021
0.0ProXaa: 0.0 ± 0.0
Gln
4.494GlnAla: 4.494 ± 0.048
0.222GlnCys: 0.222 ± 0.01
1.741GlnAsp: 1.741 ± 0.03
1.83GlnGlu: 1.83 ± 0.025
1.044GlnPhe: 1.044 ± 0.02
2.638GlnGly: 2.638 ± 0.031
0.739GlnHis: 0.739 ± 0.021
1.856GlnIle: 1.856 ± 0.027
1.092GlnLys: 1.092 ± 0.024
2.813GlnLeu: 2.813 ± 0.039
0.824GlnMet: 0.824 ± 0.019
0.879GlnAsn: 0.879 ± 0.02
1.86GlnPro: 1.86 ± 0.026
1.309GlnGln: 1.309 ± 0.025
2.688GlnArg: 2.688 ± 0.036
1.862GlnSer: 1.862 ± 0.027
1.826GlnThr: 1.826 ± 0.029
2.506GlnVal: 2.506 ± 0.031
0.418GlnTrp: 0.418 ± 0.012
0.622GlnTyr: 0.622 ± 0.016
0.0GlnXaa: 0.0 ± 0.0
Arg
7.567ArgAla: 7.567 ± 0.055
0.594ArgCys: 0.594 ± 0.018
3.954ArgAsp: 3.954 ± 0.043
4.599ArgGlu: 4.599 ± 0.055
3.034ArgPhe: 3.034 ± 0.038
4.858ArgGly: 4.858 ± 0.047
1.855ArgHis: 1.855 ± 0.03
4.342ArgIle: 4.342 ± 0.037
2.412ArgLys: 2.412 ± 0.034
8.341ArgLeu: 8.341 ± 0.081
1.909ArgMet: 1.909 ± 0.025
1.954ArgAsn: 1.954 ± 0.029
3.705ArgPro: 3.705 ± 0.041
2.939ArgGln: 2.939 ± 0.042
6.353ArgArg: 6.353 ± 0.074
4.316ArgSer: 4.316 ± 0.039
3.694ArgThr: 3.694 ± 0.04
5.015ArgVal: 5.015 ± 0.048
1.178ArgTrp: 1.178 ± 0.026
1.81ArgTyr: 1.81 ± 0.029
0.0ArgXaa: 0.0 ± 0.0
Ser
5.957SerAla: 5.957 ± 0.054
0.444SerCys: 0.444 ± 0.016
3.069SerAsp: 3.069 ± 0.036
2.958SerGlu: 2.958 ± 0.038
2.359SerPhe: 2.359 ± 0.034
5.543SerGly: 5.543 ± 0.056
1.214SerHis: 1.214 ± 0.023
2.956SerIle: 2.956 ± 0.037
1.684SerLys: 1.684 ± 0.032
6.02SerLeu: 6.02 ± 0.056
1.318SerMet: 1.318 ± 0.026
1.415SerAsn: 1.415 ± 0.028
3.104SerPro: 3.104 ± 0.035
1.795SerGln: 1.795 ± 0.029
4.039SerArg: 4.039 ± 0.045
3.254SerSer: 3.254 ± 0.041
2.913SerThr: 2.913 ± 0.036
4.072SerVal: 4.072 ± 0.047
0.828SerTrp: 0.828 ± 0.018
1.395SerTyr: 1.395 ± 0.027
0.0SerXaa: 0.0 ± 0.0
Thr
5.93ThrAla: 5.93 ± 0.048
0.435ThrCys: 0.435 ± 0.016
2.751ThrAsp: 2.751 ± 0.042
2.712ThrGlu: 2.712 ± 0.03
2.125ThrPhe: 2.125 ± 0.028
5.007ThrGly: 5.007 ± 0.051
1.06ThrHis: 1.06 ± 0.024
3.152ThrIle: 3.152 ± 0.034
1.651ThrLys: 1.651 ± 0.029
5.941ThrLeu: 5.941 ± 0.058
1.206ThrMet: 1.206 ± 0.02
1.32ThrAsn: 1.32 ± 0.026
3.189ThrPro: 3.189 ± 0.04
1.503ThrGln: 1.503 ± 0.026
3.461ThrArg: 3.461 ± 0.04
2.989ThrSer: 2.989 ± 0.037
2.832ThrThr: 2.832 ± 0.043
4.356ThrVal: 4.356 ± 0.05
0.785ThrTrp: 0.785 ± 0.019
1.314ThrTyr: 1.314 ± 0.027
0.0ThrXaa: 0.0 ± 0.0
Val
8.738ValAla: 8.738 ± 0.063
0.628ValCys: 0.628 ± 0.015
4.043ValAsp: 4.043 ± 0.048
4.674ValGlu: 4.674 ± 0.049
2.712ValPhe: 2.712 ± 0.035
5.584ValGly: 5.584 ± 0.055
1.428ValHis: 1.428 ± 0.025
4.008ValIle: 4.008 ± 0.042
2.315ValLys: 2.315 ± 0.037
7.715ValLeu: 7.715 ± 0.069
1.799ValMet: 1.799 ± 0.03
1.971ValAsn: 1.971 ± 0.028
3.913ValPro: 3.913 ± 0.037
2.311ValGln: 2.311 ± 0.032
5.357ValArg: 5.357 ± 0.047
4.562ValSer: 4.562 ± 0.048
4.549ValThr: 4.549 ± 0.047
6.048ValVal: 6.048 ± 0.054
0.914ValTrp: 0.914 ± 0.019
1.538ValTyr: 1.538 ± 0.025
0.0ValXaa: 0.0 ± 0.0
Trp
1.287TrpAla: 1.287 ± 0.027
0.138TrpCys: 0.138 ± 0.008
0.677TrpAsp: 0.677 ± 0.017
0.571TrpGlu: 0.571 ± 0.016
0.56TrpPhe: 0.56 ± 0.016
0.894TrpGly: 0.894 ± 0.021
0.368TrpHis: 0.368 ± 0.011
0.76TrpIle: 0.76 ± 0.018
0.49TrpLys: 0.49 ± 0.015
1.718TrpLeu: 1.718 ± 0.025
0.343TrpMet: 0.343 ± 0.011
0.45TrpAsn: 0.45 ± 0.013
0.707TrpPro: 0.707 ± 0.017
0.592TrpGln: 0.592 ± 0.016
1.243TrpArg: 1.243 ± 0.023
0.899TrpSer: 0.899 ± 0.02
0.83TrpThr: 0.83 ± 0.019
0.831TrpVal: 0.831 ± 0.02
0.232TrpTrp: 0.232 ± 0.012
0.301TrpTyr: 0.301 ± 0.011
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.491TyrAla: 2.491 ± 0.034
0.236TyrCys: 0.236 ± 0.01
1.426TyrAsp: 1.426 ± 0.028
1.269TyrGlu: 1.269 ± 0.023
0.886TyrPhe: 0.886 ± 0.018
2.119TyrGly: 2.119 ± 0.032
0.462TyrHis: 0.462 ± 0.012
0.883TyrIle: 0.883 ± 0.021
0.633TyrLys: 0.633 ± 0.017
2.238TyrLeu: 2.238 ± 0.03
0.377TyrMet: 0.377 ± 0.011
0.595TyrAsn: 0.595 ± 0.016
1.166TyrPro: 1.166 ± 0.024
0.687TyrGln: 0.687 ± 0.016
1.909TyrArg: 1.909 ± 0.03
1.163TyrSer: 1.163 ± 0.023
1.104TyrThr: 1.104 ± 0.023
1.687TyrVal: 1.687 ± 0.028
0.387TyrTrp: 0.387 ± 0.013
0.602TyrTyr: 0.602 ± 0.019
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8355 proteins (2404899 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski