Amino acid dipeptide frequency for Arcticibacter tournemirensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of neighbouring amino acids occurs.
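As an illustration of what such a calculation might look like, here is a minimal Python sketch. It is not the code used by the database; it assumes that dipeptides are counted as overlapping windows of two residues, that pairs never span two proteins, and that frequencies are normalised by the total number of dipeptides. The function name dipeptide_frequencies and the toy sequences (written in one-letter codes) are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: overlapping windows of length 2, counted within each
    protein separately (no pair spans two proteins).
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: two short, made-up sequences in one-letter code.
toy = ["MKLLA", "ALLK"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

The toy input yields seven dipeptides in total, two of which are LL, so LL accounts for 2/7 of all pairs.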

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
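The comparison against the uniform expectation can be sketched as follows. This is only an illustration: the exact rule the page uses for colouring is not stated, so the sketch assumes a plain comparison with 1/400 (2.5‰). The function name classify is hypothetical; the four example values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids, Xaa not counted

# Uniform expectation: 1 / (20 * 20) = 0.25% = 2.5 per mille.
expected_permille = 1000.0 / N_STANDARD ** 2


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Frequencies in per mille, copied from the table.
examples = {"LeuLeu": 9.236, "AlaAla": 6.158, "CysCys": 0.127, "TrpTrp": 0.221}
labels = {name: classify(value) for name, value in examples.items()}
for name, label in labels.items():
    print(name, examples[name], label)
```

Under this assumed rule LeuLeu and AlaAla fall above the 2.5‰ baseline, while CysCys and TrpTrp fall below it.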

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
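A short sketch of the unit conversion, using the LeuLeu value from the table as an example. The helper names are hypothetical.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


leu_leu = 9.236  # LeuLeu frequency in per mille, from the table
print(permille_to_fraction(leu_leu))
print(permille_to_percent(leu_leu))
```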

For more information, see the sequence space article on Wikipedia.

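Column order (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each block below is headed by the first residue; entries give the dipeptide and its frequency ± error in ‰.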
Ala
AlaAla: 6.158 ± 0.083
AlaCys: 0.647 ± 0.019
AlaAsp: 4.258 ± 0.059
AlaGlu: 4.611 ± 0.064
AlaPhe: 3.522 ± 0.048
AlaGly: 6.155 ± 0.068
AlaHis: 1.029 ± 0.029
AlaIle: 4.911 ± 0.066
AlaLys: 4.03 ± 0.056
AlaLeu: 6.629 ± 0.074
AlaMet: 1.54 ± 0.033
AlaAsn: 3.329 ± 0.056
AlaPro: 2.251 ± 0.045
AlaGln: 2.431 ± 0.044
AlaArg: 3.027 ± 0.044
AlaSer: 4.954 ± 0.063
AlaThr: 3.716 ± 0.065
AlaVal: 5.016 ± 0.061
AlaTrp: 0.829 ± 0.026
AlaTyr: 2.806 ± 0.046
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.491 ± 0.022
CysCys: 0.127 ± 0.01
CysAsp: 0.403 ± 0.016
CysGlu: 0.361 ± 0.016
CysPhe: 0.429 ± 0.018
CysGly: 0.628 ± 0.022
CysHis: 0.175 ± 0.012
CysIle: 0.628 ± 0.018
CysLys: 0.468 ± 0.021
CysLeu: 0.806 ± 0.024
CysMet: 0.17 ± 0.009
CysAsn: 0.372 ± 0.015
CysPro: 0.303 ± 0.016
CysGln: 0.224 ± 0.014
CysArg: 0.402 ± 0.017
CysSer: 0.636 ± 0.02
CysThr: 0.41 ± 0.02
CysVal: 0.447 ± 0.018
CysTrp: 0.104 ± 0.015
CysTyr: 0.318 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.739 ± 0.058
AspCys: 0.395 ± 0.016
AspAsp: 2.6 ± 0.05
AspGlu: 3.311 ± 0.05
AspPhe: 3.029 ± 0.045
AspGly: 3.908 ± 0.069
AspHis: 0.906 ± 0.023
AspIle: 4.11 ± 0.053
AspLys: 3.594 ± 0.047
AspLeu: 5.043 ± 0.063
AspMet: 1.135 ± 0.03
AspAsn: 2.683 ± 0.046
AspPro: 2.419 ± 0.039
AspGln: 1.709 ± 0.036
AspArg: 2.367 ± 0.044
AspSer: 3.088 ± 0.05
AspThr: 2.447 ± 0.044
AspVal: 3.402 ± 0.052
AspTrp: 0.826 ± 0.022
AspTyr: 2.563 ± 0.041
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.44 ± 0.063
GluCys: 0.364 ± 0.017
GluAsp: 2.898 ± 0.05
GluGlu: 4.143 ± 0.07
GluPhe: 2.453 ± 0.041
GluGly: 3.913 ± 0.053
GluHis: 1.029 ± 0.028
GluIle: 4.351 ± 0.066
GluLys: 4.811 ± 0.072
GluLeu: 5.576 ± 0.065
GluMet: 1.511 ± 0.032
GluAsn: 3.352 ± 0.052
GluPro: 1.768 ± 0.037
GluGln: 2.303 ± 0.043
GluArg: 3.028 ± 0.045
GluSer: 3.361 ± 0.049
GluThr: 3.015 ± 0.045
GluVal: 3.934 ± 0.05
GluTrp: 0.76 ± 0.024
GluTyr: 2.253 ± 0.042
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.153 ± 0.048
PheCys: 0.48 ± 0.017
PheAsp: 2.871 ± 0.04
PheGlu: 2.652 ± 0.048
PhePhe: 2.527 ± 0.05
PheGly: 3.363 ± 0.053
PheHis: 0.808 ± 0.026
PheIle: 3.513 ± 0.051
PheLys: 3.036 ± 0.049
PheLeu: 4.406 ± 0.064
PheMet: 1.108 ± 0.026
PheAsn: 3.054 ± 0.047
PhePro: 1.766 ± 0.033
PheGln: 1.388 ± 0.03
PheArg: 2.289 ± 0.038
PheSer: 4.124 ± 0.061
PheThr: 2.983 ± 0.043
PheVal: 2.789 ± 0.05
PheTrp: 0.628 ± 0.021
PheTyr: 2.148 ± 0.04
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.584 ± 0.058
GlyCys: 0.665 ± 0.031
GlyAsp: 3.504 ± 0.052
GlyGlu: 3.75 ± 0.057
GlyPhe: 3.735 ± 0.056
GlyGly: 5.103 ± 0.084
GlyHis: 1.163 ± 0.027
GlyIle: 5.42 ± 0.062
GlyLys: 5.357 ± 0.053
GlyLeu: 6.386 ± 0.079
GlyMet: 1.71 ± 0.031
GlyAsn: 3.785 ± 0.065
GlyPro: 1.69 ± 0.033
GlyGln: 2.265 ± 0.042
GlyArg: 3.158 ± 0.048
GlySer: 5.031 ± 0.072
GlyThr: 4.443 ± 0.074
GlyVal: 4.51 ± 0.059
GlyTrp: 1.066 ± 0.03
GlyTyr: 3.308 ± 0.054
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.073 ± 0.027
HisCys: 0.175 ± 0.011
HisAsp: 0.837 ± 0.023
HisGlu: 0.986 ± 0.026
HisPhe: 1.032 ± 0.026
HisGly: 1.101 ± 0.033
HisHis: 0.477 ± 0.021
HisIle: 1.281 ± 0.03
HisLys: 0.881 ± 0.025
HisLeu: 1.764 ± 0.036
HisMet: 0.306 ± 0.013
HisAsn: 0.886 ± 0.023
HisPro: 0.931 ± 0.026
HisGln: 0.669 ± 0.023
HisArg: 0.791 ± 0.022
HisSer: 1.104 ± 0.026
HisThr: 0.964 ± 0.027
HisVal: 0.932 ± 0.024
HisTrp: 0.253 ± 0.014
HisTyr: 0.836 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.456 ± 0.073
IleCys: 0.7 ± 0.025
IleAsp: 3.931 ± 0.05
IleGlu: 4.278 ± 0.06
IlePhe: 2.994 ± 0.049
IleGly: 4.572 ± 0.069
IleHis: 1.288 ± 0.034
IleIle: 4.828 ± 0.073
IleLys: 4.637 ± 0.06
IleLeu: 6.132 ± 0.08
IleMet: 1.313 ± 0.03
IleAsn: 3.917 ± 0.051
IlePro: 3.114 ± 0.05
IleGln: 2.105 ± 0.037
IleArg: 3.498 ± 0.046
IleSer: 5.481 ± 0.064
IleThr: 4.354 ± 0.058
IleVal: 4.203 ± 0.053
IleTrp: 0.748 ± 0.024
IleTyr: 2.601 ± 0.045
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.026 ± 0.067
LysCys: 0.326 ± 0.014
LysAsp: 4.065 ± 0.055
LysGlu: 4.778 ± 0.06
LysPhe: 2.412 ± 0.038
LysGly: 4.595 ± 0.05
LysHis: 1.14 ± 0.029
LysIle: 4.59 ± 0.056
LysLys: 4.937 ± 0.067
LysLeu: 5.793 ± 0.063
LysMet: 1.666 ± 0.035
LysAsn: 3.774 ± 0.048
LysPro: 2.558 ± 0.043
LysGln: 2.399 ± 0.043
LysArg: 3.046 ± 0.046
LysSer: 3.946 ± 0.053
LysThr: 3.798 ± 0.05
LysVal: 4.418 ± 0.055
LysTrp: 0.836 ± 0.023
LysTyr: 2.835 ± 0.053
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.416 ± 0.068
LeuCys: 0.784 ± 0.025
LeuAsp: 4.52 ± 0.056
LeuGlu: 4.996 ± 0.07
LeuPhe: 4.731 ± 0.065
LeuGly: 5.685 ± 0.068
LeuHis: 1.6 ± 0.033
LeuIle: 6.451 ± 0.079
LeuLys: 7.136 ± 0.077
LeuLeu: 9.236 ± 0.107
LeuMet: 2.067 ± 0.038
LeuAsn: 5.55 ± 0.07
LeuPro: 4.111 ± 0.052
LeuGln: 3.348 ± 0.048
LeuArg: 4.375 ± 0.057
LeuSer: 7.559 ± 0.084
LeuThr: 5.196 ± 0.06
LeuVal: 5.335 ± 0.06
LeuTrp: 1.013 ± 0.025
LeuTyr: 3.474 ± 0.053
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.609 ± 0.037
MetCys: 0.127 ± 0.009
MetAsp: 1.125 ± 0.027
MetGlu: 1.3 ± 0.031
MetPhe: 0.856 ± 0.026
MetGly: 1.424 ± 0.033
MetHis: 0.386 ± 0.015
MetIle: 1.495 ± 0.031
MetLys: 1.994 ± 0.034
MetLeu: 2.056 ± 0.041
MetMet: 0.561 ± 0.019
MetAsn: 1.323 ± 0.027
MetPro: 1.0 ± 0.027
MetGln: 0.864 ± 0.023
MetArg: 1.043 ± 0.027
MetSer: 1.382 ± 0.03
MetThr: 1.078 ± 0.028
MetVal: 1.331 ± 0.028
MetTrp: 0.205 ± 0.011
MetTyr: 0.723 ± 0.022
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.821 ± 0.055
AsnCys: 0.398 ± 0.019
AsnAsp: 2.692 ± 0.044
AsnGlu: 3.116 ± 0.046
AsnPhe: 2.502 ± 0.046
AsnGly: 4.081 ± 0.062
AsnHis: 0.936 ± 0.028
AsnIle: 4.065 ± 0.056
AsnLys: 3.453 ± 0.044
AsnLeu: 4.783 ± 0.061
AsnMet: 1.094 ± 0.027
AsnAsn: 3.169 ± 0.054
AsnPro: 2.747 ± 0.046
AsnGln: 1.804 ± 0.037
AsnArg: 2.582 ± 0.043
AsnSer: 3.542 ± 0.057
AsnThr: 3.29 ± 0.059
AsnVal: 3.257 ± 0.048
AsnTrp: 0.784 ± 0.025
AsnTyr: 2.603 ± 0.043
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.307 ± 0.05
ProCys: 0.238 ± 0.012
ProAsp: 2.696 ± 0.045
ProGlu: 3.072 ± 0.047
ProPhe: 1.965 ± 0.035
ProGly: 3.25 ± 0.051
ProHis: 0.684 ± 0.022
ProIle: 1.876 ± 0.039
ProLys: 1.847 ± 0.035
ProLeu: 3.597 ± 0.05
ProMet: 0.711 ± 0.02
ProAsn: 1.693 ± 0.036
ProPro: 1.068 ± 0.03
ProGln: 1.395 ± 0.028
ProArg: 1.379 ± 0.034
ProSer: 2.635 ± 0.043
ProThr: 1.641 ± 0.043
ProVal: 3.641 ± 0.056
ProTrp: 0.5 ± 0.017
ProTyr: 1.576 ± 0.031
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.329 ± 0.042
GlnCys: 0.2 ± 0.012
GlnAsp: 1.558 ± 0.027
GlnGlu: 1.962 ± 0.038
GlnPhe: 1.556 ± 0.031
GlnGly: 2.108 ± 0.034
GlnHis: 0.645 ± 0.022
GlnIle: 2.281 ± 0.039
GlnLys: 2.53 ± 0.042
GlnLeu: 3.47 ± 0.055
GlnMet: 0.808 ± 0.024
GlnAsn: 1.917 ± 0.037
GlnPro: 1.306 ± 0.031
GlnGln: 1.721 ± 0.035
GlnArg: 1.607 ± 0.029
GlnSer: 2.195 ± 0.036
GlnThr: 1.935 ± 0.037
GlnVal: 2.127 ± 0.039
GlnTrp: 0.477 ± 0.019
GlnTyr: 1.514 ± 0.031
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.839 ± 0.038
ArgCys: 0.282 ± 0.014
ArgAsp: 2.358 ± 0.04
ArgGlu: 2.793 ± 0.042
ArgPhe: 2.486 ± 0.04
ArgGly: 2.621 ± 0.047
ArgHis: 0.791 ± 0.023
ArgIle: 3.545 ± 0.047
ArgLys: 3.279 ± 0.047
ArgLeu: 4.527 ± 0.057
ArgMet: 1.184 ± 0.03
ArgAsn: 2.706 ± 0.04
ArgPro: 1.651 ± 0.036
ArgGln: 1.735 ± 0.033
ArgArg: 2.209 ± 0.037
ArgSer: 2.811 ± 0.037
ArgThr: 2.371 ± 0.043
ArgVal: 2.755 ± 0.044
ArgTrp: 0.667 ± 0.02
ArgTyr: 2.137 ± 0.038
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.165 ± 0.06
SerCys: 0.603 ± 0.02
SerAsp: 3.63 ± 0.045
SerGlu: 3.787 ± 0.05
SerPhe: 3.993 ± 0.055
SerGly: 5.65 ± 0.076
SerHis: 1.149 ± 0.032
SerIle: 4.627 ± 0.06
SerLys: 3.964 ± 0.053
SerLeu: 6.847 ± 0.072
SerMet: 1.391 ± 0.033
SerAsn: 3.376 ± 0.053
SerPro: 2.703 ± 0.044
SerGln: 2.066 ± 0.039
SerArg: 3.187 ± 0.044
SerSer: 5.05 ± 0.072
SerThr: 3.54 ± 0.06
SerVal: 4.933 ± 0.065
SerTrp: 0.912 ± 0.027
SerTyr: 3.122 ± 0.045
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.356 ± 0.066
ThrCys: 0.347 ± 0.017
ThrAsp: 3.204 ± 0.047
ThrGlu: 3.153 ± 0.044
ThrPhe: 2.682 ± 0.041
ThrGly: 5.082 ± 0.079
ThrHis: 0.911 ± 0.026
ThrIle: 3.912 ± 0.054
ThrLys: 2.835 ± 0.04
ThrLeu: 5.056 ± 0.055
ThrMet: 0.979 ± 0.024
ThrAsn: 2.637 ± 0.057
ThrPro: 2.438 ± 0.046
ThrGln: 1.612 ± 0.032
ThrArg: 2.195 ± 0.034
ThrSer: 3.817 ± 0.066
ThrThr: 3.137 ± 0.06
ThrVal: 3.983 ± 0.059
ThrTrp: 0.686 ± 0.024
ThrTyr: 2.383 ± 0.047
ThrXaa: 0.001 ± 0.001
Val
ValAla: 4.381 ± 0.06
ValCys: 0.575 ± 0.021
ValAsp: 3.169 ± 0.045
ValGlu: 3.364 ± 0.05
ValPhe: 3.261 ± 0.047
ValGly: 3.559 ± 0.057
ValHis: 1.06 ± 0.029
ValIle: 4.816 ± 0.064
ValLys: 4.598 ± 0.063
ValLeu: 6.281 ± 0.068
ValMet: 1.429 ± 0.033
ValAsn: 3.717 ± 0.052
ValPro: 2.685 ± 0.042
ValGln: 2.056 ± 0.039
ValArg: 2.818 ± 0.043
ValSer: 4.966 ± 0.061
ValThr: 3.785 ± 0.057
ValVal: 4.398 ± 0.064
ValTrp: 0.768 ± 0.023
ValTyr: 2.661 ± 0.044
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.794 ± 0.025
TrpCys: 0.118 ± 0.009
TrpAsp: 0.703 ± 0.023
TrpGlu: 0.733 ± 0.02
TrpPhe: 0.598 ± 0.02
TrpGly: 0.882 ± 0.026
TrpHis: 0.266 ± 0.011
TrpIle: 0.825 ± 0.027
TrpLys: 0.936 ± 0.025
TrpLeu: 1.213 ± 0.031
TrpMet: 0.385 ± 0.016
TrpAsn: 0.785 ± 0.022
TrpPro: 0.409 ± 0.016
TrpGln: 0.544 ± 0.019
TrpArg: 0.582 ± 0.02
TrpSer: 0.824 ± 0.026
TrpThr: 0.743 ± 0.021
TrpVal: 0.726 ± 0.024
TrpTrp: 0.221 ± 0.012
TrpTyr: 0.549 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.712 ± 0.047
TyrCys: 0.354 ± 0.017
TyrAsp: 2.254 ± 0.041
TyrGlu: 2.117 ± 0.043
TyrPhe: 2.322 ± 0.04
TyrGly: 2.952 ± 0.052
TyrHis: 0.838 ± 0.026
TyrIle: 2.624 ± 0.04
TyrLys: 2.619 ± 0.046
TyrLeu: 3.983 ± 0.06
TyrMet: 0.813 ± 0.025
TyrAsn: 2.693 ± 0.047
TyrPro: 1.855 ± 0.034
TyrGln: 1.607 ± 0.032
TyrArg: 2.162 ± 0.036
TyrSer: 3.187 ± 0.046
TyrThr: 2.56 ± 0.043
TyrVal: 2.207 ± 0.042
TyrTrp: 0.55 ± 0.02
TyrTyr: 1.984 ± 0.041
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.001
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4170 proteins (1573326 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
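A protein-level bootstrap of this kind could be sketched roughly as below. This is an assumption-laden illustration, not the database's own procedure: it assumes that whole proteins are resampled with replacement, that "x100" means 100 replicates, and that the reported error is the standard deviation across replicates. The names pair_counts and bootstrap_error and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input: made-up sequences in one-letter code.
toy = ["MKLLA", "ALLK", "LLLL", "MKV"]
mean, err = bootstrap_error(toy, "LL")
print(f"LL: {mean:.1f} ± {err:.1f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the error estimate reflects variation between proteins.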

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski