Amino acid dipeptide frequency for Klebsormidium nitens (Green alga) (Ulothrix nitens)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.
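The counting behind such a table can be sketched in a few lines of Python. This is only an illustration under stated assumptions: sequences are given in one-letter code, adjacent residue pairs are counted with overlap and never across protein boundaries, and the function name `dipeptide_per_mille` and the toy sequences are hypothetical. The page does not state how the database itself performs the count.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within one protein; pairs never span two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 20 * 20 = 400 equally likely dipeptides,
# i.e. 0.25% or 2.5 per mille each.
uniform_expectation = 1000.0 / (len(STANDARD) ** 2)

# Hypothetical toy input, not taken from the proteome.
toy_proteins = ["MAAL", "MAG"]
freqs = dipeptide_per_mille(toy_proteins)

for pair, value in sorted(freqs.items()):
    label = "over" if value > uniform_expectation else "under"
    print(f"{pair}: {value:.1f} per mille ({label}-represented)")
```

With only two short toy sequences every observed pair is far above the uniform expectation; on a full proteome the same comparison would separate over-represented from under-represented dipeptides.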

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
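As a small worked example of the unit conversion, the following Python sketch converts the AlaAla value from the table (14.546‰) into a fraction and a percentage and compares it with the uniform expectation of 2.5‰. The helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = 14.546            # AlaAla frequency from the table, in per mille
uniform_expectation = 2.5   # 0.25% expressed in per mille

fraction = per_mille_to_fraction(ala_ala)   # share of all dipeptides
percent = fraction * 100                    # the same share in percent
ratio = ala_ala / uniform_expectation       # fold change versus uniform expectation

print(f"fraction = {fraction:.6f}")
print(f"percent  = {percent:.4f}")
print(f"ratio    = {ratio:.2f}")
```

In other words, AlaAla is roughly 5.8 times more frequent in this proteome than a uniform distribution over 400 dipeptides would predict.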

For more information see sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Within each group the second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 14.546 ± 0.073
AlaCys: 1.46 ± 0.016
AlaAsp: 5.157 ± 0.03
AlaGlu: 7.773 ± 0.059
AlaPhe: 3.335 ± 0.022
AlaGly: 8.595 ± 0.04
AlaHis: 1.991 ± 0.016
AlaIle: 3.332 ± 0.022
AlaLys: 4.694 ± 0.036
AlaLeu: 9.936 ± 0.052
AlaMet: 1.845 ± 0.016
AlaAsn: 2.765 ± 0.019
AlaPro: 6.345 ± 0.04
AlaGln: 4.091 ± 0.025
AlaArg: 6.956 ± 0.042
AlaSer: 7.923 ± 0.039
AlaThr: 5.223 ± 0.028
AlaVal: 7.17 ± 0.032
AlaTrp: 1.203 ± 0.015
AlaTyr: 1.952 ± 0.019
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.302 ± 0.014
CysCys: 0.425 ± 0.011
CysAsp: 0.666 ± 0.012
CysGlu: 0.76 ± 0.011
CysPhe: 0.586 ± 0.007
CysGly: 1.223 ± 0.016
CysHis: 0.351 ± 0.007
CysIle: 0.556 ± 0.009
CysLys: 0.688 ± 0.01
CysLeu: 1.354 ± 0.014
CysMet: 0.269 ± 0.006
CysAsn: 0.472 ± 0.009
CysPro: 0.942 ± 0.017
CysGln: 0.59 ± 0.009
CysArg: 0.957 ± 0.012
CysSer: 1.129 ± 0.014
CysThr: 0.816 ± 0.011
CysVal: 0.863 ± 0.012
CysTrp: 0.203 ± 0.005
CysTyr: 0.364 ± 0.007
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.289 ± 0.028
AspCys: 0.764 ± 0.012
AspAsp: 3.119 ± 0.028
AspGlu: 4.085 ± 0.03
AspPhe: 1.942 ± 0.017
AspGly: 4.565 ± 0.027
AspHis: 0.925 ± 0.011
AspIle: 1.856 ± 0.015
AspLys: 2.044 ± 0.017
AspLeu: 4.778 ± 0.026
AspMet: 0.999 ± 0.011
AspAsn: 1.227 ± 0.013
AspPro: 3.079 ± 0.02
AspGln: 1.61 ± 0.013
AspArg: 3.049 ± 0.023
AspSer: 3.472 ± 0.02
AspThr: 2.287 ± 0.018
AspVal: 4.142 ± 0.026
AspTrp: 0.825 ± 0.01
AspTyr: 1.15 ± 0.014
AspXaa: 0.0 ± 0.0

Glu
GluAla: 8.102 ± 0.055
GluCys: 0.79 ± 0.01
GluAsp: 3.979 ± 0.028
GluGlu: 7.264 ± 0.073
GluPhe: 1.733 ± 0.015
GluGly: 6.148 ± 0.042
GluHis: 1.356 ± 0.015
GluIle: 2.304 ± 0.017
GluLys: 4.095 ± 0.039
GluLeu: 6.257 ± 0.043
GluMet: 1.406 ± 0.017
GluAsn: 1.924 ± 0.018
GluPro: 3.189 ± 0.026
GluGln: 2.87 ± 0.023
GluArg: 5.806 ± 0.047
GluSer: 4.624 ± 0.037
GluThr: 3.499 ± 0.022
GluVal: 4.508 ± 0.028
GluTrp: 0.876 ± 0.01
GluTyr: 1.341 ± 0.013
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.972 ± 0.021
PheCys: 0.62 ± 0.009
PheAsp: 1.933 ± 0.015
PheGlu: 2.163 ± 0.02
PhePhe: 1.359 ± 0.015
PheGly: 2.774 ± 0.022
PheHis: 0.733 ± 0.01
PheIle: 1.044 ± 0.014
PheLys: 1.39 ± 0.013
PheLeu: 3.3 ± 0.023
PheMet: 0.588 ± 0.009
PheAsn: 1.045 ± 0.013
PhePro: 1.703 ± 0.016
PheGln: 1.366 ± 0.011
PheArg: 2.021 ± 0.017
PheSer: 2.52 ± 0.019
PheThr: 1.61 ± 0.017
PheVal: 2.366 ± 0.019
PheTrp: 0.507 ± 0.008
PheTyr: 0.844 ± 0.012
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 8.783 ± 0.044
GlyCys: 1.143 ± 0.015
GlyAsp: 4.141 ± 0.024
GlyGlu: 5.444 ± 0.039
GlyPhe: 2.795 ± 0.021
GlyGly: 9.148 ± 0.068
GlyHis: 1.696 ± 0.016
GlyIle: 2.628 ± 0.018
GlyLys: 4.029 ± 0.027
GlyLeu: 7.247 ± 0.031
GlyMet: 1.518 ± 0.013
GlyAsn: 2.383 ± 0.019
GlyPro: 4.709 ± 0.032
GlyGln: 3.201 ± 0.024
GlyArg: 5.883 ± 0.035
GlySer: 6.753 ± 0.04
GlyThr: 4.431 ± 0.031
GlyVal: 5.745 ± 0.037
GlyTrp: 1.097 ± 0.012
GlyTyr: 1.729 ± 0.019
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.828 ± 0.015
HisCys: 0.373 ± 0.006
HisAsp: 0.909 ± 0.01
HisGlu: 1.177 ± 0.01
HisPhe: 0.875 ± 0.01
HisGly: 1.528 ± 0.015
HisHis: 0.558 ± 0.009
HisIle: 0.755 ± 0.009
HisLys: 0.874 ± 0.011
HisLeu: 1.997 ± 0.017
HisMet: 0.432 ± 0.008
HisAsn: 0.561 ± 0.009
HisPro: 1.326 ± 0.013
HisGln: 0.774 ± 0.011
HisArg: 1.424 ± 0.014
HisSer: 1.496 ± 0.014
HisThr: 1.0 ± 0.012
HisVal: 1.585 ± 0.014
HisTrp: 0.333 ± 0.006
HisTyr: 0.497 ± 0.008
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.326 ± 0.024
IleCys: 0.551 ± 0.008
IleAsp: 1.924 ± 0.017
IleGlu: 2.193 ± 0.019
IlePhe: 1.237 ± 0.015
IleGly: 2.409 ± 0.02
IleHis: 0.711 ± 0.01
IleIle: 1.24 ± 0.014
IleLys: 1.498 ± 0.015
IleLeu: 3.217 ± 0.022
IleMet: 0.656 ± 0.009
IleAsn: 1.02 ± 0.011
IlePro: 2.047 ± 0.021
IleGln: 1.321 ± 0.013
IleArg: 2.143 ± 0.018
IleSer: 2.277 ± 0.021
IleThr: 1.71 ± 0.019
IleVal: 2.42 ± 0.019
IleTrp: 0.424 ± 0.007
IleTyr: 0.782 ± 0.012
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.969 ± 0.037
LysCys: 0.552 ± 0.01
LysAsp: 2.433 ± 0.019
LysGlu: 3.995 ± 0.038
LysPhe: 1.143 ± 0.012
LysGly: 3.805 ± 0.026
LysHis: 0.974 ± 0.011
LysIle: 1.502 ± 0.015
LysLys: 3.397 ± 0.037
LysLeu: 4.326 ± 0.027
LysMet: 0.943 ± 0.01
LysAsn: 1.255 ± 0.014
LysPro: 2.5 ± 0.022
LysGln: 2.019 ± 0.017
LysArg: 3.911 ± 0.028
LysSer: 2.921 ± 0.021
LysThr: 2.309 ± 0.018
LysVal: 3.194 ± 0.022
LysTrp: 0.588 ± 0.009
LysTyr: 0.996 ± 0.013
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 9.578 ± 0.048
LeuCys: 1.383 ± 0.015
LeuAsp: 4.912 ± 0.031
LeuGlu: 6.916 ± 0.051
LeuPhe: 3.007 ± 0.021
LeuGly: 6.808 ± 0.032
LeuHis: 2.089 ± 0.016
LeuIle: 2.799 ± 0.022
LeuLys: 4.344 ± 0.027
LeuLeu: 9.559 ± 0.055
LeuMet: 1.616 ± 0.015
LeuAsn: 2.513 ± 0.019
LeuPro: 5.673 ± 0.031
LeuGln: 4.487 ± 0.027
LeuArg: 6.803 ± 0.038
LeuSer: 6.743 ± 0.035
LeuThr: 4.805 ± 0.029
LeuVal: 6.146 ± 0.03
LeuTrp: 1.128 ± 0.012
LeuTyr: 1.964 ± 0.018
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.213 ± 0.015
MetCys: 0.237 ± 0.005
MetAsp: 1.069 ± 0.013
MetGlu: 1.388 ± 0.013
MetPhe: 0.539 ± 0.008
MetGly: 1.42 ± 0.014
MetHis: 0.419 ± 0.007
MetIle: 0.581 ± 0.007
MetLys: 0.817 ± 0.011
MetLeu: 1.746 ± 0.015
MetMet: 0.42 ± 0.007
MetAsn: 0.48 ± 0.008
MetPro: 0.928 ± 0.01
MetGln: 0.875 ± 0.01
MetArg: 1.263 ± 0.012
MetSer: 1.212 ± 0.012
MetThr: 0.957 ± 0.011
MetVal: 1.21 ± 0.012
MetTrp: 0.207 ± 0.005
MetTyr: 0.417 ± 0.008
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.693 ± 0.02
AsnCys: 0.425 ± 0.009
AsnAsp: 1.252 ± 0.012
AsnGlu: 1.742 ± 0.015
AsnPhe: 1.024 ± 0.011
AsnGly: 3.024 ± 0.027
AsnHis: 0.534 ± 0.008
AsnIle: 1.069 ± 0.012
AsnLys: 1.219 ± 0.014
AsnLeu: 2.797 ± 0.022
AsnMet: 0.555 ± 0.008
AsnAsn: 0.912 ± 0.014
AsnPro: 2.006 ± 0.017
AsnGln: 1.085 ± 0.013
AsnArg: 1.717 ± 0.016
AsnSer: 1.808 ± 0.018
AsnThr: 1.322 ± 0.014
AsnVal: 2.022 ± 0.017
AsnTrp: 0.388 ± 0.007
AsnTyr: 0.661 ± 0.01
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 6.432 ± 0.039
ProCys: 0.681 ± 0.012
ProAsp: 3.011 ± 0.022
ProGlu: 4.267 ± 0.03
ProPhe: 1.922 ± 0.016
ProGly: 5.274 ± 0.041
ProHis: 1.152 ± 0.012
ProIle: 1.818 ± 0.018
ProLys: 2.485 ± 0.023
ProLeu: 5.213 ± 0.03
ProMet: 0.872 ± 0.011
ProAsn: 1.663 ± 0.016
ProPro: 5.856 ± 0.068
ProGln: 2.387 ± 0.022
ProArg: 3.703 ± 0.027
ProSer: 5.427 ± 0.04
ProThr: 3.158 ± 0.024
ProVal: 4.069 ± 0.028
ProTrp: 0.689 ± 0.009
ProTyr: 1.098 ± 0.014
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.301 ± 0.027
GlnCys: 0.517 ± 0.009
GlnAsp: 1.719 ± 0.015
GlnGlu: 2.918 ± 0.021
GlnPhe: 1.096 ± 0.013
GlnGly: 3.103 ± 0.022
GlnHis: 0.905 ± 0.01
GlnIle: 1.387 ± 0.013
GlnLys: 2.091 ± 0.018
GlnLeu: 3.94 ± 0.027
GlnMet: 0.815 ± 0.011
GlnAsn: 1.211 ± 0.011
GlnPro: 2.383 ± 0.021
GlnGln: 2.219 ± 0.028
GlnArg: 3.075 ± 0.022
GlnSer: 2.753 ± 0.02
GlnThr: 2.044 ± 0.018
GlnVal: 2.469 ± 0.016
GlnTrp: 0.504 ± 0.007
GlnTyr: 0.849 ± 0.013
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 7.002 ± 0.043
ArgCys: 1.021 ± 0.012
ArgAsp: 3.27 ± 0.024
ArgGlu: 4.992 ± 0.042
ArgPhe: 2.236 ± 0.017
ArgGly: 5.603 ± 0.035
ArgHis: 1.419 ± 0.014
ArgIle: 2.343 ± 0.019
ArgLys: 3.869 ± 0.029
ArgLeu: 6.4 ± 0.038
ArgMet: 1.346 ± 0.012
ArgAsn: 2.04 ± 0.016
ArgPro: 3.911 ± 0.029
ArgGln: 2.841 ± 0.021
ArgArg: 5.85 ± 0.04
ArgSer: 4.959 ± 0.034
ArgThr: 3.496 ± 0.024
ArgVal: 4.321 ± 0.026
ArgTrp: 0.921 ± 0.01
ArgTyr: 1.346 ± 0.014
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 7.579 ± 0.039
SerCys: 1.013 ± 0.011
SerAsp: 3.808 ± 0.024
SerGlu: 4.86 ± 0.037
SerPhe: 2.575 ± 0.02
SerGly: 6.931 ± 0.043
SerHis: 1.311 ± 0.013
SerIle: 2.472 ± 0.019
SerLys: 3.385 ± 0.021
SerLeu: 6.662 ± 0.034
SerMet: 1.292 ± 0.013
SerAsn: 2.198 ± 0.02
SerPro: 5.206 ± 0.047
SerGln: 2.735 ± 0.021
SerArg: 4.635 ± 0.028
SerSer: 6.746 ± 0.043
SerThr: 3.894 ± 0.026
SerVal: 4.634 ± 0.029
SerTrp: 0.944 ± 0.011
SerTyr: 1.374 ± 0.015
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.336 ± 0.032
ThrCys: 0.839 ± 0.013
ThrAsp: 2.387 ± 0.017
ThrGlu: 3.203 ± 0.025
ThrPhe: 1.871 ± 0.018
ThrGly: 4.514 ± 0.031
ThrHis: 0.89 ± 0.01
ThrIle: 1.835 ± 0.019
ThrLys: 2.052 ± 0.019
ThrLeu: 4.744 ± 0.032
ThrMet: 0.884 ± 0.01
ThrAsn: 1.404 ± 0.017
ThrPro: 3.694 ± 0.032
ThrGln: 1.68 ± 0.016
ThrArg: 3.01 ± 0.02
ThrSer: 4.297 ± 0.029
ThrThr: 2.668 ± 0.027
ThrVal: 3.631 ± 0.029
ThrTrp: 0.786 ± 0.011
ThrTyr: 1.102 ± 0.015
ThrXaa: 0.0 ± 0.0

Val
ValAla: 7.021 ± 0.033
ValCys: 1.099 ± 0.013
ValAsp: 3.618 ± 0.023
ValGlu: 4.725 ± 0.027
ValPhe: 2.288 ± 0.019
ValGly: 4.9 ± 0.03
ValHis: 1.474 ± 0.015
ValIle: 2.388 ± 0.019
ValLys: 3.124 ± 0.023
ValLeu: 6.492 ± 0.032
ValMet: 1.221 ± 0.013
ValAsn: 1.984 ± 0.015
ValPro: 4.023 ± 0.024
ValGln: 2.708 ± 0.016
ValArg: 4.602 ± 0.029
ValSer: 4.903 ± 0.031
ValThr: 3.814 ± 0.03
ValVal: 4.972 ± 0.028
ValTrp: 0.857 ± 0.011
ValTyr: 1.532 ± 0.016
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.189 ± 0.014
TrpCys: 0.186 ± 0.004
TrpAsp: 0.732 ± 0.01
TrpGlu: 0.889 ± 0.011
TrpPhe: 0.43 ± 0.007
TrpGly: 1.011 ± 0.013
TrpHis: 0.325 ± 0.006
TrpIle: 0.419 ± 0.007
TrpLys: 0.68 ± 0.01
TrpLeu: 1.279 ± 0.014
TrpMet: 0.271 ± 0.006
TrpAsn: 0.449 ± 0.008
TrpPro: 0.622 ± 0.009
TrpGln: 0.635 ± 0.009
TrpArg: 1.043 ± 0.013
TrpSer: 0.856 ± 0.01
TrpThr: 0.674 ± 0.009
TrpVal: 0.797 ± 0.011
TrpTrp: 0.245 ± 0.006
TrpTyr: 0.322 ± 0.006
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.833 ± 0.017
TyrCys: 0.437 ± 0.008
TyrAsp: 1.166 ± 0.013
TyrGlu: 1.317 ± 0.014
TyrPhe: 0.864 ± 0.011
TyrGly: 1.72 ± 0.019
TyrHis: 0.488 ± 0.008
TyrIle: 0.778 ± 0.01
TyrLys: 0.96 ± 0.012
TyrLeu: 2.028 ± 0.019
TyrMet: 0.448 ± 0.007
TyrAsn: 0.748 ± 0.012
TyrPro: 1.039 ± 0.014
TyrGln: 0.806 ± 0.011
TyrArg: 1.334 ± 0.015
TyrSer: 1.435 ± 0.017
TyrThr: 1.109 ± 0.015
TyrVal: 1.484 ± 0.014
TyrTrp: 0.33 ± 0.008
TyrTyr: 0.619 ± 0.011
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.003 ± 0.002

Statistics based on 16251 proteins (8760576 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
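A protein-level bootstrap of this kind might look like the Python sketch below. It is a minimal illustration, not the database's actual procedure: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates. The function names, the seed, and the toy sequences are all hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all adjacent residue pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (assumed resampling unit).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy input, not taken from the proteome.
toy_proteins = ["MAAL", "MAG", "MKAAV", "MLLA"]
estimate = pair_per_mille(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {estimate:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one sequence together, so the error reflects variation between proteins.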

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski