Amino acid dipeptide frequency for Rutstroemia sp. NJR-2017a BBW

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
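The counting described above can be sketched in a few lines of Python. This is only an illustration, not the database's actual pipeline: it assumes one-letter sequence codes, overlapping adjacent pairs counted within each protein, and normalization by the total number of pairs. The names `dipeptide_permille`, `classify` and `UNIFORM_PERMILLE` are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is mapped to X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein and
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["MAAS", "MSSL"]  # toy sequences, not real proteome data
    for dp, freq in sorted(dipeptide_permille(toy).items()):
        print(dp, round(freq, 3), classify(freq))
```

Under these assumptions, SerSer at 9.288‰ would be labelled as overrepresented and CysCys at 0.254‰ as underrepresented.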

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.
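As a small illustration of the unit conversion, the helpers below turn a per mille value from the table into a fraction or a percentage. The function names are hypothetical and are not part of any Proteome-pI tool.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


if __name__ == "__main__":
    ala_ala = 8.375  # AlaAla value from the table, in per mille
    print(permille_to_fraction(ala_ala), permille_to_percent(ala_ala))
```

For example, AlaAla at 8.375‰ corresponds to roughly 0.84%, more than three times the uniform expectation of 0.25%.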

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.375 ± 0.066
AlaCys: 1.021 ± 0.016
AlaAsp: 4.0 ± 0.028
AlaGlu: 5.023 ± 0.043
AlaPhe: 3.055 ± 0.029
AlaGly: 5.748 ± 0.039
AlaHis: 1.638 ± 0.021
AlaIle: 4.383 ± 0.036
AlaLys: 4.28 ± 0.039
AlaLeu: 7.436 ± 0.044
AlaMet: 1.928 ± 0.02
AlaAsn: 3.043 ± 0.027
AlaPro: 4.44 ± 0.046
AlaGln: 3.194 ± 0.027
AlaArg: 4.357 ± 0.041
AlaSer: 6.941 ± 0.05
AlaThr: 5.12 ± 0.035
AlaVal: 5.019 ± 0.038
AlaTrp: 1.084 ± 0.016
AlaTyr: 2.185 ± 0.023
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.907 ± 0.015
CysCys: 0.254 ± 0.009
CysAsp: 0.636 ± 0.012
CysGlu: 0.618 ± 0.013
CysPhe: 0.527 ± 0.01
CysGly: 0.976 ± 0.017
CysHis: 0.289 ± 0.008
CysIle: 0.776 ± 0.013
CysLys: 0.548 ± 0.011
CysLeu: 1.23 ± 0.016
CysMet: 0.277 ± 0.008
CysAsn: 0.43 ± 0.011
CysPro: 0.613 ± 0.013
CysGln: 0.412 ± 0.01
CysArg: 0.67 ± 0.013
CysSer: 0.922 ± 0.019
CysThr: 0.706 ± 0.012
CysVal: 0.784 ± 0.012
CysTrp: 0.198 ± 0.006
CysTyr: 0.372 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.379 ± 0.036
AspCys: 0.605 ± 0.012
AspAsp: 3.979 ± 0.045
AspGlu: 4.417 ± 0.042
AspPhe: 2.178 ± 0.022
AspGly: 3.949 ± 0.032
AspHis: 1.17 ± 0.018
AspIle: 3.249 ± 0.029
AspLys: 2.371 ± 0.025
AspLeu: 4.983 ± 0.039
AspMet: 1.259 ± 0.015
AspAsn: 1.845 ± 0.022
AspPro: 3.067 ± 0.029
AspGln: 1.695 ± 0.019
AspArg: 2.825 ± 0.031
AspSer: 4.055 ± 0.032
AspThr: 2.9 ± 0.024
AspVal: 3.585 ± 0.029
AspTrp: 0.874 ± 0.013
AspTyr: 1.613 ± 0.02
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.223 ± 0.044
GluCys: 0.626 ± 0.011
GluAsp: 4.29 ± 0.044
GluGlu: 6.053 ± 0.063
GluPhe: 2.023 ± 0.022
GluGly: 4.196 ± 0.035
GluHis: 1.309 ± 0.019
GluIle: 3.42 ± 0.029
GluLys: 4.308 ± 0.041
GluLeu: 5.196 ± 0.036
GluMet: 1.599 ± 0.019
GluAsn: 2.536 ± 0.024
GluPro: 2.55 ± 0.035
GluGln: 2.324 ± 0.026
GluArg: 3.946 ± 0.04
GluSer: 4.363 ± 0.033
GluThr: 3.391 ± 0.027
GluVal: 3.857 ± 0.033
GluTrp: 0.952 ± 0.015
GluTyr: 1.764 ± 0.021
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.991 ± 0.028
PheCys: 0.55 ± 0.012
PheAsp: 2.202 ± 0.023
PheGlu: 2.269 ± 0.022
PhePhe: 1.626 ± 0.023
PheGly: 2.925 ± 0.031
PheHis: 0.881 ± 0.014
PheIle: 1.944 ± 0.02
PheLys: 1.609 ± 0.018
PheLeu: 3.566 ± 0.03
PheMet: 0.835 ± 0.013
PheAsn: 1.481 ± 0.015
PhePro: 1.923 ± 0.021
PheGln: 1.35 ± 0.018
PheArg: 1.851 ± 0.022
PheSer: 3.091 ± 0.028
PheThr: 2.234 ± 0.022
PheVal: 2.368 ± 0.025
PheTrp: 0.638 ± 0.012
PheTyr: 1.124 ± 0.016
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.32 ± 0.042
GlyCys: 0.895 ± 0.015
GlyAsp: 3.587 ± 0.031
GlyGlu: 4.102 ± 0.036
GlyPhe: 2.846 ± 0.03
GlyGly: 6.235 ± 0.066
GlyHis: 1.606 ± 0.02
GlyIle: 3.761 ± 0.033
GlyLys: 3.972 ± 0.034
GlyLeu: 6.037 ± 0.044
GlyMet: 1.783 ± 0.022
GlyAsn: 2.74 ± 0.027
GlyPro: 3.03 ± 0.031
GlyGln: 2.291 ± 0.022
GlyArg: 4.088 ± 0.036
GlySer: 5.742 ± 0.04
GlyThr: 4.116 ± 0.033
GlyVal: 4.579 ± 0.034
GlyTrp: 1.23 ± 0.017
GlyTyr: 2.194 ± 0.025
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.629 ± 0.017
HisCys: 0.311 ± 0.009
HisAsp: 1.221 ± 0.016
HisGlu: 1.299 ± 0.018
HisPhe: 0.921 ± 0.013
HisGly: 1.576 ± 0.021
HisHis: 0.741 ± 0.015
HisIle: 1.263 ± 0.017
HisLys: 0.973 ± 0.016
HisLeu: 2.157 ± 0.024
HisMet: 0.464 ± 0.009
HisAsn: 0.884 ± 0.015
HisPro: 1.568 ± 0.021
HisGln: 0.893 ± 0.014
HisArg: 1.35 ± 0.018
HisSer: 1.829 ± 0.021
HisThr: 1.344 ± 0.017
HisVal: 1.285 ± 0.016
HisTrp: 0.32 ± 0.007
HisTyr: 0.688 ± 0.013
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.341 ± 0.031
IleCys: 0.83 ± 0.014
IleAsp: 3.009 ± 0.025
IleGlu: 3.164 ± 0.028
IlePhe: 2.181 ± 0.024
IleGly: 3.404 ± 0.033
IleHis: 1.27 ± 0.017
IleIle: 2.892 ± 0.03
IleLys: 2.445 ± 0.027
IleLeu: 5.048 ± 0.042
IleMet: 1.119 ± 0.016
IleAsn: 2.033 ± 0.023
IlePro: 3.234 ± 0.028
IleGln: 1.974 ± 0.026
IleArg: 2.848 ± 0.024
IleSer: 4.319 ± 0.035
IleThr: 3.2 ± 0.027
IleVal: 3.371 ± 0.032
IleTrp: 0.807 ± 0.013
IleTyr: 1.612 ± 0.021
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.457 ± 0.044
LysCys: 0.539 ± 0.011
LysAsp: 3.046 ± 0.03
LysGlu: 3.921 ± 0.038
LysPhe: 1.638 ± 0.021
LysGly: 3.383 ± 0.029
LysHis: 1.167 ± 0.015
LysIle: 2.656 ± 0.025
LysLys: 3.786 ± 0.046
LysLeu: 4.388 ± 0.037
LysMet: 1.138 ± 0.015
LysAsn: 2.022 ± 0.02
LysPro: 2.744 ± 0.03
LysGln: 1.845 ± 0.023
LysArg: 3.363 ± 0.031
LysSer: 3.972 ± 0.029
LysThr: 3.038 ± 0.027
LysVal: 3.105 ± 0.029
LysTrp: 0.744 ± 0.012
LysTyr: 1.558 ± 0.021
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.394 ± 0.042
LeuCys: 1.192 ± 0.016
LeuAsp: 4.948 ± 0.034
LeuGlu: 5.574 ± 0.047
LeuPhe: 3.288 ± 0.028
LeuGly: 6.063 ± 0.045
LeuHis: 2.181 ± 0.022
LeuIle: 4.236 ± 0.035
LeuLys: 4.533 ± 0.035
LeuLeu: 8.499 ± 0.069
LeuMet: 1.859 ± 0.02
LeuAsn: 3.32 ± 0.032
LeuPro: 5.383 ± 0.036
LeuGln: 3.706 ± 0.031
LeuArg: 5.357 ± 0.039
LeuSer: 7.384 ± 0.048
LeuThr: 4.944 ± 0.039
LeuVal: 5.246 ± 0.037
LeuTrp: 1.228 ± 0.019
LeuTyr: 2.463 ± 0.025
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.105 ± 0.022
MetCys: 0.237 ± 0.007
MetAsp: 1.287 ± 0.016
MetGlu: 1.393 ± 0.017
MetPhe: 0.78 ± 0.013
MetGly: 1.614 ± 0.019
MetHis: 0.486 ± 0.01
MetIle: 1.131 ± 0.016
MetLys: 1.196 ± 0.017
MetLeu: 1.859 ± 0.022
MetMet: 0.612 ± 0.012
MetAsn: 0.886 ± 0.013
MetPro: 1.165 ± 0.017
MetGln: 0.854 ± 0.016
MetArg: 1.248 ± 0.018
MetSer: 1.903 ± 0.02
MetThr: 1.316 ± 0.019
MetVal: 1.377 ± 0.017
MetTrp: 0.269 ± 0.007
MetTyr: 0.571 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.264 ± 0.027
AsnCys: 0.465 ± 0.011
AsnAsp: 2.038 ± 0.022
AsnGlu: 2.177 ± 0.021
AsnPhe: 1.507 ± 0.017
AsnGly: 3.243 ± 0.032
AsnHis: 0.896 ± 0.013
AsnIle: 2.33 ± 0.023
AsnLys: 1.7 ± 0.017
AsnLeu: 3.478 ± 0.028
AsnMet: 0.873 ± 0.013
AsnAsn: 1.675 ± 0.022
AsnPro: 2.485 ± 0.023
AsnGln: 1.329 ± 0.018
AsnArg: 1.904 ± 0.023
AsnSer: 3.099 ± 0.03
AsnThr: 2.454 ± 0.028
AsnVal: 2.363 ± 0.023
AsnTrp: 0.604 ± 0.013
AsnTyr: 1.223 ± 0.02
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.704 ± 0.044
ProCys: 0.456 ± 0.011
ProAsp: 2.82 ± 0.023
ProGlu: 3.628 ± 0.038
ProPhe: 1.97 ± 0.021
ProGly: 3.524 ± 0.031
ProHis: 1.232 ± 0.019
ProIle: 2.735 ± 0.027
ProLys: 2.807 ± 0.029
ProLeu: 4.641 ± 0.033
ProMet: 1.017 ± 0.015
ProAsn: 2.315 ± 0.024
ProPro: 4.59 ± 0.063
ProGln: 2.311 ± 0.024
ProArg: 3.061 ± 0.031
ProSer: 5.828 ± 0.051
ProThr: 3.974 ± 0.036
ProVal: 3.296 ± 0.028
ProTrp: 0.671 ± 0.012
ProTyr: 1.52 ± 0.018
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.13 ± 0.029
GlnCys: 0.413 ± 0.01
GlnAsp: 1.881 ± 0.021
GlnGlu: 2.317 ± 0.025
GlnPhe: 1.269 ± 0.018
GlnGly: 2.261 ± 0.025
GlnHis: 0.932 ± 0.015
GlnIle: 1.972 ± 0.019
GlnLys: 2.12 ± 0.023
GlnLeu: 3.194 ± 0.032
GlnMet: 0.856 ± 0.014
GlnAsn: 1.643 ± 0.018
GlnPro: 2.13 ± 0.028
GlnGln: 2.032 ± 0.036
GlnArg: 2.321 ± 0.026
GlnSer: 3.004 ± 0.032
GlnThr: 2.234 ± 0.022
GlnVal: 2.067 ± 0.021
GlnTrp: 0.547 ± 0.012
GlnTyr: 1.173 ± 0.019
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.237 ± 0.035
ArgCys: 0.667 ± 0.013
ArgAsp: 3.07 ± 0.029
ArgGlu: 3.899 ± 0.042
ArgPhe: 1.966 ± 0.022
ArgGly: 3.716 ± 0.036
ArgHis: 1.353 ± 0.019
ArgIle: 2.901 ± 0.025
ArgLys: 3.603 ± 0.032
ArgLeu: 4.985 ± 0.041
ArgMet: 1.305 ± 0.017
ArgAsn: 2.329 ± 0.023
ArgPro: 3.013 ± 0.03
ArgGln: 2.226 ± 0.024
ArgArg: 4.507 ± 0.043
ArgSer: 4.475 ± 0.038
ArgThr: 3.164 ± 0.028
ArgVal: 3.099 ± 0.025
ArgTrp: 0.884 ± 0.014
ArgTyr: 1.608 ± 0.023
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.585 ± 0.047
SerCys: 0.896 ± 0.015
SerAsp: 4.022 ± 0.035
SerGlu: 4.215 ± 0.036
SerPhe: 3.095 ± 0.028
SerGly: 5.635 ± 0.048
SerHis: 1.898 ± 0.021
SerIle: 4.541 ± 0.033
SerLys: 4.163 ± 0.035
SerLeu: 7.142 ± 0.04
SerMet: 1.791 ± 0.021
SerAsn: 3.409 ± 0.033
SerPro: 5.326 ± 0.05
SerGln: 3.079 ± 0.026
SerArg: 4.66 ± 0.042
SerSer: 9.288 ± 0.095
SerThr: 6.11 ± 0.053
SerVal: 4.559 ± 0.035
SerTrp: 1.124 ± 0.015
SerTyr: 2.278 ± 0.023
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.003 ± 0.04
ThrCys: 0.757 ± 0.014
ThrAsp: 2.807 ± 0.026
ThrGlu: 3.158 ± 0.026
ThrPhe: 2.368 ± 0.02
ThrGly: 4.228 ± 0.033
ThrHis: 1.315 ± 0.017
ThrIle: 3.467 ± 0.027
ThrLys: 2.837 ± 0.027
ThrLeu: 5.329 ± 0.038
ThrMet: 1.177 ± 0.016
ThrAsn: 2.394 ± 0.025
ThrPro: 4.373 ± 0.043
ThrGln: 2.106 ± 0.021
ThrArg: 2.95 ± 0.027
ThrSer: 5.857 ± 0.051
ThrThr: 4.64 ± 0.056
ThrVal: 3.65 ± 0.032
ThrTrp: 0.833 ± 0.013
ThrTyr: 1.774 ± 0.023
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.957 ± 0.036
ValCys: 0.79 ± 0.014
ValAsp: 3.584 ± 0.029
ValGlu: 4.128 ± 0.034
ValPhe: 2.428 ± 0.022
ValGly: 4.254 ± 0.036
ValHis: 1.277 ± 0.015
ValIle: 3.058 ± 0.027
ValLys: 3.144 ± 0.027
ValLeu: 5.529 ± 0.037
ValMet: 1.356 ± 0.017
ValAsn: 2.269 ± 0.024
ValPro: 3.364 ± 0.028
ValGln: 2.238 ± 0.021
ValArg: 3.256 ± 0.027
ValSer: 4.51 ± 0.036
ValThr: 3.424 ± 0.028
ValVal: 4.36 ± 0.032
ValTrp: 0.882 ± 0.015
ValTyr: 1.75 ± 0.019
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.081 ± 0.017
TrpCys: 0.2 ± 0.007
TrpAsp: 0.9 ± 0.014
TrpGlu: 0.918 ± 0.014
TrpPhe: 0.547 ± 0.012
TrpGly: 1.045 ± 0.016
TrpHis: 0.326 ± 0.009
TrpIle: 0.818 ± 0.014
TrpLys: 0.857 ± 0.013
TrpLeu: 1.326 ± 0.019
TrpMet: 0.397 ± 0.009
TrpAsn: 0.639 ± 0.012
TrpPro: 0.564 ± 0.011
TrpGln: 0.526 ± 0.009
TrpArg: 0.896 ± 0.012
TrpSer: 1.035 ± 0.017
TrpThr: 0.916 ± 0.015
TrpVal: 0.922 ± 0.014
TrpTrp: 0.275 ± 0.007
TrpTyr: 0.439 ± 0.011
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.189 ± 0.021
TyrCys: 0.441 ± 0.01
TyrAsp: 1.673 ± 0.02
TyrGlu: 1.651 ± 0.019
TyrPhe: 1.244 ± 0.017
TyrGly: 2.177 ± 0.024
TyrHis: 0.75 ± 0.011
TyrIle: 1.568 ± 0.019
TyrLys: 1.26 ± 0.017
TyrLeu: 2.771 ± 0.029
TyrMet: 0.642 ± 0.011
TyrAsn: 1.216 ± 0.016
TyrPro: 1.54 ± 0.02
TyrGln: 1.112 ± 0.018
TyrArg: 1.551 ± 0.018
TyrSer: 2.199 ± 0.027
TyrThr: 1.799 ± 0.022
TyrVal: 1.667 ± 0.019
TyrTrp: 0.462 ± 0.01
TyrTyr: 0.987 ± 0.016
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10779 proteins (4847628 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
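A protein-level bootstrap of this kind can be sketched as follows. The source does not state which statistic is reported after the ± sign, so this sketch assumes it is the standard deviation of the per-replicate estimates. The function names, the seed, and the one-letter input format are likewise assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so all pairs from one
    protein enter or leave a replicate together.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MAASSL", "MSSLAA", "MKSSAL"]  # toy sequences, not real proteome data
    mean, err = bootstrap_error(toy, "SS")
    print(f"SS: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the correlation between pairs from the same protein, which is what "at the protein level" suggests.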

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski