Amino acid dipeptide frequency for Rhodopila globiformis (Rhodopseudomonas globiformis)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred uniformly at random in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
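The counting behind a table like this can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: it assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that any non-standard residue is collapsed to X (Xaa), and that the "expected" value is the uniform 1/400 baseline mentioned above. The function names and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard one-letter codes

# Uniform baseline from the text: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: non-standard or unknown residues are collapsed to 'X'.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Each sequence is scanned separately, so no pair spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def label(freq_per_mille):
    """Classify a frequency against the uniform expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


toy = ["MAAL", "AALB"]  # hypothetical toy sequences, not real proteins
freqs = dipeptide_per_mille(toy)
```

With this baseline, the AlaAla value from the table (21.308‰) would be labelled "over" and CysCys (0.176‰) "under".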

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
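As a small worked example of the unit conversion, the sketch below turns a per mille value into a plain fraction and into a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala = 21.308  # AlaAla frequency from the table, in per mille
ala_ala_fraction = per_mille_to_fraction(ala_ala)  # about 0.021308
ala_ala_percent = per_mille_to_percent(ala_ala)    # about 2.1308 %
```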

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 21.308 ± 0.174
AlaCys: 1.483 ± 0.031
AlaAsp: 7.498 ± 0.076
AlaGlu: 7.594 ± 0.084
AlaPhe: 4.199 ± 0.05
AlaGly: 12.097 ± 0.099
AlaHis: 2.599 ± 0.04
AlaIle: 6.1 ± 0.052
AlaLys: 3.006 ± 0.049
AlaLeu: 14.274 ± 0.114
AlaMet: 3.831 ± 0.04
AlaAsn: 2.875 ± 0.041
AlaPro: 6.491 ± 0.066
AlaGln: 4.193 ± 0.06
AlaArg: 10.221 ± 0.093
AlaSer: 6.208 ± 0.069
AlaThr: 6.809 ± 0.07
AlaVal: 10.099 ± 0.075
AlaTrp: 2.111 ± 0.035
AlaTyr: 2.518 ± 0.035
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.257 ± 0.026
CysCys: 0.176 ± 0.009
CysAsp: 0.595 ± 0.02
CysGlu: 0.413 ± 0.015
CysPhe: 0.37 ± 0.013
CysGly: 1.045 ± 0.029
CysHis: 0.306 ± 0.016
CysIle: 0.408 ± 0.014
CysLys: 0.184 ± 0.01
CysLeu: 0.98 ± 0.022
CysMet: 0.21 ± 0.011
CysAsn: 0.263 ± 0.012
CysPro: 0.577 ± 0.019
CysGln: 0.303 ± 0.013
CysArg: 0.844 ± 0.021
CysSer: 0.48 ± 0.016
CysThr: 0.482 ± 0.016
CysVal: 0.707 ± 0.023
CysTrp: 0.147 ± 0.009
CysTyr: 0.221 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.173 ± 0.072
AspCys: 0.534 ± 0.018
AspAsp: 2.889 ± 0.043
AspGlu: 2.749 ± 0.041
AspPhe: 1.894 ± 0.034
AspGly: 5.022 ± 0.066
AspHis: 1.291 ± 0.03
AspIle: 2.648 ± 0.041
AspLys: 1.253 ± 0.028
AspLeu: 5.948 ± 0.056
AspMet: 1.302 ± 0.03
AspAsn: 1.157 ± 0.026
AspPro: 3.874 ± 0.05
AspGln: 1.721 ± 0.032
AspArg: 4.584 ± 0.059
AspSer: 2.289 ± 0.038
AspThr: 2.723 ± 0.034
AspVal: 4.153 ± 0.051
AspTrp: 1.062 ± 0.025
AspTyr: 1.324 ± 0.029
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.611 ± 0.085
GluCys: 0.328 ± 0.013
GluAsp: 2.36 ± 0.042
GluGlu: 2.345 ± 0.046
GluPhe: 1.395 ± 0.029
GluGly: 3.339 ± 0.042
GluHis: 1.155 ± 0.029
GluIle: 2.677 ± 0.04
GluLys: 1.433 ± 0.036
GluLeu: 4.438 ± 0.054
GluMet: 1.256 ± 0.026
GluAsn: 1.148 ± 0.024
GluPro: 2.715 ± 0.04
GluGln: 2.18 ± 0.036
GluArg: 4.48 ± 0.061
GluSer: 2.049 ± 0.034
GluThr: 3.352 ± 0.043
GluVal: 3.311 ± 0.049
GluTrp: 0.612 ± 0.017
GluTyr: 0.896 ± 0.023
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.314 ± 0.048
PheCys: 0.438 ± 0.016
PheAsp: 2.227 ± 0.036
PheGlu: 1.606 ± 0.032
PhePhe: 1.16 ± 0.027
PheGly: 3.445 ± 0.05
PheHis: 0.827 ± 0.021
PheIle: 1.412 ± 0.033
PheLys: 0.651 ± 0.021
PheLeu: 3.267 ± 0.039
PheMet: 0.712 ± 0.018
PheAsn: 0.941 ± 0.022
PhePro: 1.637 ± 0.026
PheGln: 0.99 ± 0.022
PheArg: 2.447 ± 0.042
PheSer: 1.837 ± 0.03
PheThr: 1.873 ± 0.033
PheVal: 2.483 ± 0.037
PheTrp: 0.511 ± 0.017
PheTyr: 0.758 ± 0.022
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.729 ± 0.082
GlyCys: 1.031 ± 0.023
GlyAsp: 4.173 ± 0.049
GlyGlu: 3.814 ± 0.04
GlyPhe: 3.54 ± 0.046
GlyGly: 7.583 ± 0.086
GlyHis: 2.11 ± 0.033
GlyIle: 4.419 ± 0.061
GlyLys: 2.678 ± 0.05
GlyLeu: 9.293 ± 0.074
GlyMet: 2.519 ± 0.034
GlyAsn: 2.172 ± 0.039
GlyPro: 4.048 ± 0.053
GlyGln: 3.096 ± 0.046
GlyArg: 6.786 ± 0.07
GlySer: 4.597 ± 0.063
GlyThr: 5.036 ± 0.066
GlyVal: 6.245 ± 0.065
GlyTrp: 1.583 ± 0.03
GlyTyr: 2.217 ± 0.035
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.954 ± 0.046
HisCys: 0.275 ± 0.013
HisAsp: 1.402 ± 0.026
HisGlu: 1.02 ± 0.026
HisPhe: 0.8 ± 0.021
HisGly: 2.224 ± 0.037
HisHis: 0.674 ± 0.024
HisIle: 0.907 ± 0.022
HisLys: 0.429 ± 0.014
HisLeu: 2.224 ± 0.037
HisMet: 0.514 ± 0.016
HisAsn: 0.488 ± 0.019
HisPro: 1.681 ± 0.031
HisGln: 0.685 ± 0.019
HisArg: 1.786 ± 0.035
HisSer: 0.907 ± 0.023
HisThr: 0.977 ± 0.025
HisVal: 1.762 ± 0.031
HisTrp: 0.412 ± 0.018
HisTyr: 0.581 ± 0.019
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.805 ± 0.065
IleCys: 0.501 ± 0.017
IleAsp: 3.124 ± 0.038
IleGlu: 2.854 ± 0.04
IlePhe: 1.198 ± 0.028
IleGly: 4.962 ± 0.061
IleHis: 0.918 ± 0.02
IleIle: 1.88 ± 0.037
IleLys: 1.028 ± 0.027
IleLeu: 4.061 ± 0.047
IleMet: 0.887 ± 0.024
IleAsn: 1.193 ± 0.022
IlePro: 2.447 ± 0.035
IleGln: 1.261 ± 0.024
IleArg: 3.412 ± 0.045
IleSer: 2.247 ± 0.034
IleThr: 2.463 ± 0.042
IleVal: 3.821 ± 0.051
IleTrp: 0.574 ± 0.019
IleTyr: 0.936 ± 0.022
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.264 ± 0.055
LysCys: 0.134 ± 0.009
LysAsp: 1.265 ± 0.03
LysGlu: 1.065 ± 0.026
LysPhe: 0.617 ± 0.018
LysGly: 1.887 ± 0.037
LysHis: 0.478 ± 0.015
LysIle: 1.156 ± 0.026
LysLys: 0.707 ± 0.025
LysLeu: 2.575 ± 0.04
LysMet: 0.516 ± 0.017
LysAsn: 0.49 ± 0.016
LysPro: 1.805 ± 0.03
LysGln: 0.916 ± 0.021
LysArg: 1.844 ± 0.032
LysSer: 1.154 ± 0.024
LysThr: 1.377 ± 0.03
LysVal: 1.851 ± 0.039
LysTrp: 0.294 ± 0.014
LysTyr: 0.481 ± 0.017
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.264 ± 0.127
LeuCys: 1.062 ± 0.02
LeuAsp: 6.002 ± 0.055
LeuGlu: 4.689 ± 0.057
LeuPhe: 3.363 ± 0.044
LeuGly: 8.545 ± 0.08
LeuHis: 2.333 ± 0.035
LeuIle: 4.31 ± 0.044
LeuLys: 2.565 ± 0.043
LeuLeu: 10.678 ± 0.1
LeuMet: 2.207 ± 0.034
LeuAsn: 2.404 ± 0.04
LeuPro: 6.514 ± 0.064
LeuGln: 3.039 ± 0.049
LeuArg: 8.39 ± 0.076
LeuSer: 5.745 ± 0.065
LeuThr: 5.652 ± 0.052
LeuVal: 7.585 ± 0.076
LeuTrp: 1.28 ± 0.033
LeuTyr: 2.006 ± 0.036
LeuXaa: 0.001 ± 0.001
Met
MetAla: 3.314 ± 0.045
MetCys: 0.185 ± 0.009
MetAsp: 1.163 ± 0.026
MetGlu: 1.023 ± 0.023
MetPhe: 0.661 ± 0.018
MetGly: 1.651 ± 0.032
MetHis: 0.536 ± 0.015
MetIle: 1.196 ± 0.024
MetLys: 0.714 ± 0.022
MetLeu: 2.669 ± 0.042
MetMet: 0.63 ± 0.019
MetAsn: 0.696 ± 0.02
MetPro: 1.742 ± 0.034
MetGln: 1.004 ± 0.021
MetArg: 1.964 ± 0.032
MetSer: 1.418 ± 0.025
MetThr: 1.777 ± 0.029
MetVal: 1.653 ± 0.029
MetTrp: 0.204 ± 0.01
MetTyr: 0.327 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.09 ± 0.044
AsnCys: 0.26 ± 0.013
AsnAsp: 1.302 ± 0.028
AsnGlu: 1.096 ± 0.023
AsnPhe: 0.826 ± 0.022
AsnGly: 2.34 ± 0.043
AsnHis: 0.504 ± 0.016
AsnIle: 1.07 ± 0.025
AsnLys: 0.514 ± 0.021
AsnLeu: 2.421 ± 0.041
AsnMet: 0.498 ± 0.016
AsnAsn: 0.68 ± 0.024
AsnPro: 1.87 ± 0.035
AsnGln: 0.833 ± 0.02
AsnArg: 1.811 ± 0.029
AsnSer: 0.945 ± 0.024
AsnThr: 1.194 ± 0.028
AsnVal: 1.762 ± 0.031
AsnTrp: 0.41 ± 0.015
AsnTyr: 0.539 ± 0.016
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.173 ± 0.077
ProCys: 0.455 ± 0.015
ProAsp: 4.315 ± 0.044
ProGlu: 3.615 ± 0.044
ProPhe: 1.981 ± 0.032
ProGly: 5.876 ± 0.057
ProHis: 1.292 ± 0.027
ProIle: 2.348 ± 0.038
ProLys: 1.332 ± 0.029
ProLeu: 5.23 ± 0.06
ProMet: 1.479 ± 0.024
ProAsn: 1.37 ± 0.027
ProPro: 4.216 ± 0.076
ProGln: 1.723 ± 0.031
ProArg: 3.749 ± 0.044
ProSer: 2.772 ± 0.036
ProThr: 2.699 ± 0.035
ProVal: 4.808 ± 0.048
ProTrp: 0.856 ± 0.02
ProTyr: 1.17 ± 0.026
ProXaa: 0.001 ± 0.001
Gln
GlnAla: 5.407 ± 0.075
GlnCys: 0.253 ± 0.011
GlnAsp: 1.662 ± 0.03
GlnGlu: 1.554 ± 0.03
GlnPhe: 1.038 ± 0.025
GlnGly: 2.519 ± 0.038
GlnHis: 0.769 ± 0.02
GlnIle: 1.613 ± 0.03
GlnLys: 0.816 ± 0.02
GlnLeu: 2.792 ± 0.042
GlnMet: 0.817 ± 0.021
GlnAsn: 0.754 ± 0.022
GlnPro: 2.268 ± 0.036
GlnGln: 1.527 ± 0.04
GlnArg: 2.846 ± 0.041
GlnSer: 1.59 ± 0.032
GlnThr: 1.767 ± 0.031
GlnVal: 2.558 ± 0.034
GlnTrp: 0.416 ± 0.015
GlnTyr: 0.631 ± 0.018
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.133 ± 0.088
ArgCys: 0.747 ± 0.02
ArgAsp: 4.366 ± 0.051
ArgGlu: 3.632 ± 0.051
ArgPhe: 3.057 ± 0.04
ArgGly: 5.209 ± 0.055
ArgHis: 2.212 ± 0.039
ArgIle: 4.132 ± 0.047
ArgLys: 1.88 ± 0.032
ArgLeu: 9.336 ± 0.079
ArgMet: 2.056 ± 0.031
ArgAsn: 1.873 ± 0.031
ArgPro: 4.613 ± 0.058
ArgGln: 3.18 ± 0.053
ArgArg: 7.274 ± 0.081
ArgSer: 3.679 ± 0.043
ArgThr: 3.849 ± 0.046
ArgVal: 5.082 ± 0.06
ArgTrp: 1.144 ± 0.023
ArgTyr: 1.773 ± 0.031
ArgXaa: 0.001 ± 0.001
Ser
SerAla: 5.865 ± 0.068
SerCys: 0.474 ± 0.016
SerAsp: 2.498 ± 0.038
SerGlu: 2.084 ± 0.032
SerPhe: 1.849 ± 0.028
SerGly: 5.183 ± 0.055
SerHis: 1.08 ± 0.027
SerIle: 2.3 ± 0.036
SerLys: 1.021 ± 0.026
SerLeu: 5.236 ± 0.048
SerMet: 1.196 ± 0.026
SerAsn: 1.231 ± 0.028
SerPro: 2.92 ± 0.042
SerGln: 1.583 ± 0.032
SerArg: 3.522 ± 0.048
SerSer: 2.405 ± 0.044
SerThr: 2.38 ± 0.042
SerVal: 3.802 ± 0.045
SerTrp: 0.754 ± 0.02
SerTyr: 1.098 ± 0.025
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.778 ± 0.07
ThrCys: 0.499 ± 0.017
ThrAsp: 2.784 ± 0.044
ThrGlu: 2.637 ± 0.041
ThrPhe: 1.751 ± 0.034
ThrGly: 5.441 ± 0.066
ThrHis: 1.096 ± 0.024
ThrIle: 2.833 ± 0.043
ThrLys: 1.115 ± 0.025
ThrLeu: 6.08 ± 0.069
ThrMet: 1.235 ± 0.023
ThrAsn: 1.232 ± 0.029
ThrPro: 3.448 ± 0.046
ThrGln: 1.57 ± 0.034
ThrArg: 3.509 ± 0.039
ThrSer: 2.456 ± 0.044
ThrThr: 2.957 ± 0.052
ThrVal: 4.57 ± 0.052
ThrTrp: 0.828 ± 0.02
ThrTyr: 1.129 ± 0.026
ThrXaa: 0.001 ± 0.001
Val
ValAla: 10.18 ± 0.086
ValCys: 0.732 ± 0.018
ValAsp: 3.816 ± 0.047
ValGlu: 3.72 ± 0.043
ValPhe: 2.529 ± 0.035
ValGly: 5.338 ± 0.063
ValHis: 1.558 ± 0.028
ValIle: 3.693 ± 0.048
ValLys: 1.694 ± 0.033
ValLeu: 8.12 ± 0.075
ValMet: 1.888 ± 0.032
ValAsn: 1.993 ± 0.037
ValPro: 4.544 ± 0.051
ValGln: 2.344 ± 0.04
ValArg: 5.542 ± 0.055
ValSer: 3.974 ± 0.048
ValThr: 4.682 ± 0.062
ValVal: 6.28 ± 0.072
ValTrp: 1.021 ± 0.024
ValTyr: 1.349 ± 0.026
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.366 ± 0.027
TrpCys: 0.174 ± 0.01
TrpAsp: 0.763 ± 0.02
TrpGlu: 0.592 ± 0.017
TrpPhe: 0.557 ± 0.016
TrpGly: 0.948 ± 0.025
TrpHis: 0.439 ± 0.017
TrpIle: 0.724 ± 0.018
TrpLys: 0.409 ± 0.018
TrpLeu: 1.88 ± 0.038
TrpMet: 0.383 ± 0.015
TrpAsn: 0.462 ± 0.014
TrpPro: 0.911 ± 0.024
TrpGln: 0.683 ± 0.019
TrpArg: 1.389 ± 0.025
TrpSer: 0.782 ± 0.019
TrpThr: 0.854 ± 0.024
TrpVal: 0.909 ± 0.021
TrpTrp: 0.267 ± 0.012
TrpTyr: 0.331 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.528 ± 0.04
TyrCys: 0.227 ± 0.01
TyrAsp: 1.383 ± 0.028
TyrGlu: 0.979 ± 0.026
TyrPhe: 0.749 ± 0.023
TyrGly: 2.016 ± 0.038
TyrHis: 0.527 ± 0.016
TyrIle: 0.748 ± 0.02
TyrLys: 0.448 ± 0.017
TyrLeu: 2.132 ± 0.034
TyrMet: 0.39 ± 0.016
TyrAsn: 0.542 ± 0.018
TyrPro: 1.137 ± 0.025
TyrGln: 0.766 ± 0.021
TyrArg: 1.828 ± 0.033
TyrSer: 0.948 ± 0.027
TyrThr: 1.043 ± 0.025
TyrVal: 1.556 ± 0.032
TyrTrp: 0.337 ± 0.013
TyrTyr: 0.55 ± 0.021
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.001 ± 0.001
XaaMet: 0.001 ± 0.001
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.001
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.001 ± 0.001
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.001 ± 0.001
Statistics based on 6183 proteins (1919277 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
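A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the procedure used by Proteome-pI: it assumes that whole proteins are resampled with replacement, that the frequency of one dipeptide is recomputed for each of 100 replicates, and that the reported error is the sample standard deviation across replicates. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter pair in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Std. dev. of the per mille frequency of `pair` across bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    # Per protein: (occurrences of `pair`, total number of dipeptides).
    per_protein = [(count_pair(s, pair), max(len(s) - 1, 0)) for s in sequences]
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(t for _, t in sample)
        if total == 0:
            continue  # replicate with no dipeptides at all; skip it
        hits = sum(h for h, _ in sample)
        replicates.append(1000.0 * hits / total)
    return statistics.stdev(replicates) if len(replicates) > 1 else 0.0


toy = ["MAAL", "AAAA", "GGGG", "MKAA"]  # hypothetical toy proteome
err = bootstrap_error(toy, "AA")
```

Resampling at the protein level rather than at the level of individual dipeptides keeps the pairs from one protein together, so the spread reflects variation between proteins.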

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski