Amino acid dipepetide frequency for Legionella jordanis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.886AlaAla: 6.886 ± 0.122
1.134AlaCys: 1.134 ± 0.04
3.527AlaAsp: 3.527 ± 0.065
4.856AlaGlu: 4.856 ± 0.078
3.543AlaPhe: 3.543 ± 0.066
5.101AlaGly: 5.101 ± 0.106
1.844AlaHis: 1.844 ± 0.042
6.338AlaIle: 6.338 ± 0.092
5.073AlaLys: 5.073 ± 0.08
9.665AlaLeu: 9.665 ± 0.128
2.235AlaMet: 2.235 ± 0.051
3.671AlaAsn: 3.671 ± 0.075
2.525AlaPro: 2.525 ± 0.058
3.756AlaGln: 3.756 ± 0.085
3.537AlaArg: 3.537 ± 0.064
5.157AlaSer: 5.157 ± 0.091
3.644AlaThr: 3.644 ± 0.071
5.211AlaVal: 5.211 ± 0.081
0.835AlaTrp: 0.835 ± 0.032
2.582AlaTyr: 2.582 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.843CysAla: 0.843 ± 0.034
0.222CysCys: 0.222 ± 0.015
0.573CysAsp: 0.573 ± 0.025
0.697CysGlu: 0.697 ± 0.03
0.729CysPhe: 0.729 ± 0.03
0.897CysGly: 0.897 ± 0.034
0.365CysHis: 0.365 ± 0.019
0.811CysIle: 0.811 ± 0.029
0.562CysLys: 0.562 ± 0.028
1.457CysLeu: 1.457 ± 0.039
0.277CysMet: 0.277 ± 0.018
0.451CysAsn: 0.451 ± 0.022
0.528CysPro: 0.528 ± 0.027
0.588CysGln: 0.588 ± 0.024
0.534CysArg: 0.534 ± 0.023
0.826CysSer: 0.826 ± 0.036
0.487CysThr: 0.487 ± 0.025
0.659CysVal: 0.659 ± 0.029
0.139CysTrp: 0.139 ± 0.013
0.49CysTyr: 0.49 ± 0.028
0.0CysXaa: 0.0 ± 0.0
Asp
3.484AspAla: 3.484 ± 0.073
0.638AspCys: 0.638 ± 0.026
2.219AspAsp: 2.219 ± 0.057
3.627AspGlu: 3.627 ± 0.065
2.481AspPhe: 2.481 ± 0.053
2.504AspGly: 2.504 ± 0.056
1.015AspHis: 1.015 ± 0.037
3.228AspIle: 3.228 ± 0.061
3.054AspLys: 3.054 ± 0.068
5.196AspLeu: 5.196 ± 0.061
1.047AspMet: 1.047 ± 0.035
2.181AspAsn: 2.181 ± 0.044
1.863AspPro: 1.863 ± 0.042
1.482AspGln: 1.482 ± 0.045
1.927AspArg: 1.927 ± 0.043
3.079AspSer: 3.079 ± 0.058
1.592AspThr: 1.592 ± 0.043
3.0AspVal: 3.0 ± 0.061
0.722AspTrp: 0.722 ± 0.027
2.07AspTyr: 2.07 ± 0.045
0.0AspXaa: 0.0 ± 0.0
Glu
4.973GluAla: 4.973 ± 0.082
0.595GluCys: 0.595 ± 0.023
2.724GluAsp: 2.724 ± 0.052
4.627GluGlu: 4.627 ± 0.109
2.515GluPhe: 2.515 ± 0.055
3.216GluGly: 3.216 ± 0.06
1.754GluHis: 1.754 ± 0.044
4.452GluIle: 4.452 ± 0.075
4.242GluLys: 4.242 ± 0.085
7.032GluLeu: 7.032 ± 0.111
1.481GluMet: 1.481 ± 0.047
2.872GluAsn: 2.872 ± 0.055
1.963GluPro: 1.963 ± 0.049
3.906GluGln: 3.906 ± 0.067
3.192GluArg: 3.192 ± 0.072
3.311GluSer: 3.311 ± 0.062
2.911GluThr: 2.911 ± 0.054
3.66GluVal: 3.66 ± 0.072
0.652GluTrp: 0.652 ± 0.028
1.728GluTyr: 1.728 ± 0.043
0.001GluXaa: 0.001 ± 0.001
Phe
3.642PheAla: 3.642 ± 0.063
0.688PheCys: 0.688 ± 0.027
2.312PheAsp: 2.312 ± 0.052
2.512PheGlu: 2.512 ± 0.059
2.557PhePhe: 2.557 ± 0.064
2.747PheGly: 2.747 ± 0.06
1.065PheHis: 1.065 ± 0.04
3.447PheIle: 3.447 ± 0.066
2.605PheLys: 2.605 ± 0.055
5.108PheLeu: 5.108 ± 0.091
1.035PheMet: 1.035 ± 0.036
2.49PheAsn: 2.49 ± 0.062
1.91PhePro: 1.91 ± 0.04
1.695PheGln: 1.695 ± 0.038
1.779PheArg: 1.779 ± 0.043
3.685PheSer: 3.685 ± 0.07
2.161PheThr: 2.161 ± 0.05
2.426PheVal: 2.426 ± 0.053
0.594PheTrp: 0.594 ± 0.027
1.732PheTyr: 1.732 ± 0.049
0.0PheXaa: 0.0 ± 0.0
Gly
4.168GlyAla: 4.168 ± 0.091
0.808GlyCys: 0.808 ± 0.035
2.675GlyAsp: 2.675 ± 0.056
3.433GlyGlu: 3.433 ± 0.066
3.353GlyPhe: 3.353 ± 0.069
3.921GlyGly: 3.921 ± 0.094
1.623GlyHis: 1.623 ± 0.041
4.671GlyIle: 4.671 ± 0.075
3.824GlyLys: 3.824 ± 0.072
6.873GlyLeu: 6.873 ± 0.104
1.534GlyMet: 1.534 ± 0.043
2.43GlyAsn: 2.43 ± 0.054
1.729GlyPro: 1.729 ± 0.048
2.728GlyGln: 2.728 ± 0.065
2.729GlyArg: 2.729 ± 0.063
3.768GlySer: 3.768 ± 0.077
2.832GlyThr: 2.832 ± 0.071
3.874GlyVal: 3.874 ± 0.076
0.863GlyTrp: 0.863 ± 0.033
2.336GlyTyr: 2.336 ± 0.051
0.001GlyXaa: 0.001 ± 0.001
His
1.881HisAla: 1.881 ± 0.047
0.417HisCys: 0.417 ± 0.023
1.094HisAsp: 1.094 ± 0.031
1.496HisGlu: 1.496 ± 0.036
1.459HisPhe: 1.459 ± 0.042
1.609HisGly: 1.609 ± 0.044
0.915HisHis: 0.915 ± 0.036
1.474HisIle: 1.474 ± 0.039
1.115HisLys: 1.115 ± 0.037
3.053HisLeu: 3.053 ± 0.059
0.519HisMet: 0.519 ± 0.025
0.953HisAsn: 0.953 ± 0.033
1.392HisPro: 1.392 ± 0.046
1.309HisGln: 1.309 ± 0.039
1.271HisArg: 1.271 ± 0.042
1.802HisSer: 1.802 ± 0.053
0.883HisThr: 0.883 ± 0.031
1.308HisVal: 1.308 ± 0.037
0.469HisTrp: 0.469 ± 0.026
1.138HisTyr: 1.138 ± 0.04
0.0HisXaa: 0.0 ± 0.0
Ile
6.368IleAla: 6.368 ± 0.093
0.887IleCys: 0.887 ± 0.033
3.778IleAsp: 3.778 ± 0.062
4.472IleGlu: 4.472 ± 0.085
2.765IlePhe: 2.765 ± 0.064
4.233IleGly: 4.233 ± 0.088
1.767IleHis: 1.767 ± 0.053
4.694IleIle: 4.694 ± 0.08
4.377IleLys: 4.377 ± 0.078
7.097IleLeu: 7.097 ± 0.095
1.388IleMet: 1.388 ± 0.046
3.629IleAsn: 3.629 ± 0.072
3.288IlePro: 3.288 ± 0.064
2.761IleGln: 2.761 ± 0.048
3.126IleArg: 3.126 ± 0.069
4.872IleSer: 4.872 ± 0.083
3.546IleThr: 3.546 ± 0.068
3.855IleVal: 3.855 ± 0.071
0.616IleTrp: 0.616 ± 0.028
2.09IleTyr: 2.09 ± 0.049
0.0IleXaa: 0.0 ± 0.0
Lys
5.048LysAla: 5.048 ± 0.073
0.394LysCys: 0.394 ± 0.023
2.945LysAsp: 2.945 ± 0.056
4.352LysGlu: 4.352 ± 0.085
1.715LysPhe: 1.715 ± 0.048
3.083LysGly: 3.083 ± 0.058
1.516LysHis: 1.516 ± 0.045
4.159LysIle: 4.159 ± 0.08
4.148LysLys: 4.148 ± 0.082
5.947LysLeu: 5.947 ± 0.084
1.388LysMet: 1.388 ± 0.039
3.18LysAsn: 3.18 ± 0.069
2.907LysPro: 2.907 ± 0.057
3.61LysGln: 3.61 ± 0.067
3.124LysArg: 3.124 ± 0.065
3.507LysSer: 3.507 ± 0.07
3.479LysThr: 3.479 ± 0.069
3.027LysVal: 3.027 ± 0.06
0.566LysTrp: 0.566 ± 0.025
1.504LysTyr: 1.504 ± 0.038
0.0LysXaa: 0.0 ± 0.0
Leu
10.231LeuAla: 10.231 ± 0.112
1.405LeuCys: 1.405 ± 0.04
5.184LeuAsp: 5.184 ± 0.079
6.285LeuGlu: 6.285 ± 0.101
5.368LeuPhe: 5.368 ± 0.085
6.778LeuGly: 6.778 ± 0.093
2.57LeuHis: 2.57 ± 0.054
7.983LeuIle: 7.983 ± 0.11
7.024LeuLys: 7.024 ± 0.101
13.014LeuLeu: 13.014 ± 0.164
2.688LeuMet: 2.688 ± 0.053
5.735LeuAsn: 5.735 ± 0.081
5.105LeuPro: 5.105 ± 0.079
5.135LeuGln: 5.135 ± 0.087
4.95LeuArg: 4.95 ± 0.083
8.793LeuSer: 8.793 ± 0.118
5.768LeuThr: 5.768 ± 0.079
6.028LeuVal: 6.028 ± 0.078
1.137LeuTrp: 1.137 ± 0.04
3.183LeuTyr: 3.183 ± 0.061
0.001LeuXaa: 0.001 ± 0.001
Met
2.238MetAla: 2.238 ± 0.049
0.145MetCys: 0.145 ± 0.013
1.153MetAsp: 1.153 ± 0.037
1.252MetGlu: 1.252 ± 0.043
0.753MetPhe: 0.753 ± 0.03
1.485MetGly: 1.485 ± 0.045
0.58MetHis: 0.58 ± 0.025
1.428MetIle: 1.428 ± 0.04
1.527MetLys: 1.527 ± 0.039
2.616MetLeu: 2.616 ± 0.054
0.622MetMet: 0.622 ± 0.029
1.138MetAsn: 1.138 ± 0.033
1.113MetPro: 1.113 ± 0.039
1.226MetGln: 1.226 ± 0.039
1.16MetArg: 1.16 ± 0.033
1.548MetSer: 1.548 ± 0.042
1.32MetThr: 1.32 ± 0.042
1.401MetVal: 1.401 ± 0.038
0.175MetTrp: 0.175 ± 0.015
0.487MetTyr: 0.487 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
3.438AsnAla: 3.438 ± 0.068
0.607AsnCys: 0.607 ± 0.027
2.057AsnAsp: 2.057 ± 0.05
2.81AsnGlu: 2.81 ± 0.058
2.006AsnPhe: 2.006 ± 0.051
2.543AsnGly: 2.543 ± 0.058
1.407AsnHis: 1.407 ± 0.039
3.005AsnIle: 3.005 ± 0.058
2.737AsnLys: 2.737 ± 0.058
5.197AsnLeu: 5.197 ± 0.088
0.888AsnMet: 0.888 ± 0.03
2.401AsnAsn: 2.401 ± 0.058
2.807AsnPro: 2.807 ± 0.069
2.699AsnGln: 2.699 ± 0.06
2.206AsnArg: 2.206 ± 0.045
3.166AsnSer: 3.166 ± 0.064
2.034AsnThr: 2.034 ± 0.056
2.215AsnVal: 2.215 ± 0.05
0.647AsnTrp: 0.647 ± 0.026
1.824AsnTyr: 1.824 ± 0.048
0.001AsnXaa: 0.001 ± 0.001
Pro
3.188ProAla: 3.188 ± 0.065
0.452ProCys: 0.452 ± 0.024
2.094ProAsp: 2.094 ± 0.048
3.095ProGlu: 3.095 ± 0.064
2.008ProPhe: 2.008 ± 0.052
2.735ProGly: 2.735 ± 0.055
1.025ProHis: 1.025 ± 0.038
2.691ProIle: 2.691 ± 0.048
2.365ProLys: 2.365 ± 0.054
4.589ProLeu: 4.589 ± 0.081
0.928ProMet: 0.928 ± 0.032
1.918ProAsn: 1.918 ± 0.045
1.53ProPro: 1.53 ± 0.046
1.898ProGln: 1.898 ± 0.047
1.503ProArg: 1.503 ± 0.044
2.852ProSer: 2.852 ± 0.056
1.876ProThr: 1.876 ± 0.051
2.895ProVal: 2.895 ± 0.061
0.524ProTrp: 0.524 ± 0.025
1.444ProTyr: 1.444 ± 0.041
0.001ProXaa: 0.001 ± 0.001
Gln
4.409GlnAla: 4.409 ± 0.082
0.481GlnCys: 0.481 ± 0.023
2.045GlnAsp: 2.045 ± 0.053
3.08GlnGlu: 3.08 ± 0.07
2.315GlnPhe: 2.315 ± 0.051
2.783GlnGly: 2.783 ± 0.063
1.261GlnHis: 1.261 ± 0.039
3.159GlnIle: 3.159 ± 0.063
2.798GlnLys: 2.798 ± 0.066
5.663GlnLeu: 5.663 ± 0.099
1.083GlnMet: 1.083 ± 0.031
2.213GlnAsn: 2.213 ± 0.054
1.751GlnPro: 1.751 ± 0.048
3.224GlnGln: 3.224 ± 0.088
2.326GlnArg: 2.326 ± 0.054
2.922GlnSer: 2.922 ± 0.055
2.375GlnThr: 2.375 ± 0.052
2.787GlnVal: 2.787 ± 0.061
0.624GlnTrp: 0.624 ± 0.029
1.517GlnTyr: 1.517 ± 0.043
0.0GlnXaa: 0.0 ± 0.0
Arg
3.39ArgAla: 3.39 ± 0.067
0.495ArgCys: 0.495 ± 0.027
2.176ArgAsp: 2.176 ± 0.05
3.23ArgGlu: 3.23 ± 0.065
2.299ArgPhe: 2.299 ± 0.057
2.477ArgGly: 2.477 ± 0.061
1.252ArgHis: 1.252 ± 0.041
3.235ArgIle: 3.235 ± 0.059
2.651ArgLys: 2.651 ± 0.056
5.485ArgLeu: 5.485 ± 0.089
1.118ArgMet: 1.118 ± 0.033
1.941ArgAsn: 1.941 ± 0.046
1.564ArgPro: 1.564 ± 0.042
2.412ArgGln: 2.412 ± 0.055
2.307ArgArg: 2.307 ± 0.058
2.517ArgSer: 2.517 ± 0.054
1.902ArgThr: 1.902 ± 0.046
2.792ArgVal: 2.792 ± 0.059
0.591ArgTrp: 0.591 ± 0.023
1.777ArgTyr: 1.777 ± 0.042
0.0ArgXaa: 0.0 ± 0.0
Ser
4.738SerAla: 4.738 ± 0.081
0.893SerCys: 0.893 ± 0.033
2.738SerAsp: 2.738 ± 0.055
3.624SerGlu: 3.624 ± 0.064
3.254SerPhe: 3.254 ± 0.062
4.243SerGly: 4.243 ± 0.075
1.777SerHis: 1.777 ± 0.039
4.538SerIle: 4.538 ± 0.082
3.747SerLys: 3.747 ± 0.073
8.321SerLeu: 8.321 ± 0.1
1.733SerMet: 1.733 ± 0.048
3.119SerAsn: 3.119 ± 0.054
2.937SerPro: 2.937 ± 0.065
3.187SerGln: 3.187 ± 0.062
3.038SerArg: 3.038 ± 0.048
4.825SerSer: 4.825 ± 0.081
3.093SerThr: 3.093 ± 0.06
3.505SerVal: 3.505 ± 0.064
0.905SerTrp: 0.905 ± 0.035
2.328SerTyr: 2.328 ± 0.054
0.0SerXaa: 0.0 ± 0.0
Thr
4.157ThrAla: 4.157 ± 0.074
0.523ThrCys: 0.523 ± 0.027
2.087ThrAsp: 2.087 ± 0.041
2.573ThrGlu: 2.573 ± 0.063
1.919ThrPhe: 1.919 ± 0.046
3.479ThrGly: 3.479 ± 0.076
1.206ThrHis: 1.206 ± 0.039
3.337ThrIle: 3.337 ± 0.059
2.281ThrLys: 2.281 ± 0.051
5.457ThrLeu: 5.457 ± 0.084
0.997ThrMet: 0.997 ± 0.031
1.762ThrAsn: 1.762 ± 0.052
2.396ThrPro: 2.396 ± 0.054
2.067ThrGln: 2.067 ± 0.055
2.055ThrArg: 2.055 ± 0.048
2.939ThrSer: 2.939 ± 0.056
2.639ThrThr: 2.639 ± 0.064
3.218ThrVal: 3.218 ± 0.064
0.528ThrTrp: 0.528 ± 0.027
1.434ThrTyr: 1.434 ± 0.039
0.0ThrXaa: 0.0 ± 0.0
Val
4.867ValAla: 4.867 ± 0.077
0.674ValCys: 0.674 ± 0.026
3.066ValAsp: 3.066 ± 0.055
3.427ValGlu: 3.427 ± 0.065
2.685ValPhe: 2.685 ± 0.057
3.482ValGly: 3.482 ± 0.072
1.287ValHis: 1.287 ± 0.041
4.38ValIle: 4.38 ± 0.075
3.38ValLys: 3.38 ± 0.068
6.628ValLeu: 6.628 ± 0.096
1.501ValMet: 1.501 ± 0.042
2.85ValAsn: 2.85 ± 0.059
2.258ValPro: 2.258 ± 0.053
2.172ValGln: 2.172 ± 0.053
2.464ValArg: 2.464 ± 0.054
4.033ValSer: 4.033 ± 0.07
2.756ValThr: 2.756 ± 0.064
3.933ValVal: 3.933 ± 0.081
0.628ValTrp: 0.628 ± 0.026
1.766ValTyr: 1.766 ± 0.04
0.0ValXaa: 0.0 ± 0.0
Trp
0.738TrpAla: 0.738 ± 0.03
0.114TrpCys: 0.114 ± 0.01
0.514TrpAsp: 0.514 ± 0.023
0.553TrpGlu: 0.553 ± 0.026
0.618TrpPhe: 0.618 ± 0.025
0.71TrpGly: 0.71 ± 0.029
0.37TrpHis: 0.37 ± 0.021
0.793TrpIle: 0.793 ± 0.03
0.521TrpLys: 0.521 ± 0.029
1.813TrpLeu: 1.813 ± 0.054
0.307TrpMet: 0.307 ± 0.02
0.494TrpAsn: 0.494 ± 0.024
0.508TrpPro: 0.508 ± 0.028
0.907TrpGln: 0.907 ± 0.03
0.591TrpArg: 0.591 ± 0.026
0.651TrpSer: 0.651 ± 0.025
0.452TrpThr: 0.452 ± 0.023
0.684TrpVal: 0.684 ± 0.028
0.173TrpTrp: 0.173 ± 0.014
0.389TrpTyr: 0.389 ± 0.022
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.432TyrAla: 2.432 ± 0.052
0.563TyrCys: 0.563 ± 0.023
1.448TyrAsp: 1.448 ± 0.041
1.803TyrGlu: 1.803 ± 0.047
1.848TyrPhe: 1.848 ± 0.049
2.144TyrGly: 2.144 ± 0.044
0.957TyrHis: 0.957 ± 0.03
1.784TyrIle: 1.784 ± 0.051
1.567TyrLys: 1.567 ± 0.042
4.154TyrLeu: 4.154 ± 0.078
0.6TyrMet: 0.6 ± 0.025
1.268TyrAsn: 1.268 ± 0.042
1.537TyrPro: 1.537 ± 0.043
2.052TyrGln: 2.052 ± 0.045
1.794TyrArg: 1.794 ± 0.049
2.289TyrSer: 2.289 ± 0.052
1.304TyrThr: 1.304 ± 0.037
1.758TyrVal: 1.758 ± 0.039
0.514TyrTrp: 0.514 ± 0.024
1.372TyrTyr: 1.372 ± 0.044
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.001XaaLys: 0.001 ± 0.001
0.001XaaLeu: 0.001 ± 0.001
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.001
0.001XaaGln: 0.001 ± 0.001
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.015XaaXaa: 0.015 ± 0.007
Statistics based on 2787 proteins (923784 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski