Amino acid dipepetide frequency for Lupinus albus (White lupine) (Lupinus termis)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.485AlaAla: 6.485 ± 0.043
1.135AlaCys: 1.135 ± 0.008
3.26AlaAsp: 3.26 ± 0.019
4.153AlaGlu: 4.153 ± 0.02
2.644AlaPhe: 2.644 ± 0.013
4.717AlaGly: 4.717 ± 0.042
1.451AlaHis: 1.451 ± 0.013
3.913AlaIle: 3.913 ± 0.016
3.756AlaLys: 3.756 ± 0.016
6.594AlaLeu: 6.594 ± 0.024
1.855AlaMet: 1.855 ± 0.01
2.567AlaAsn: 2.567 ± 0.013
3.062AlaPro: 3.062 ± 0.02
2.444AlaGln: 2.444 ± 0.015
3.912AlaArg: 3.912 ± 0.036
5.466AlaSer: 5.466 ± 0.019
3.669AlaThr: 3.669 ± 0.017
4.651AlaVal: 4.651 ± 0.024
0.736AlaTrp: 0.736 ± 0.007
1.843AlaTyr: 1.843 ± 0.01
0.0AlaXaa: 0.0 ± 0.0
Cys
0.941CysAla: 0.941 ± 0.008
0.456CysCys: 0.456 ± 0.006
0.844CysAsp: 0.844 ± 0.007
0.837CysGlu: 0.837 ± 0.007
0.79CysPhe: 0.79 ± 0.007
1.32CysGly: 1.32 ± 0.009
0.474CysHis: 0.474 ± 0.006
0.953CysIle: 0.953 ± 0.007
1.012CysLys: 1.012 ± 0.008
1.611CysLeu: 1.611 ± 0.01
0.391CysMet: 0.391 ± 0.005
0.851CysAsn: 0.851 ± 0.007
0.844CysPro: 0.844 ± 0.007
0.528CysGln: 0.528 ± 0.006
1.002CysArg: 1.002 ± 0.01
1.639CysSer: 1.639 ± 0.011
0.813CysThr: 0.813 ± 0.007
0.972CysVal: 0.972 ± 0.009
0.2CysTrp: 0.2 ± 0.003
0.503CysTyr: 0.503 ± 0.006
0.0CysXaa: 0.0 ± 0.0
Asp
3.652AspAla: 3.652 ± 0.019
0.852AspCys: 0.852 ± 0.008
3.626AspAsp: 3.626 ± 0.02
3.949AspGlu: 3.949 ± 0.016
2.307AspPhe: 2.307 ± 0.013
3.762AspGly: 3.762 ± 0.017
1.449AspHis: 1.449 ± 0.01
3.269AspIle: 3.269 ± 0.017
2.809AspLys: 2.809 ± 0.015
4.957AspLeu: 4.957 ± 0.021
1.358AspMet: 1.358 ± 0.01
2.297AspAsn: 2.297 ± 0.014
2.658AspPro: 2.658 ± 0.014
1.825AspGln: 1.825 ± 0.009
2.693AspArg: 2.693 ± 0.018
3.961AspSer: 3.961 ± 0.019
2.3AspThr: 2.3 ± 0.011
3.605AspVal: 3.605 ± 0.017
0.673AspTrp: 0.673 ± 0.007
1.629AspTyr: 1.629 ± 0.01
0.0AspXaa: 0.0 ± 0.0
Glu
4.704GluAla: 4.704 ± 0.021
0.84GluCys: 0.84 ± 0.006
3.745GluAsp: 3.745 ± 0.016
5.693GluGlu: 5.693 ± 0.035
2.349GluPhe: 2.349 ± 0.013
3.665GluGly: 3.665 ± 0.018
1.363GluHis: 1.363 ± 0.008
3.769GluIle: 3.769 ± 0.018
4.472GluLys: 4.472 ± 0.025
5.723GluLeu: 5.723 ± 0.021
1.618GluMet: 1.618 ± 0.011
3.195GluAsn: 3.195 ± 0.017
2.22GluPro: 2.22 ± 0.014
2.295GluGln: 2.295 ± 0.013
3.444GluArg: 3.444 ± 0.018
4.268GluSer: 4.268 ± 0.017
3.087GluThr: 3.087 ± 0.016
4.053GluVal: 4.053 ± 0.018
0.697GluTrp: 0.697 ± 0.006
1.643GluTyr: 1.643 ± 0.01
0.0GluXaa: 0.0 ± 0.0
Phe
2.477PheAla: 2.477 ± 0.011
0.849PheCys: 0.849 ± 0.007
2.394PheAsp: 2.394 ± 0.012
2.294PheGlu: 2.294 ± 0.012
1.826PhePhe: 1.826 ± 0.012
3.129PheGly: 3.129 ± 0.016
1.186PheHis: 1.186 ± 0.01
2.258PheIle: 2.258 ± 0.014
2.097PheLys: 2.097 ± 0.012
4.001PheLeu: 4.001 ± 0.018
1.026PheMet: 1.026 ± 0.007
1.871PheAsn: 1.871 ± 0.01
2.014PhePro: 2.014 ± 0.011
1.659PheGln: 1.659 ± 0.01
2.02PheArg: 2.02 ± 0.011
3.859PheSer: 3.859 ± 0.017
2.025PheThr: 2.025 ± 0.011
2.621PheVal: 2.621 ± 0.013
0.515PheTrp: 0.515 ± 0.005
1.286PheTyr: 1.286 ± 0.01
0.0PheXaa: 0.0 ± 0.0
Gly
4.336GlyAla: 4.336 ± 0.032
1.168GlyCys: 1.168 ± 0.009
3.557GlyAsp: 3.557 ± 0.02
3.734GlyGlu: 3.734 ± 0.02
2.999GlyPhe: 2.999 ± 0.016
5.538GlyGly: 5.538 ± 0.047
1.733GlyHis: 1.733 ± 0.012
3.748GlyIle: 3.748 ± 0.018
4.076GlyLys: 4.076 ± 0.018
5.807GlyLeu: 5.807 ± 0.023
1.53GlyMet: 1.53 ± 0.01
3.124GlyAsn: 3.124 ± 0.015
2.526GlyPro: 2.526 ± 0.02
2.309GlyGln: 2.309 ± 0.015
4.222GlyArg: 4.222 ± 0.043
5.592GlySer: 5.592 ± 0.022
3.403GlyThr: 3.403 ± 0.018
4.265GlyVal: 4.265 ± 0.02
0.816GlyTrp: 0.816 ± 0.007
2.081GlyTyr: 2.081 ± 0.012
0.0GlyXaa: 0.0 ± 0.0
His
1.813HisAla: 1.813 ± 0.014
0.493HisCys: 0.493 ± 0.006
1.346HisAsp: 1.346 ± 0.009
1.363HisGlu: 1.363 ± 0.01
1.175HisPhe: 1.175 ± 0.007
1.912HisGly: 1.912 ± 0.014
1.135HisHis: 1.135 ± 0.01
1.351HisIle: 1.351 ± 0.009
1.318HisLys: 1.318 ± 0.009
2.462HisLeu: 2.462 ± 0.013
0.59HisMet: 0.59 ± 0.006
1.184HisAsn: 1.184 ± 0.009
1.473HisPro: 1.473 ± 0.011
1.136HisGln: 1.136 ± 0.01
1.735HisArg: 1.735 ± 0.017
2.092HisSer: 2.092 ± 0.011
1.105HisThr: 1.105 ± 0.008
1.683HisVal: 1.683 ± 0.009
0.276HisTrp: 0.276 ± 0.004
0.877HisTyr: 0.877 ± 0.008
0.0HisXaa: 0.0 ± 0.0
Ile
3.846IleAla: 3.846 ± 0.017
1.048IleCys: 1.048 ± 0.008
3.254IleAsp: 3.254 ± 0.017
3.379IleGlu: 3.379 ± 0.016
2.429IlePhe: 2.429 ± 0.013
3.668IleGly: 3.668 ± 0.016
1.477IleHis: 1.477 ± 0.009
3.177IleIle: 3.177 ± 0.016
3.031IleLys: 3.031 ± 0.013
5.26IleLeu: 5.26 ± 0.024
1.318IleMet: 1.318 ± 0.009
2.518IleAsn: 2.518 ± 0.014
2.946IlePro: 2.946 ± 0.016
2.118IleGln: 2.118 ± 0.014
2.714IleArg: 2.714 ± 0.013
4.804IleSer: 4.804 ± 0.02
2.884IleThr: 2.884 ± 0.013
3.692IleVal: 3.692 ± 0.017
0.673IleTrp: 0.673 ± 0.006
1.6IleTyr: 1.6 ± 0.011
0.0IleXaa: 0.0 ± 0.0
Lys
3.856LysAla: 3.856 ± 0.02
0.903LysCys: 0.903 ± 0.008
3.309LysAsp: 3.309 ± 0.017
4.414LysGlu: 4.414 ± 0.022
2.228LysPhe: 2.228 ± 0.012
3.633LysGly: 3.633 ± 0.017
1.402LysHis: 1.402 ± 0.01
3.392LysIle: 3.392 ± 0.016
4.54LysLys: 4.54 ± 0.022
5.576LysLeu: 5.576 ± 0.022
1.532LysMet: 1.532 ± 0.01
2.864LysAsn: 2.864 ± 0.014
2.661LysPro: 2.661 ± 0.016
2.235LysGln: 2.235 ± 0.012
3.46LysArg: 3.46 ± 0.017
4.517LysSer: 4.517 ± 0.024
2.898LysThr: 2.898 ± 0.013
3.963LysVal: 3.963 ± 0.019
0.755LysTrp: 0.755 ± 0.007
1.715LysTyr: 1.715 ± 0.011
0.0LysXaa: 0.0 ± 0.0
Leu
6.417LeuAla: 6.417 ± 0.026
1.666LeuCys: 1.666 ± 0.011
4.936LeuAsp: 4.936 ± 0.019
5.827LeuGlu: 5.827 ± 0.021
3.758LeuPhe: 3.758 ± 0.02
5.581LeuGly: 5.581 ± 0.021
2.757LeuHis: 2.757 ± 0.014
4.767LeuIle: 4.767 ± 0.02
5.877LeuLys: 5.877 ± 0.024
9.044LeuLeu: 9.044 ± 0.037
2.101LeuMet: 2.101 ± 0.012
4.123LeuAsn: 4.123 ± 0.018
4.904LeuPro: 4.904 ± 0.018
4.016LeuGln: 4.016 ± 0.018
5.347LeuArg: 5.347 ± 0.023
8.032LeuSer: 8.032 ± 0.032
4.401LeuThr: 4.401 ± 0.017
6.153LeuVal: 6.153 ± 0.025
1.039LeuTrp: 1.039 ± 0.008
2.43LeuTyr: 2.43 ± 0.013
0.0LeuXaa: 0.0 ± 0.0
Met
1.999MetAla: 1.999 ± 0.01
0.342MetCys: 0.342 ± 0.004
1.431MetAsp: 1.431 ± 0.009
1.914MetGlu: 1.914 ± 0.012
0.849MetPhe: 0.849 ± 0.006
1.631MetGly: 1.631 ± 0.01
0.555MetHis: 0.555 ± 0.005
1.362MetIle: 1.362 ± 0.01
1.647MetLys: 1.647 ± 0.01
2.124MetLeu: 2.124 ± 0.011
0.703MetMet: 0.703 ± 0.008
1.237MetAsn: 1.237 ± 0.008
1.042MetPro: 1.042 ± 0.008
0.936MetGln: 0.936 ± 0.007
1.199MetArg: 1.199 ± 0.009
1.77MetSer: 1.77 ± 0.011
1.133MetThr: 1.133 ± 0.008
1.639MetVal: 1.639 ± 0.01
0.29MetTrp: 0.29 ± 0.004
0.658MetTyr: 0.658 ± 0.006
0.0MetXaa: 0.0 ± 0.0
Asn
2.774AsnAla: 2.774 ± 0.011
0.781AsnCys: 0.781 ± 0.007
2.431AsnAsp: 2.431 ± 0.015
2.656AsnGlu: 2.656 ± 0.015
2.135AsnPhe: 2.135 ± 0.012
3.187AsnGly: 3.187 ± 0.015
1.371AsnHis: 1.371 ± 0.009
2.797AsnIle: 2.797 ± 0.014
2.604AsnLys: 2.604 ± 0.013
4.499AsnLeu: 4.499 ± 0.021
1.252AsnMet: 1.252 ± 0.008
2.936AsnAsn: 2.936 ± 0.018
2.409AsnPro: 2.409 ± 0.014
1.92AsnGln: 1.92 ± 0.012
2.142AsnArg: 2.142 ± 0.011
3.915AsnSer: 3.915 ± 0.017
2.215AsnThr: 2.215 ± 0.012
3.028AsnVal: 3.028 ± 0.016
0.631AsnTrp: 0.631 ± 0.006
1.34AsnTyr: 1.34 ± 0.01
0.0AsnXaa: 0.0 ± 0.0
Pro
3.139ProAla: 3.139 ± 0.025
0.756ProCys: 0.756 ± 0.007
2.355ProAsp: 2.355 ± 0.013
2.975ProGlu: 2.975 ± 0.015
2.009ProPhe: 2.009 ± 0.011
2.876ProGly: 2.876 ± 0.025
1.364ProHis: 1.364 ± 0.009
2.49ProIle: 2.49 ± 0.015
2.66ProLys: 2.66 ± 0.014
4.221ProLeu: 4.221 ± 0.017
1.034ProMet: 1.034 ± 0.008
2.334ProAsn: 2.334 ± 0.015
3.773ProPro: 3.773 ± 0.056
1.927ProGln: 1.927 ± 0.013
2.608ProArg: 2.608 ± 0.024
4.799ProSer: 4.799 ± 0.024
2.85ProThr: 2.85 ± 0.016
2.973ProVal: 2.973 ± 0.016
0.576ProTrp: 0.576 ± 0.007
1.459ProTyr: 1.459 ± 0.012
0.0ProXaa: 0.0 ± 0.0
Gln
2.451GlnAla: 2.451 ± 0.015
0.591GlnCys: 0.591 ± 0.005
1.699GlnAsp: 1.699 ± 0.012
2.296GlnGlu: 2.296 ± 0.013
1.487GlnPhe: 1.487 ± 0.01
2.438GlnGly: 2.438 ± 0.014
1.157GlnHis: 1.157 ± 0.01
2.097GlnIle: 2.097 ± 0.012
2.283GlnLys: 2.283 ± 0.014
3.539GlnLeu: 3.539 ± 0.016
1.009GlnMet: 1.009 ± 0.008
2.027GlnAsn: 2.027 ± 0.012
2.073GlnPro: 2.073 ± 0.014
2.317GlnGln: 2.317 ± 0.028
2.329GlnArg: 2.329 ± 0.017
2.842GlnSer: 2.842 ± 0.016
1.811GlnThr: 1.811 ± 0.011
2.357GlnVal: 2.357 ± 0.012
0.437GlnTrp: 0.437 ± 0.005
1.073GlnTyr: 1.073 ± 0.008
0.0GlnXaa: 0.0 ± 0.0
Arg
3.708ArgAla: 3.708 ± 0.039
0.914ArgCys: 0.914 ± 0.007
2.811ArgAsp: 2.811 ± 0.017
3.369ArgGlu: 3.369 ± 0.019
2.186ArgPhe: 2.186 ± 0.012
3.673ArgGly: 3.673 ± 0.033
1.639ArgHis: 1.639 ± 0.02
3.007ArgIle: 3.007 ± 0.014
3.628ArgLys: 3.628 ± 0.016
5.107ArgLeu: 5.107 ± 0.024
1.354ArgMet: 1.354 ± 0.008
2.526ArgAsn: 2.526 ± 0.012
2.675ArgPro: 2.675 ± 0.029
2.113ArgGln: 2.113 ± 0.016
4.646ArgArg: 4.646 ± 0.061
4.032ArgSer: 4.032 ± 0.017
2.599ArgThr: 2.599 ± 0.013
3.533ArgVal: 3.533 ± 0.02
0.684ArgTrp: 0.684 ± 0.006
1.478ArgTyr: 1.478 ± 0.011
0.0ArgXaa: 0.0 ± 0.0
Ser
4.924SerAla: 4.924 ± 0.019
1.468SerCys: 1.468 ± 0.011
4.269SerAsp: 4.269 ± 0.019
4.53SerGlu: 4.53 ± 0.019
3.743SerPhe: 3.743 ± 0.017
5.525SerGly: 5.525 ± 0.019
2.106SerHis: 2.106 ± 0.011
4.563SerIle: 4.563 ± 0.02
4.796SerLys: 4.796 ± 0.019
8.035SerLeu: 8.035 ± 0.032
2.018SerMet: 2.018 ± 0.013
4.182SerAsn: 4.182 ± 0.02
4.217SerPro: 4.217 ± 0.03
3.01SerGln: 3.01 ± 0.016
4.179SerArg: 4.179 ± 0.02
9.921SerSer: 9.921 ± 0.045
4.553SerThr: 4.553 ± 0.02
4.871SerVal: 4.871 ± 0.018
1.069SerTrp: 1.069 ± 0.009
2.343SerTyr: 2.343 ± 0.011
0.0SerXaa: 0.0 ± 0.0
Thr
3.295ThrAla: 3.295 ± 0.016
0.908ThrCys: 0.908 ± 0.008
2.308ThrAsp: 2.308 ± 0.012
2.72ThrGlu: 2.72 ± 0.014
2.127ThrPhe: 2.127 ± 0.012
3.225ThrGly: 3.225 ± 0.014
1.222ThrHis: 1.222 ± 0.008
3.061ThrIle: 3.061 ± 0.016
2.869ThrLys: 2.869 ± 0.014
4.809ThrLeu: 4.809 ± 0.019
1.231ThrMet: 1.231 ± 0.008
2.395ThrAsn: 2.395 ± 0.012
2.709ThrPro: 2.709 ± 0.015
1.709ThrGln: 1.709 ± 0.01
2.554ThrArg: 2.554 ± 0.012
4.46ThrSer: 4.46 ± 0.021
3.34ThrThr: 3.34 ± 0.015
3.264ThrVal: 3.264 ± 0.012
0.671ThrTrp: 0.671 ± 0.007
1.488ThrTyr: 1.488 ± 0.011
0.0ThrXaa: 0.0 ± 0.0
Val
4.886ValAla: 4.886 ± 0.023
1.022ValCys: 1.022 ± 0.008
3.817ValAsp: 3.817 ± 0.015
4.428ValGlu: 4.428 ± 0.017
2.574ValPhe: 2.574 ± 0.014
4.265ValGly: 4.265 ± 0.02
1.595ValHis: 1.595 ± 0.011
3.543ValIle: 3.543 ± 0.015
3.852ValLys: 3.852 ± 0.017
5.958ValLeu: 5.958 ± 0.023
1.548ValMet: 1.548 ± 0.009
2.736ValAsn: 2.736 ± 0.013
3.159ValPro: 3.159 ± 0.015
2.307ValGln: 2.307 ± 0.013
3.27ValArg: 3.27 ± 0.019
5.105ValSer: 5.105 ± 0.019
3.315ValThr: 3.315 ± 0.015
4.932ValVal: 4.932 ± 0.02
0.705ValTrp: 0.705 ± 0.006
1.828ValTyr: 1.828 ± 0.013
0.0ValXaa: 0.0 ± 0.0
Trp
0.708TrpAla: 0.708 ± 0.007
0.222TrpCys: 0.222 ± 0.004
0.653TrpAsp: 0.653 ± 0.007
0.699TrpGlu: 0.699 ± 0.006
0.497TrpPhe: 0.497 ± 0.006
0.686TrpGly: 0.686 ± 0.007
0.324TrpHis: 0.324 ± 0.005
0.651TrpIle: 0.651 ± 0.007
0.853TrpLys: 0.853 ± 0.007
1.216TrpLeu: 1.216 ± 0.009
0.311TrpMet: 0.311 ± 0.004
0.669TrpAsn: 0.669 ± 0.007
0.499TrpPro: 0.499 ± 0.006
0.452TrpGln: 0.452 ± 0.005
0.765TrpArg: 0.765 ± 0.006
0.929TrpSer: 0.929 ± 0.009
0.6TrpThr: 0.6 ± 0.006
0.773TrpVal: 0.773 ± 0.007
0.219TrpTrp: 0.219 ± 0.004
0.353TrpTyr: 0.353 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.901TyrAla: 1.901 ± 0.011
0.566TyrCys: 0.566 ± 0.006
1.588TyrAsp: 1.588 ± 0.011
1.615TyrGlu: 1.615 ± 0.01
1.294TyrPhe: 1.294 ± 0.009
2.132TyrGly: 2.132 ± 0.013
0.761TyrHis: 0.761 ± 0.007
1.667TyrIle: 1.667 ± 0.012
1.673TyrLys: 1.673 ± 0.013
2.632TyrLeu: 2.632 ± 0.014
0.682TyrMet: 0.682 ± 0.006
1.385TyrAsn: 1.385 ± 0.01
1.314TyrPro: 1.314 ± 0.01
1.06TyrGln: 1.06 ± 0.008
1.443TyrArg: 1.443 ± 0.01
2.32TyrSer: 2.32 ± 0.012
1.362TyrThr: 1.362 ± 0.009
1.813TyrVal: 1.813 ± 0.013
0.418TyrTrp: 0.418 ± 0.005
0.985TyrTyr: 0.985 ± 0.011
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46596 proteins (18656812 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski