Amino acid dipeptide frequency for Juncus maritimus associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
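The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function names are hypothetical, sequences are assumed to be given in one-letter codes, any non-standard letter is mapped to X (Xaa), and pairs are assumed to be counted within each protein only, never across protein boundaries.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard residues (one-letter codes)
UNIFORM_PER_MILLE = 1000.0 / 400        # 0.25% = 2.5 per mille for each of 400 dipeptides


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters is treated as Xaa ("X").
        residues = [r if r in STANDARD else "X" for r in seq.upper()]
        # Overlapping pairs: positions (0,1), (1,2), (2,3), ...
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(per_mille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if per_mille > UNIFORM_PER_MILLE:
        return "over"      # shown in red on the original page
    if per_mille < UNIFORM_PER_MILLE:
        return "under"     # shown in blue on the original page
    return "expected"


if __name__ == "__main__":
    # Toy input, not taken from the proteome described on this page.
    for pair, value in sorted(dipeptide_per_mille(["MKKLA", "MKXA"]).items()):
        print(pair, round(value, 3), classify(value))
```

With only a few sequences most of the 400 dipeptides never occur, so many entries are trivially "under"; the comparison becomes more informative as the number of residues grows.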

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
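As a quick worked conversion, the snippet below turns a per mille value into a fraction and into a percentage. The helper names are illustrative only; the SerSer value is the one listed in the table on this page.

```python
def per_mille_to_fraction(value):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Per mille -> percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    ser_ser = 16.277  # SerSer frequency in per mille, from the table
    print("fraction:", round(per_mille_to_fraction(ser_ser), 6))
    print("percent: ", round(per_mille_to_percent(ser_ser), 4))
    # The uniform expectation of 0.25% corresponds to 2.5 per mille.
    print("uniform expectation in percent:", per_mille_to_percent(2.5))
```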

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.052 ± 1.242
AlaCys: 1.017 ± 1.135
AlaAsp: 3.052 ± 1.616
AlaGlu: 2.035 ± 1.211
AlaPhe: 1.017 ± 0.864
AlaGly: 4.069 ± 2.346
AlaHis: 3.052 ± 0.92
AlaIle: 2.035 ± 1.127
AlaLys: 8.138 ± 2.355
AlaLeu: 7.121 ± 1.712
AlaMet: 0.0 ± 0.0
AlaAsn: 4.069 ± 1.384
AlaPro: 4.069 ± 1.866
AlaGln: 3.052 ± 2.242
AlaArg: 5.086 ± 1.982
AlaSer: 1.017 ± 0.694
AlaThr: 1.017 ± 0.864
AlaVal: 1.017 ± 0.694
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.035 ± 0.814
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.035 ± 1.728
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.017 ± 1.258
CysPhe: 0.0 ± 0.0
CysGly: 1.017 ± 1.093
CysHis: 0.0 ± 0.0
CysIle: 1.017 ± 1.258
CysLys: 0.0 ± 0.0
CysLeu: 2.035 ± 1.156
CysMet: 0.0 ± 0.0
CysAsn: 3.052 ± 1.616
CysPro: 1.017 ± 1.258
CysGln: 3.052 ± 2.279
CysArg: 1.017 ± 0.864
CysSer: 2.035 ± 1.211
CysThr: 1.017 ± 1.258
CysVal: 0.0 ± 0.0
CysTrp: 1.017 ± 1.093
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.035 ± 1.388
AspCys: 1.017 ± 1.258
AspAsp: 7.121 ± 3.05
AspGlu: 3.052 ± 0.92
AspPhe: 1.017 ± 0.864
AspGly: 5.086 ± 2.79
AspHis: 1.017 ± 0.864
AspIle: 2.035 ± 1.156
AspLys: 5.086 ± 2.105
AspLeu: 4.069 ± 2.239
AspMet: 1.017 ± 1.258
AspAsn: 1.017 ± 1.258
AspPro: 2.035 ± 1.219
AspGln: 1.017 ± 0.864
AspArg: 1.017 ± 0.694
AspSer: 4.069 ± 2.36
AspThr: 2.035 ± 1.156
AspVal: 3.052 ± 1.242
AspTrp: 5.086 ± 1.235
AspTyr: 2.035 ± 0.814
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.017 ± 0.694
GluCys: 0.0 ± 0.0
GluAsp: 3.052 ± 1.05
GluGlu: 10.173 ± 5.35
GluPhe: 3.052 ± 1.242
GluGly: 4.069 ± 1.84
GluHis: 2.035 ± 1.609
GluIle: 3.052 ± 1.447
GluLys: 3.052 ± 2.242
GluLeu: 4.069 ± 1.661
GluMet: 1.017 ± 0.694
GluAsn: 2.035 ± 0.814
GluPro: 2.035 ± 1.651
GluGln: 2.035 ± 1.127
GluArg: 0.0 ± 0.0
GluSer: 2.035 ± 1.395
GluThr: 2.035 ± 1.388
GluVal: 2.035 ± 2.269
GluTrp: 1.017 ± 0.694
GluTyr: 1.017 ± 0.694
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.035 ± 1.388
PheCys: 1.017 ± 1.135
PheAsp: 2.035 ± 1.127
PheGlu: 1.017 ± 0.694
PhePhe: 1.017 ± 0.694
PheGly: 3.052 ± 0.92
PheHis: 3.052 ± 1.52
PheIle: 3.052 ± 1.616
PheLys: 1.017 ± 1.093
PheLeu: 3.052 ± 0.92
PheMet: 0.0 ± 0.0
PheAsn: 7.121 ± 3.097
PhePro: 3.052 ± 0.92
PheGln: 4.069 ± 1.149
PheArg: 5.086 ± 2.771
PheSer: 4.069 ± 1.058
PheThr: 1.017 ± 0.864
PheVal: 1.017 ± 0.694
PheTrp: 0.0 ± 0.0
PheTyr: 1.017 ± 0.864
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.069 ± 1.629
GlyCys: 0.0 ± 0.0
GlyAsp: 4.069 ± 1.384
GlyGlu: 3.052 ± 1.529
GlyPhe: 3.052 ± 1.531
GlyGly: 4.069 ± 2.346
GlyHis: 3.052 ± 2.279
GlyIle: 1.017 ± 1.093
GlyLys: 5.086 ± 2.697
GlyLeu: 6.104 ± 2.083
GlyMet: 0.0 ± 0.0
GlyAsn: 0.0 ± 0.0
GlyPro: 2.035 ± 1.219
GlyGln: 4.069 ± 1.058
GlyArg: 3.052 ± 1.529
GlySer: 8.138 ± 2.467
GlyThr: 1.017 ± 1.258
GlyVal: 4.069 ± 1.149
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.052 ± 1.025
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.017 ± 0.694
HisCys: 2.035 ± 1.12
HisAsp: 1.017 ± 0.864
HisGlu: 1.017 ± 0.694
HisPhe: 1.017 ± 0.694
HisGly: 1.017 ± 1.258
HisHis: 0.0 ± 0.0
HisIle: 4.069 ± 1.493
HisLys: 2.035 ± 1.609
HisLeu: 3.052 ± 1.362
HisMet: 1.017 ± 0.756
HisAsn: 5.086 ± 1.351
HisPro: 2.035 ± 1.395
HisGln: 1.017 ± 1.093
HisArg: 3.052 ± 2.673
HisSer: 1.017 ± 1.258
HisThr: 2.035 ± 0.814
HisVal: 0.0 ± 0.0
HisTrp: 2.035 ± 0.814
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.052 ± 1.723
IleCys: 0.0 ± 0.0
IleAsp: 6.104 ± 2.724
IleGlu: 3.052 ± 1.52
IlePhe: 5.086 ± 1.185
IleGly: 1.017 ± 0.694
IleHis: 2.035 ± 1.651
IleIle: 5.086 ± 1.467
IleLys: 7.121 ± 3.865
IleLeu: 1.017 ± 1.135
IleMet: 0.0 ± 0.0
IleAsn: 7.121 ± 2.949
IlePro: 4.069 ± 2.146
IleGln: 5.086 ± 3.363
IleArg: 5.086 ± 1.868
IleSer: 5.086 ± 1.235
IleThr: 3.052 ± 2.49
IleVal: 5.086 ± 1.998
IleTrp: 3.052 ± 2.077
IleTyr: 4.069 ± 1.022
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.052 ± 2.673
LysCys: 3.052 ± 2.077
LysAsp: 3.052 ± 1.52
LysGlu: 4.069 ± 2.423
LysPhe: 4.069 ± 1.163
LysGly: 4.069 ± 1.163
LysHis: 4.069 ± 1.629
LysIle: 4.069 ± 2.469
LysLys: 3.052 ± 1.362
LysLeu: 2.035 ± 1.609
LysMet: 0.0 ± 0.0
LysAsn: 6.104 ± 2.271
LysPro: 2.035 ± 1.211
LysGln: 0.0 ± 0.0
LysArg: 8.138 ± 2.189
LysSer: 6.104 ± 1.841
LysThr: 2.035 ± 0.814
LysVal: 1.017 ± 0.694
LysTrp: 0.0 ± 0.0
LysTyr: 4.069 ± 1.84
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.069 ± 1.629
LeuCys: 1.017 ± 0.694
LeuAsp: 6.104 ± 2.341
LeuGlu: 3.052 ± 1.531
LeuPhe: 4.069 ± 2.346
LeuGly: 4.069 ± 1.022
LeuHis: 3.052 ± 0.92
LeuIle: 5.086 ± 1.185
LeuLys: 6.104 ± 3.04
LeuLeu: 1.017 ± 1.135
LeuMet: 4.069 ± 2.312
LeuAsn: 2.035 ± 1.211
LeuPro: 2.035 ± 1.651
LeuGln: 5.086 ± 3.421
LeuArg: 4.069 ± 1.749
LeuSer: 4.069 ± 2.775
LeuThr: 2.035 ± 1.388
LeuVal: 1.017 ± 0.694
LeuTrp: 3.052 ± 1.362
LeuTyr: 0.0 ± 0.0
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.035 ± 1.211
MetCys: 0.0 ± 0.0
MetAsp: 3.052 ± 1.723
MetGlu: 0.0 ± 0.0
MetPhe: 1.017 ± 1.093
MetGly: 1.017 ± 0.864
MetHis: 0.0 ± 0.0
MetIle: 1.017 ± 1.093
MetLys: 0.0 ± 0.0
MetLeu: 2.035 ± 1.395
MetMet: 1.017 ± 0.864
MetAsn: 0.0 ± 0.0
MetPro: 1.017 ± 0.864
MetGln: 1.017 ± 0.694
MetArg: 1.017 ± 1.258
MetSer: 1.017 ± 1.093
MetThr: 1.017 ± 0.864
MetVal: 1.017 ± 1.258
MetTrp: 2.035 ± 0.814
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.052 ± 1.05
AsnCys: 2.035 ± 1.395
AsnAsp: 3.052 ± 1.242
AsnGlu: 1.017 ± 0.694
AsnPhe: 0.0 ± 0.0
AsnGly: 3.052 ± 1.783
AsnHis: 2.035 ± 1.609
AsnIle: 10.173 ± 3.704
AsnLys: 4.069 ± 1.85
AsnLeu: 6.104 ± 2.126
AsnMet: 1.017 ± 0.864
AsnAsn: 1.017 ± 0.694
AsnPro: 8.138 ± 2.656
AsnGln: 2.035 ± 1.395
AsnArg: 1.017 ± 0.864
AsnSer: 6.104 ± 2.271
AsnThr: 1.017 ± 0.694
AsnVal: 2.035 ± 1.127
AsnTrp: 1.017 ± 0.694
AsnTyr: 6.104 ± 1.794
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.052 ± 1.783
ProCys: 1.017 ± 1.258
ProAsp: 4.069 ± 2.674
ProGlu: 4.069 ± 1.558
ProPhe: 1.017 ± 0.694
ProGly: 3.052 ± 2.279
ProHis: 1.017 ± 0.694
ProIle: 3.052 ± 1.524
ProLys: 4.069 ± 1.558
ProLeu: 5.086 ± 2.79
ProMet: 1.017 ± 0.864
ProAsn: 4.069 ± 2.789
ProPro: 6.104 ± 3.126
ProGln: 4.069 ± 2.239
ProArg: 1.017 ± 0.694
ProSer: 5.086 ± 1.445
ProThr: 1.017 ± 1.093
ProVal: 2.035 ± 0.814
ProTrp: 0.0 ± 0.0
ProTyr: 3.052 ± 2.592
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.052 ± 1.464
GlnCys: 3.052 ± 2.279
GlnAsp: 1.017 ± 1.258
GlnGlu: 3.052 ± 1.616
GlnPhe: 5.086 ± 1.929
GlnGly: 2.035 ± 1.12
GlnHis: 3.052 ± 1.025
GlnIle: 7.121 ± 1.905
GlnLys: 1.017 ± 1.258
GlnLeu: 3.052 ± 1.616
GlnMet: 1.017 ± 1.258
GlnAsn: 0.0 ± 0.0
GlnPro: 2.035 ± 2.516
GlnGln: 4.069 ± 1.558
GlnArg: 4.069 ± 1.733
GlnSer: 5.086 ± 1.795
GlnThr: 1.017 ± 0.694
GlnVal: 3.052 ± 1.948
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.035 ± 1.388
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.069 ± 1.797
ArgCys: 0.0 ± 0.0
ArgAsp: 1.017 ± 1.258
ArgGlu: 2.035 ± 1.211
ArgPhe: 4.069 ± 1.149
ArgGly: 3.052 ± 1.025
ArgHis: 2.035 ± 1.395
ArgIle: 4.069 ± 2.367
ArgLys: 5.086 ± 2.517
ArgLeu: 1.017 ± 0.864
ArgMet: 1.017 ± 0.864
ArgAsn: 2.035 ± 1.156
ArgPro: 2.035 ± 0.814
ArgGln: 2.035 ± 1.651
ArgArg: 6.104 ± 3.899
ArgSer: 5.086 ± 1.467
ArgThr: 5.086 ± 2.24
ArgVal: 4.069 ± 1.296
ArgTrp: 1.017 ± 1.135
ArgTyr: 3.052 ± 1.529
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.138 ± 2.839
SerCys: 2.035 ± 1.156
SerAsp: 5.086 ± 2.096
SerGlu: 1.017 ± 0.694
SerPhe: 3.052 ± 2.11
SerGly: 6.104 ± 4.102
SerHis: 1.017 ± 1.258
SerIle: 6.104 ± 1.642
SerLys: 4.069 ± 1.296
SerLeu: 3.052 ± 1.52
SerMet: 2.035 ± 1.126
SerAsn: 6.104 ± 3.155
SerPro: 6.104 ± 1.521
SerGln: 3.052 ± 2.279
SerArg: 3.052 ± 1.52
SerSer: 16.277 ± 2.83
SerThr: 5.086 ± 1.358
SerVal: 3.052 ± 1.594
SerTrp: 0.0 ± 0.0
SerTyr: 3.052 ± 1.025
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.035 ± 1.156
ThrCys: 0.0 ± 0.0
ThrAsp: 0.0 ± 0.0
ThrGlu: 2.035 ± 1.12
ThrPhe: 2.035 ± 2.187
ThrGly: 4.069 ± 1.443
ThrHis: 1.017 ± 0.694
ThrIle: 5.086 ± 1.351
ThrLys: 0.0 ± 0.0
ThrLeu: 2.035 ± 1.127
ThrMet: 2.035 ± 1.24
ThrAsn: 5.086 ± 3.189
ThrPro: 4.069 ± 1.058
ThrGln: 0.0 ± 0.0
ThrArg: 1.017 ± 1.135
ThrSer: 3.052 ± 1.025
ThrThr: 2.035 ± 1.156
ThrVal: 2.035 ± 1.156
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.017 ± 0.694
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 1.017 ± 0.694
ValAsp: 0.0 ± 0.0
ValGlu: 3.052 ± 2.191
ValPhe: 4.069 ± 1.629
ValGly: 2.035 ± 1.211
ValHis: 0.0 ± 0.0
ValIle: 3.052 ± 0.92
ValLys: 2.035 ± 1.127
ValLeu: 3.052 ± 1.242
ValMet: 1.017 ± 1.093
ValAsn: 4.069 ± 2.255
ValPro: 2.035 ± 1.211
ValGln: 7.121 ± 2.476
ValArg: 3.052 ± 1.949
ValSer: 2.035 ± 0.814
ValThr: 2.035 ± 1.156
ValVal: 0.0 ± 0.0
ValTrp: 0.0 ± 0.0
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.052 ± 1.242
TrpCys: 0.0 ± 0.0
TrpAsp: 1.017 ± 1.093
TrpGlu: 1.017 ± 1.258
TrpPhe: 1.017 ± 1.093
TrpGly: 2.035 ± 0.814
TrpHis: 0.0 ± 0.0
TrpIle: 4.069 ± 3.193
TrpLys: 1.017 ± 0.694
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 1.017 ± 0.864
TrpPro: 0.0 ± 0.0
TrpGln: 1.017 ± 0.694
TrpArg: 0.0 ± 0.0
TrpSer: 1.017 ± 0.864
TrpThr: 3.052 ± 1.52
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.035 ± 1.388
TyrCys: 1.017 ± 0.864
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.0 ± 0.0
TyrPhe: 2.035 ± 1.388
TyrGly: 1.017 ± 0.694
TyrHis: 2.035 ± 1.728
TyrIle: 1.017 ± 0.694
TyrLys: 2.035 ± 0.814
TyrLeu: 6.104 ± 2.393
TyrMet: 1.017 ± 0.94
TyrAsn: 3.052 ± 1.374
TyrPro: 1.017 ± 0.694
TyrGln: 1.017 ± 0.694
TyrArg: 2.035 ± 1.395
TyrSer: 5.086 ± 3.462
TyrThr: 1.017 ± 0.864
TyrVal: 4.069 ± 2.346
TyrTrp: 0.0 ± 0.0
TyrTyr: 5.086 ± 4.32
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (984 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
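One way such a protein-level bootstrap could look is sketched below. The page does not describe the exact procedure, so the details are assumptions: whole proteins are resampled with replacement, the dipeptide frequencies are recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates. The function names, the fixed seed, and the toy sequences are illustrative only.

```python
import random
import statistics
from collections import Counter


def dipeptide_freqs(proteins):
    """Per mille frequency of each overlapping residue pair in a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_errors(proteins, n_rounds=100, seed=0):
    """Standard deviation of each dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rounds):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_freqs(sample))
    observed = dipeptide_freqs(proteins)
    return {
        pair: statistics.pstdev([rep.get(pair, 0.0) for rep in replicates])
        for pair in observed
    }


if __name__ == "__main__":
    toy = ["MSSKLA", "MKRSS", "MASSN"]  # toy sequences, not the real proteome
    freqs = dipeptide_freqs(toy)
    errors = bootstrap_errors(toy)
    for pair in sorted(freqs):
        print(f"{pair}: {freqs[pair]:.3f} ± {errors[pair]:.3f}")
```

With only a handful of proteins, as in the proteome on this page, each replicate can differ strongly from the original set, which is consistent with errors that are large relative to the frequencies themselves.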

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
This proteome is also available in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski