Amino acid dipeptide frequency for Inoviridae sp.

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each pair of adjacent amino acids occurs in the proteins.
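As a rough illustration of the idea, the sketch below counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, the use of one-letter codes, and the normalization by the total number of counted dipeptides are assumptions made for this example; the database may normalize differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping adjacent residue pairs.

    Assumption: frequencies are normalized by the total number of
    dipeptides counted across all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # zip pairs residue i with residue i+1; pairs never span two proteins
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input (one-letter amino acid codes), not real data
toy_proteins = ["MKKA", "AKK"]
toy_freqs = dipeptide_frequencies(toy_proteins)
```

In the toy input there are five dipeptides in total (MK, KK, KA, AK, KK), so `toy_freqs["KK"]` is two fifths of 1000‰.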

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
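The expected value and the over/under marking can be sketched as follows. The function name `classify` and the simple comparison against the uniform expectation (ignoring the error estimates) are assumptions for illustration; the exact rule used for the colouring is not stated here.

```python
N_STANDARD = 20  # standard amino acids; Xaa is not counted here

# Uniform expectation for each of the 20 * 20 dipeptides, in per mille
EXPECTED_PERMILLE = 1000.0 / N_STANDARD ** 2  # 2.5 per mille = 0.25%


def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"   # would be marked red
    if observed_permille < expected:
        return "under"  # would be marked blue
    return "as expected"


# Two values taken from the table
ala_gly = classify(7.285)
ala_cys = classify(0.662)
```

With the table values, AlaGly (7.285‰) falls above the 2.5‰ expectation and AlaCys (0.662‰) falls below it.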

All values are presented in per mille (‰), and therefore need to be multiplied by 10⁻³ to obtain fractions.
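For readers who prefer fractions or percentages, the conversion is a simple rescaling. The helper names below are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10**-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


# Example with the AlaGly value from the table
ala_gly_fraction = permille_to_fraction(7.285)
ala_gly_percent = permille_to_percent(7.285)
```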

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.987 ± 0.872
AlaCys: 0.662 ± 0.466
AlaAsp: 2.649 ± 0.981
AlaGlu: 3.311 ± 0.765
AlaPhe: 5.298 ± 1.186
AlaGly: 7.285 ± 1.382
AlaHis: 0.662 ± 0.917
AlaIle: 7.285 ± 4.225
AlaLys: 2.649 ± 1.127
AlaLeu: 3.974 ± 1.766
AlaMet: 1.325 ± 1.433
AlaAsn: 3.311 ± 0.806
AlaPro: 3.311 ± 2.154
AlaGln: 1.987 ± 1.011
AlaArg: 0.662 ± 0.559
AlaSer: 1.987 ± 1.011
AlaThr: 2.649 ± 1.558
AlaVal: 1.987 ± 1.732
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.325 ± 1.834
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.649 ± 0.604
CysCys: 0.0 ± 0.0
CysAsp: 1.987 ± 0.822
CysGlu: 0.662 ± 0.466
CysPhe: 0.662 ± 0.466
CysGly: 1.987 ± 0.9
CysHis: 0.662 ± 0.577
CysIle: 0.0 ± 0.0
CysLys: 1.987 ± 1.047
CysLeu: 0.662 ± 0.559
CysMet: 0.0 ± 0.0
CysAsn: 1.325 ± 0.932
CysPro: 1.325 ± 0.932
CysGln: 0.662 ± 0.466
CysArg: 1.325 ± 0.536
CysSer: 0.662 ± 0.466
CysThr: 1.325 ± 0.932
CysVal: 1.987 ± 0.403
CysTrp: 0.0 ± 0.0
CysTyr: 1.987 ± 0.9
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.987 ± 0.822
AspCys: 1.325 ± 1.118
AspAsp: 5.298 ± 1.608
AspGlu: 4.636 ± 1.892
AspPhe: 2.649 ± 0.814
AspGly: 6.623 ± 2.607
AspHis: 1.987 ± 1.008
AspIle: 3.974 ± 1.659
AspLys: 5.298 ± 1.089
AspLeu: 8.609 ± 0.582
AspMet: 1.987 ± 0.403
AspAsn: 5.96 ± 2.258
AspPro: 2.649 ± 1.308
AspGln: 1.987 ± 1.414
AspArg: 1.987 ± 0.403
AspSer: 3.974 ± 1.723
AspThr: 3.974 ± 1.76
AspVal: 4.636 ± 1.617
AspTrp: 1.987 ± 2.096
AspTyr: 1.987 ± 0.822
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.636 ± 2.721
GluCys: 0.662 ± 0.577
GluAsp: 0.662 ± 0.577
GluGlu: 2.649 ± 0.804
GluPhe: 3.974 ± 1.214
GluGly: 4.636 ± 2.016
GluHis: 1.325 ± 1.155
GluIle: 1.325 ± 0.587
GluLys: 3.311 ± 0.765
GluLeu: 2.649 ± 0.711
GluMet: 1.325 ± 0.633
GluAsn: 1.987 ± 1.272
GluPro: 2.649 ± 1.4
GluGln: 3.311 ± 0.946
GluArg: 1.987 ± 0.872
GluSer: 3.974 ± 2.146
GluThr: 4.636 ± 2.65
GluVal: 6.623 ± 2.795
GluTrp: 0.0 ± 0.0
GluTyr: 1.325 ± 1.155
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.649 ± 1.936
PheCys: 1.987 ± 0.822
PheAsp: 5.298 ± 1.82
PheGlu: 3.311 ± 1.386
PhePhe: 4.636 ± 2.07
PheGly: 3.311 ± 1.259
PheHis: 1.987 ± 1.069
PheIle: 1.987 ± 0.403
PheLys: 3.311 ± 1.579
PheLeu: 3.311 ± 1.512
PheMet: 0.662 ± 0.466
PheAsn: 5.298 ± 2.003
PhePro: 1.987 ± 1.352
PheGln: 0.662 ± 0.559
PheArg: 1.987 ± 0.403
PheSer: 3.974 ± 2.134
PheThr: 1.987 ± 0.822
PheVal: 3.974 ± 1.243
PheTrp: 0.0 ± 0.0
PheTyr: 2.649 ± 0.787
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.987 ± 0.872
GlyCys: 2.649 ± 1.308
GlyAsp: 3.974 ± 1.44
GlyGlu: 2.649 ± 0.804
GlyPhe: 5.298 ± 1.157
GlyGly: 6.623 ± 1.537
GlyHis: 0.0 ± 0.0
GlyIle: 6.623 ± 1.639
GlyLys: 6.623 ± 1.375
GlyLeu: 6.623 ± 2.732
GlyMet: 1.987 ± 1.254
GlyAsn: 4.636 ± 1.42
GlyPro: 1.325 ± 0.587
GlyGln: 4.636 ± 1.258
GlyArg: 0.0 ± 0.0
GlySer: 7.947 ± 1.369
GlyThr: 3.311 ± 1.001
GlyVal: 3.311 ± 1.746
GlyTrp: 0.662 ± 0.577
GlyTyr: 1.987 ± 0.403
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.662 ± 0.559
HisAsp: 1.987 ± 1.069
HisGlu: 1.987 ± 1.008
HisPhe: 1.325 ± 1.043
HisGly: 1.325 ± 0.628
HisHis: 0.662 ± 0.559
HisIle: 3.311 ± 1.697
HisLys: 2.649 ± 0.804
HisLeu: 2.649 ± 1.253
HisMet: 0.0 ± 0.491
HisAsn: 1.325 ± 0.628
HisPro: 0.662 ± 0.577
HisGln: 0.0 ± 0.0
HisArg: 0.662 ± 0.577
HisSer: 0.0 ± 0.0
HisThr: 1.325 ± 0.628
HisVal: 0.0 ± 0.0
HisTrp: 0.662 ± 0.559
HisTyr: 2.649 ± 1.6
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.636 ± 1.49
IleCys: 1.325 ± 0.587
IleAsp: 7.285 ± 2.274
IleGlu: 2.649 ± 1.892
IlePhe: 3.974 ± 1.605
IleGly: 2.649 ± 1.173
IleHis: 0.662 ± 0.577
IleIle: 3.974 ± 1.243
IleLys: 3.974 ± 0.805
IleLeu: 6.623 ± 1.848
IleMet: 0.662 ± 0.577
IleAsn: 6.623 ± 2.282
IlePro: 3.311 ± 1.343
IleGln: 2.649 ± 1.127
IleArg: 1.325 ± 1.128
IleSer: 3.974 ± 1.76
IleThr: 3.311 ± 1.141
IleVal: 2.649 ± 1.366
IleTrp: 0.662 ± 0.466
IleTyr: 0.662 ± 0.559
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.974 ± 1.664
LysCys: 2.649 ± 1.865
LysAsp: 3.974 ± 1.773
LysGlu: 5.298 ± 2.337
LysPhe: 1.987 ± 0.872
LysGly: 2.649 ± 0.814
LysHis: 3.311 ± 1.658
LysIle: 2.649 ± 1.224
LysLys: 7.947 ± 4.226
LysLeu: 0.662 ± 0.559
LysMet: 0.662 ± 0.559
LysAsn: 2.649 ± 1.188
LysPro: 1.987 ± 1.398
LysGln: 4.636 ± 0.959
LysArg: 3.311 ± 1.658
LysSer: 4.636 ± 1.554
LysThr: 3.311 ± 1.189
LysVal: 5.298 ± 2.715
LysTrp: 1.987 ± 1.039
LysTyr: 6.623 ± 0.876
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.636 ± 1.279
LeuCys: 1.325 ± 0.932
LeuAsp: 7.285 ± 3.228
LeuGlu: 2.649 ± 1.572
LeuPhe: 1.987 ± 0.9
LeuGly: 3.974 ± 2.252
LeuHis: 1.987 ± 0.403
LeuIle: 8.609 ± 2.314
LeuLys: 3.311 ± 0.935
LeuLeu: 5.96 ± 2.267
LeuMet: 1.987 ± 1.026
LeuAsn: 1.325 ± 1.155
LeuPro: 4.636 ± 1.643
LeuGln: 1.987 ± 1.026
LeuArg: 2.649 ± 1.545
LeuSer: 7.947 ± 2.549
LeuThr: 3.974 ± 0.805
LeuVal: 3.311 ± 1.336
LeuTrp: 0.662 ± 0.559
LeuTyr: 0.662 ± 0.917
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.325 ± 0.883
MetCys: 1.325 ± 0.587
MetAsp: 1.325 ± 1.067
MetGlu: 0.662 ± 0.577
MetPhe: 0.0 ± 0.0
MetGly: 2.649 ± 0.787
MetHis: 1.325 ± 0.628
MetIle: 1.987 ± 1.011
MetLys: 1.325 ± 0.587
MetLeu: 1.987 ± 1.026
MetMet: 1.325 ± 1.128
MetAsn: 0.662 ± 0.559
MetPro: 0.0 ± 0.0
MetGln: 0.662 ± 0.559
MetArg: 0.0 ± 0.0
MetSer: 1.987 ± 0.9
MetThr: 0.662 ± 0.466
MetVal: 0.0 ± 0.0
MetTrp: 0.662 ± 1.045
MetTyr: 0.662 ± 0.577
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.974 ± 1.056
AsnCys: 1.325 ± 0.587
AsnAsp: 1.325 ± 0.883
AsnGlu: 4.636 ± 1.113
AsnPhe: 1.325 ± 0.536
AsnGly: 5.96 ± 2.978
AsnHis: 1.325 ± 0.628
AsnIle: 1.987 ± 1.039
AsnLys: 2.649 ± 1.572
AsnLeu: 2.649 ± 1.545
AsnMet: 1.325 ± 1.067
AsnAsn: 1.987 ± 1.398
AsnPro: 3.974 ± 2.208
AsnGln: 1.987 ± 0.88
AsnArg: 3.311 ± 0.765
AsnSer: 5.298 ± 1.608
AsnThr: 5.96 ± 1.991
AsnVal: 1.325 ± 1.043
AsnTrp: 0.662 ± 0.577
AsnTyr: 1.987 ± 1.012
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.0 ± 0.0
ProCys: 0.0 ± 0.0
ProAsp: 9.272 ± 4.579
ProGlu: 3.311 ± 1.343
ProPhe: 3.311 ± 1.579
ProGly: 0.0 ± 0.0
ProHis: 0.662 ± 0.559
ProIle: 2.649 ± 1.765
ProLys: 4.636 ± 1.257
ProLeu: 3.974 ± 0.887
ProMet: 0.662 ± 0.466
ProAsn: 4.636 ± 2.197
ProPro: 5.298 ± 2.546
ProGln: 1.987 ± 1.012
ProArg: 0.662 ± 0.559
ProSer: 3.311 ± 1.001
ProThr: 3.974 ± 1.643
ProVal: 3.311 ± 0.824
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.974 ± 2.775
GlnCys: 1.325 ± 0.932
GlnAsp: 1.325 ± 0.536
GlnGlu: 0.662 ± 0.577
GlnPhe: 3.974 ± 1.406
GlnGly: 3.974 ± 1.801
GlnHis: 1.325 ± 1.128
GlnIle: 0.662 ± 0.559
GlnLys: 2.649 ± 1.6
GlnLeu: 2.649 ± 1.118
GlnMet: 0.662 ± 0.559
GlnAsn: 3.311 ± 1.341
GlnPro: 0.0 ± 0.0
GlnGln: 3.311 ± 1.341
GlnArg: 1.987 ± 1.039
GlnSer: 1.987 ± 1.732
GlnThr: 5.96 ± 1.494
GlnVal: 1.325 ± 0.932
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.325 ± 0.932
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.649 ± 0.711
ArgCys: 0.0 ± 0.0
ArgAsp: 4.636 ± 2.264
ArgGlu: 0.662 ± 0.577
ArgPhe: 1.987 ± 0.9
ArgGly: 0.662 ± 0.466
ArgHis: 1.987 ± 1.069
ArgIle: 2.649 ± 1.724
ArgLys: 3.974 ± 1.519
ArgLeu: 1.987 ± 1.026
ArgMet: 1.325 ± 1.118
ArgAsn: 1.325 ± 1.118
ArgPro: 1.987 ± 0.822
ArgGln: 0.0 ± 0.0
ArgArg: 1.325 ± 1.118
ArgSer: 1.325 ± 0.628
ArgThr: 0.662 ± 0.559
ArgVal: 3.311 ± 2.079
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.662 ± 0.559
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.298 ± 0.921
SerCys: 2.649 ± 1.224
SerAsp: 3.311 ± 1.623
SerGlu: 1.325 ± 0.932
SerPhe: 4.636 ± 3.042
SerGly: 3.974 ± 1.214
SerHis: 1.987 ± 1.039
SerIle: 6.623 ± 1.476
SerLys: 5.298 ± 1.186
SerLeu: 4.636 ± 1.43
SerMet: 3.311 ± 0.694
SerAsn: 1.325 ± 0.587
SerPro: 5.298 ± 1.18
SerGln: 3.974 ± 0.809
SerArg: 3.311 ± 2.115
SerSer: 3.974 ± 1.329
SerThr: 2.649 ± 1.181
SerVal: 3.311 ± 1.341
SerTrp: 0.0 ± 0.0
SerTyr: 2.649 ± 1.127
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.649 ± 0.981
ThrCys: 1.325 ± 0.536
ThrAsp: 3.974 ± 1.064
ThrGlu: 5.96 ± 1.208
ThrPhe: 1.325 ± 0.883
ThrGly: 5.298 ± 2.35
ThrHis: 0.662 ± 0.577
ThrIle: 3.311 ± 1.407
ThrLys: 1.987 ± 1.012
ThrLeu: 2.649 ± 0.814
ThrMet: 0.0 ± 0.0
ThrAsn: 3.974 ± 1.643
ThrPro: 4.636 ± 1.645
ThrGln: 3.974 ± 0.805
ThrArg: 0.662 ± 0.466
ThrSer: 3.974 ± 2.119
ThrThr: 3.974 ± 0.887
ThrVal: 4.636 ± 1.156
ThrTrp: 1.325 ± 1.067
ThrTyr: 1.987 ± 0.822
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.311 ± 2.536
ValCys: 0.0 ± 0.0
ValAsp: 4.636 ± 1.354
ValGlu: 3.311 ± 1.838
ValPhe: 1.987 ± 1.676
ValGly: 5.298 ± 1.609
ValHis: 0.662 ± 0.577
ValIle: 2.649 ± 1.188
ValLys: 3.311 ± 0.95
ValLeu: 4.636 ± 1.867
ValMet: 0.0 ± 0.0
ValAsn: 1.325 ± 0.587
ValPro: 4.636 ± 2.016
ValGln: 2.649 ± 0.804
ValArg: 2.649 ± 1.272
ValSer: 3.311 ± 1.407
ValThr: 3.974 ± 1.34
ValVal: 5.96 ± 3.872
ValTrp: 0.662 ± 0.559
ValTyr: 4.636 ± 1.156
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.662 ± 0.466
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.325 ± 1.128
TrpGly: 0.662 ± 0.577
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.662 ± 0.577
TrpLeu: 1.325 ± 1.046
TrpMet: 0.0 ± 0.0
TrpAsn: 0.662 ± 1.045
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.662 ± 0.559
TrpSer: 2.649 ± 1.182
TrpThr: 0.0 ± 0.0
TrpVal: 0.662 ± 0.559
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.325 ± 0.628
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.325 ± 1.507
TyrCys: 0.662 ± 0.577
TyrAsp: 3.311 ± 1.141
TyrGlu: 3.311 ± 1.306
TyrPhe: 3.311 ± 0.765
TyrGly: 3.311 ± 1.512
TyrHis: 1.325 ± 0.628
TyrIle: 1.325 ± 0.628
TyrLys: 2.649 ± 1.181
TyrLeu: 2.649 ± 1.558
TyrMet: 0.662 ± 0.577
TyrAsn: 1.325 ± 0.587
TyrPro: 1.987 ± 1.073
TyrGln: 1.325 ± 0.536
TyrArg: 2.649 ± 0.804
TyrSer: 2.649 ± 0.604
TyrThr: 0.662 ± 0.466
TyrVal: 1.987 ± 1.047
TyrTrp: 0.662 ± 0.559
TyrTyr: 2.649 ± 1.127
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1511 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
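A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values serves as the error. Using the sample standard deviation as the error measure, the function names, the fixed seed, and the toy sequences are all assumptions for this example; the exact procedure behind the values in the table is not described here.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (not single residues)
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not the real proteome
toy = ["MKKA", "AKKL", "GGSG"]
kk_error = bootstrap_error(toy, "KK")
ww_error = bootstrap_error(toy, "WW")  # pair absent from every protein
```

Resampling at the protein level rather than the residue level keeps each protein's internal composition intact, which matters for a proteome of only a few proteins.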

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski