Amino acid dipeptide frequency for Japanese iris necrotic ring virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides should be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.
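As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping dipeptides and reports them in per mille. The function names `dipeptide_permille` and `classify` are hypothetical, the sequences are assumed to use one-letter codes, and normalizing by the number of overlapping pairs is an assumption; the exact normalization used for the table is not stated on this page.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins, alphabet=STANDARD + "X"):
    """Return {dipeptide: frequency in per mille} for all 441 residue pairs.

    Assumption: frequencies are normalized by the total number of
    overlapping pairs across all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freqs, expected=1000.0 / 400):
    """Label each dipeptide relative to the uniform expectation of 2.5 per mille."""
    labels = {}
    for pair, freq in freqs.items():
        if freq > expected:
            labels[pair] = "over"
        elif freq < expected:
            labels[pair] = "under"
        else:
            labels[pair] = "as expected"
    return labels


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKVLAAGIV", "MAAKLV"])  # toy sequences, not real data
    print(freqs["AA"], classify(freqs)["AA"])
```

Multiplying a value from this dictionary by 10⁻³ gives the plain fraction, matching the unit convention used in the table.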

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error (‰).

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 6.17 ± 1.362 | 1.234 ± 0.369 | 0.0 ± 0.0 | 3.291 ± 0.298 | 2.468 ± 0.647 | 2.879 ± 0.617 | 2.057 ± 0.449 | 4.936 ± 1.052 | 2.057 ± 0.364 | 6.993 ± 0.471 | 2.468 ± 1.041 | 3.702 ± 0.521 | 2.468 ± 0.228 | 2.879 ± 1.144 | 10.695 ± 0.844 | 6.17 ± 0.922 | 5.759 ± 1.086 | 5.759 ± 0.723 | 0.411 ± 0.436 | 0.823 ± 0.872 | 0.0 ± 0.0
Cys | 1.645 ± 0.476 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.057 ± 0.449 | 0.0 ± 0.0 | 3.291 ± 0.298 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.057 ± 0.449 | 2.879 ± 0.644 | 0.823 ± 0.238 | 0.0 ± 0.0 | 0.411 ± 0.436 | 2.057 ± 0.449 | 5.759 ± 0.72 | 0.0 ± 0.0 | 1.645 ± 0.476 | 2.057 ± 0.449 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 4.525 ± 0.542 | 2.057 ± 0.449 | 1.234 ± 0.323 | 1.234 ± 0.323 | 3.702 ± 0.521 | 3.702 ± 0.421 | 0.0 ± 0.0 | 1.645 ± 0.73 | 0.0 ± 0.0 | 3.291 ± 0.411 | 0.411 ± 0.436 | 2.468 ± 0.528 | 3.702 ± 1.496 | 0.823 ± 0.238 | 1.234 ± 0.323 | 3.702 ± 0.497 | 0.823 ± 0.238 | 2.057 ± 0.449 | 0.0 ± 0.0 | 1.234 ± 0.369 | 0.0 ± 0.0
Glu | 2.057 ± 0.449 | 0.823 ± 0.238 | 2.468 ± 0.738 | 4.525 ± 1.006 | 0.823 ± 0.238 | 1.234 ± 0.323 | 2.468 ± 0.714 | 2.057 ± 0.703 | 2.468 ± 0.647 | 5.759 ± 1.4 | 1.234 ± 0.323 | 0.823 ± 0.238 | 3.291 ± 0.411 | 2.057 ± 0.449 | 6.582 ± 1.603 | 6.17 ± 0.822 | 3.702 ± 1.161 | 2.468 ± 0.683 | 0.0 ± 0.0 | 1.645 ± 0.73 | 0.0 ± 0.0
Phe | 5.759 ± 0.96 | 0.823 ± 0.238 | 1.645 ± 0.476 | 2.468 ± 0.714 | 0.823 ± 0.238 | 6.993 ± 0.849 | 3.291 ± 0.736 | 1.645 ± 0.359 | 1.234 ± 0.369 | 0.823 ± 0.238 | 0.411 ± 0.539 | 0.411 ± 0.436 | 0.411 ± 0.436 | 0.823 ± 0.238 | 1.234 ± 0.323 | 3.291 ± 0.298 | 6.17 ± 1.034 | 3.291 ± 0.945 | 0.823 ± 0.238 | 1.234 ± 0.323 | 0.0 ± 0.0
Gly | 5.759 ± 1.005 | 1.645 ± 0.561 | 5.348 ± 1.104 | 3.702 ± 1.204 | 4.114 ± 1.031 | 8.638 ± 1.21 | 0.0 ± 0.0 | 6.582 ± 1.431 | 2.879 ± 0.51 | 9.872 ± 1.027 | 1.234 ± 0.621 | 0.823 ± 0.238 | 2.879 ± 1.042 | 0.411 ± 0.436 | 0.823 ± 0.238 | 2.057 ± 0.703 | 3.702 ± 0.55 | 5.759 ± 0.89 | 1.234 ± 0.323 | 2.057 ± 0.449 | 0.0 ± 0.0
His | 2.468 ± 0.738 | 0.0 ± 0.0 | 1.645 ± 0.359 | 0.0 ± 0.0 | 4.114 ± 1.373 | 0.0 ± 0.0 | 3.291 ± 1.607 | 1.645 ± 0.561 | 0.823 ± 0.54 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.468 ± 0.714 | 2.057 ± 0.449 | 0.823 ± 0.238 | 1.234 ± 0.369 | 2.879 ± 0.523 | 1.645 ± 0.359 | 2.879 ± 0.617 | 0.0 ± 0.0 | 0.823 ± 0.54 | 0.0 ± 0.0
Ile | 1.645 ± 0.73 | 0.823 ± 0.238 | 1.234 ± 0.323 | 2.879 ± 0.523 | 2.879 ± 1.006 | 2.879 ± 0.745 | 0.0 ± 0.0 | 1.234 ± 0.621 | 0.823 ± 0.238 | 2.468 ± 0.572 | 0.0 ± 0.0 | 4.525 ± 0.946 | 3.291 ± 0.411 | 2.468 ± 0.683 | 2.468 ± 0.228 | 3.291 ± 0.411 | 5.348 ± 0.372 | 3.702 ± 0.421 | 1.645 ± 0.476 | 4.114 ± 1.031 | 0.0 ± 0.0
Lys | 4.936 ± 1.428 | 0.0 ± 0.0 | 1.645 ± 0.476 | 0.0 ± 0.0 | 0.823 ± 0.238 | 1.645 ± 0.476 | 0.0 ± 0.0 | 0.823 ± 0.238 | 2.468 ± 0.228 | 5.759 ± 1.556 | 0.411 ± 0.288 | 1.234 ± 0.428 | 3.702 ± 0.82 | 1.234 ± 0.369 | 5.348 ± 1.023 | 1.234 ± 0.323 | 1.234 ± 0.621 | 3.291 ± 0.298 | 1.645 ± 0.476 | 2.057 ± 0.364 | 0.0 ± 0.0
Leu | 6.17 ± 0.715 | 4.114 ± 0.898 | 3.702 ± 0.964 | 3.702 ± 0.82 | 1.234 ± 0.323 | 4.114 ± 0.804 | 2.057 ± 0.449 | 4.936 ± 1.546 | 4.936 ± 1.428 | 8.227 ± 2.017 | 1.234 ± 0.369 | 4.114 ± 0.552 | 1.645 ± 0.567 | 2.879 ± 0.827 | 6.993 ± 1.505 | 11.107 ± 1.035 | 5.348 ± 0.372 | 11.107 ± 1.764 | 3.702 ± 1.101 | 1.234 ± 0.323 | 0.0 ± 0.0
Met | 2.879 ± 0.532 | 0.823 ± 0.238 | 1.645 ± 0.71 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.057 ± 0.449 | 0.0 ± 0.0 | 0.411 ± 0.436 | 0.0 ± 0.0 | 4.114 ± 0.724 | 0.823 ± 0.238 | 2.879 ± 0.617 | 0.823 ± 0.54 | 0.823 ± 0.238 | 0.411 ± 0.436 | 2.057 ± 0.449 | 0.823 ± 0.872 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.411 ± 0.436 | 0.0 ± 0.0
Asn | 0.411 ± 0.568 | 2.879 ± 0.523 | 1.645 ± 0.561 | 2.057 ± 0.449 | 2.057 ± 0.364 | 3.291 ± 0.837 | 2.879 ± 0.779 | 4.114 ± 1.19 | 1.234 ± 0.369 | 2.468 ± 0.228 | 0.823 ± 0.437 | 2.468 ± 0.714 | 4.525 ± 0.563 | 0.0 ± 0.0 | 1.645 ± 0.359 | 2.057 ± 0.364 | 2.057 ± 0.449 | 1.234 ± 0.323 | 0.0 ± 0.0 | 2.057 ± 0.372 | 1.234 ± 0.428
Pro | 2.057 ± 0.364 | 2.057 ± 0.364 | 2.057 ± 0.708 | 2.879 ± 0.617 | 0.0 ± 0.0 | 1.645 ± 0.817 | 0.0 ± 0.0 | 2.879 ± 1.042 | 3.702 ± 0.421 | 7.404 ± 1.588 | 1.234 ± 0.369 | 0.823 ± 0.54 | 1.234 ± 0.525 | 4.525 ± 0.587 | 2.879 ± 0.523 | 4.525 ± 1.308 | 4.936 ± 1.389 | 3.291 ± 0.671 | 2.057 ± 0.449 | 2.057 ± 0.364 | 0.0 ± 0.0
Gln | 3.702 ± 0.961 | 0.0 ± 0.0 | 1.234 ± 0.369 | 1.234 ± 0.369 | 2.468 ± 0.672 | 1.234 ± 0.323 | 2.879 ± 0.644 | 3.702 ± 0.749 | 1.234 ± 0.323 | 4.114 ± 0.488 | 0.823 ± 0.238 | 0.411 ± 0.568 | 4.936 ± 0.998 | 3.291 ± 1.006 | 1.645 ± 0.476 | 2.057 ± 0.364 | 2.879 ± 0.745 | 3.702 ± 0.664 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Arg | 3.702 ± 0.961 | 1.645 ± 0.561 | 1.645 ± 0.567 | 7.816 ± 1.724 | 5.348 ± 0.692 | 6.17 ± 0.949 | 2.468 ± 0.738 | 2.879 ± 0.523 | 2.468 ± 0.714 | 7.404 ± 0.801 | 2.879 ± 0.523 | 2.057 ± 0.449 | 2.468 ± 0.672 | 1.645 ± 0.927 | 4.525 ± 1.038 | 2.879 ± 1.795 | 2.057 ± 0.364 | 7.404 ± 0.684 | 0.0 ± 0.0 | 2.057 ± 0.629 | 0.0 ± 0.0
Ser | 4.936 ± 1.834 | 2.057 ± 0.364 | 6.582 ± 1.999 | 3.702 ± 1.118 | 3.291 ± 0.298 | 6.582 ± 0.929 | 1.645 ± 0.359 | 1.645 ± 0.476 | 2.468 ± 0.714 | 8.227 ± 0.663 | 2.879 ± 0.582 | 3.702 ± 0.749 | 3.291 ± 1.55 | 4.114 ± 0.724 | 3.291 ± 0.737 | 4.114 ± 1.311 | 2.468 ± 1.246 | 6.582 ± 0.621 | 1.645 ± 0.359 | 1.645 ± 0.476 | 0.0 ± 0.0
Thr | 2.468 ± 0.228 | 0.0 ± 0.0 | 0.411 ± 0.436 | 2.057 ± 0.516 | 5.759 ± 0.928 | 2.468 ± 1.59 | 2.057 ± 0.449 | 3.291 ± 1.006 | 2.879 ± 0.561 | 3.291 ± 1.55 | 2.057 ± 0.494 | 3.291 ± 0.737 | 5.348 ± 1.227 | 4.525 ± 0.918 | 6.582 ± 0.901 | 3.291 ± 0.671 | 4.936 ± 3.033 | 6.993 ± 1.091 | 0.0 ± 0.0 | 1.645 ± 0.567 | 0.0 ± 0.0
Val | 10.695 ± 1.619 | 0.0 ± 0.0 | 2.879 ± 0.617 | 3.702 ± 0.726 | 2.057 ± 0.449 | 8.227 ± 1.272 | 2.879 ± 0.51 | 1.234 ± 0.323 | 4.114 ± 0.727 | 7.816 ± 0.347 | 0.823 ± 0.238 | 3.291 ± 0.298 | 4.525 ± 0.894 | 2.879 ± 0.979 | 5.348 ± 0.64 | 6.993 ± 1.824 | 4.936 ± 1.076 | 4.936 ± 1.929 | 1.234 ± 0.525 | 2.468 ± 0.714 | 0.0 ± 0.0
Trp | 0.411 ± 0.436 | 1.234 ± 0.369 | 0.823 ± 0.54 | 0.0 ± 0.0 | 0.823 ± 0.238 | 0.823 ± 0.238 | 0.823 ± 0.238 | 0.0 ± 0.0 | 1.234 ± 0.323 | 0.823 ± 0.54 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.645 ± 0.476 | 0.823 ± 0.238 | 3.702 ± 0.497 | 0.0 ± 0.0 | 1.234 ± 0.369 | 0.0 ± 0.0
Tyr | 0.823 ± 0.238 | 2.468 ± 0.738 | 0.0 ± 0.0 | 5.348 ± 0.64 | 0.823 ± 0.238 | 0.823 ± 0.872 | 0.0 ± 0.0 | 1.645 ± 0.561 | 0.411 ± 0.288 | 0.823 ± 0.238 | 0.0 ± 0.0 | 1.645 ± 0.476 | 1.234 ± 0.323 | 3.291 ± 0.952 | 0.823 ± 0.447 | 5.348 ± 0.883 | 2.468 ± 0.647 | 0.823 ± 0.238 | 0.0 ± 0.0 | 0.823 ± 0.238 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.823 ± 0.238 | 0.0 ± 0.0 | 0.411 ± 0.288 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 6 proteins (2432 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
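The protein-level bootstrap mentioned in the note can be sketched as follows: whole proteins are resampled with replacement, the statistic is recomputed for each resample, and the spread of the estimates is reported as the error. This is only an illustration under stated assumptions. The helper names `pair_permille` and `bootstrap_error` are hypothetical, and using the population standard deviation over 100 rounds as the error is an assumption, since the page does not say which spread measure is used.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (two one-letter codes) in per mille."""
    hits = sum(
        seq[i:i + 2] == pair
        for seq in proteins
        for i in range(len(seq) - 1)
    )
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, stat, n_rounds=100, seed=0):
    """Estimate the error of stat(proteins) by resampling whole proteins.

    Assumption: the error is the population standard deviation of the
    statistic across n_rounds resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(stat(sample))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy = ["MKVLAAGIV", "MAAKLV", "MLLSVA"]  # toy sequences, not real data
    value = pair_permille(toy, "AA")
    error = bootstrap_error(toy, lambda p: pair_permille(p, "AA"))
    print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling proteins instead of individual residues keeps each sequence intact, so with only a few proteins the estimated error can be large relative to the frequency itself.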

The dipeptide statistics above (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski