Amino acid dipepetide frequency for Infectious pancreatic necrosis virus (strain Jasper) (IPNV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.656AlaAla: 8.656 ± 1.969
0.509AlaCys: 0.509 ± 0.346
4.582AlaAsp: 4.582 ± 0.992
5.092AlaGlu: 5.092 ± 1.189
2.546AlaPhe: 2.546 ± 0.594
5.601AlaGly: 5.601 ± 1.442
1.018AlaHis: 1.018 ± 0.691
4.582AlaIle: 4.582 ± 1.129
5.092AlaLys: 5.092 ± 0.381
6.619AlaLeu: 6.619 ± 0.998
2.037AlaMet: 2.037 ± 0.906
3.055AlaAsn: 3.055 ± 0.669
5.601AlaPro: 5.601 ± 1.204
2.037AlaGln: 2.037 ± 0.446
3.055AlaArg: 3.055 ± 0.826
6.619AlaSer: 6.619 ± 3.141
6.619AlaThr: 6.619 ± 1.606
3.055AlaVal: 3.055 ± 0.669
0.509AlaTrp: 0.509 ± 0.346
3.055AlaTyr: 3.055 ± 0.826
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.509CysPhe: 0.509 ± 0.346
1.018CysGly: 1.018 ± 0.223
0.509CysHis: 0.509 ± 1.342
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
1.018CysLeu: 1.018 ± 1.294
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.509CysPro: 0.509 ± 1.342
0.509CysGln: 0.509 ± 0.346
0.509CysArg: 0.509 ± 0.346
0.0CysSer: 0.0 ± 0.0
1.527CysThr: 1.527 ± 1.126
0.509CysVal: 0.509 ± 0.346
0.0CysTrp: 0.0 ± 0.0
0.509CysTyr: 0.509 ± 0.346
0.0CysXaa: 0.0 ± 0.0
Asp
2.546AspAla: 2.546 ± 1.085
0.0AspCys: 0.0 ± 0.0
3.055AspAsp: 3.055 ± 1.425
3.564AspGlu: 3.564 ± 0.787
3.055AspPhe: 3.055 ± 0.864
3.055AspGly: 3.055 ± 1.429
1.018AspHis: 1.018 ± 0.223
5.601AspIle: 5.601 ± 1.922
3.564AspLys: 3.564 ± 0.92
10.183AspLeu: 10.183 ± 0.85
1.018AspMet: 1.018 ± 0.691
3.564AspAsn: 3.564 ± 1.434
4.582AspPro: 4.582 ± 1.019
3.564AspGln: 3.564 ± 1.174
0.509AspArg: 0.509 ± 0.39
3.055AspSer: 3.055 ± 1.675
1.527AspThr: 1.527 ± 0.533
3.055AspVal: 3.055 ± 0.932
1.018AspTrp: 1.018 ± 0.779
0.509AspTyr: 0.509 ± 0.346
0.0AspXaa: 0.0 ± 0.0
Glu
7.128GluAla: 7.128 ± 1.373
0.0GluCys: 0.0 ± 0.0
4.582GluAsp: 4.582 ± 1.129
3.564GluGlu: 3.564 ± 1.174
2.037GluPhe: 2.037 ± 0.906
4.073GluGly: 4.073 ± 1.047
1.018GluHis: 1.018 ± 1.271
3.564GluIle: 3.564 ± 2.32
4.073GluLys: 4.073 ± 1.244
5.092GluLeu: 5.092 ± 0.381
1.527GluMet: 1.527 ± 0.432
3.055GluAsn: 3.055 ± 0.669
2.037GluPro: 2.037 ± 1.048
1.018GluGln: 1.018 ± 0.779
3.564GluArg: 3.564 ± 0.705
3.055GluSer: 3.055 ± 1.216
7.128GluThr: 7.128 ± 0.673
5.092GluVal: 5.092 ± 1.603
2.037GluTrp: 2.037 ± 1.048
1.018GluTyr: 1.018 ± 0.691
0.0GluXaa: 0.0 ± 0.0
Phe
2.037PheAla: 2.037 ± 0.75
0.0PheCys: 0.0 ± 0.0
2.037PheAsp: 2.037 ± 0.75
1.018PheGlu: 1.018 ± 0.691
0.509PhePhe: 0.509 ± 0.39
2.037PheGly: 2.037 ± 0.446
1.018PheHis: 1.018 ± 0.779
1.527PheIle: 1.527 ± 0.432
2.546PheLys: 2.546 ± 1.085
2.546PheLeu: 2.546 ± 1.289
0.509PheMet: 0.509 ± 0.39
1.018PheAsn: 1.018 ± 0.223
2.546PhePro: 2.546 ± 0.594
0.509PheGln: 0.509 ± 0.346
0.509PheArg: 0.509 ± 0.39
1.527PheSer: 1.527 ± 0.533
2.546PheThr: 2.546 ± 1.085
0.509PheVal: 0.509 ± 0.39
0.509PheTrp: 0.509 ± 0.39
1.018PheTyr: 1.018 ± 0.223
0.0PheXaa: 0.0 ± 0.0
Gly
5.601GlyAla: 5.601 ± 2.509
1.018GlyCys: 1.018 ± 0.691
4.073GlyAsp: 4.073 ± 0.892
4.073GlyGlu: 4.073 ± 1.015
1.527GlyPhe: 1.527 ± 0.533
3.564GlyGly: 3.564 ± 0.787
0.0GlyHis: 0.0 ± 0.0
4.582GlyIle: 4.582 ± 1.926
3.055GlyLys: 3.055 ± 1.675
5.092GlyLeu: 5.092 ± 1.603
1.018GlyMet: 1.018 ± 0.223
3.055GlyAsn: 3.055 ± 1.216
5.092GlyPro: 5.092 ± 2.26
2.037GlyGln: 2.037 ± 0.446
5.092GlyArg: 5.092 ± 2.163
6.11GlySer: 6.11 ± 1.377
3.564GlyThr: 3.564 ± 1.1
3.564GlyVal: 3.564 ± 1.174
2.037GlyTrp: 2.037 ± 0.906
1.527GlyTyr: 1.527 ± 1.169
0.0GlyXaa: 0.0 ± 0.0
His
0.509HisAla: 0.509 ± 0.346
0.0HisCys: 0.0 ± 0.0
1.018HisAsp: 1.018 ± 0.223
1.527HisGlu: 1.527 ± 1.126
0.0HisPhe: 0.0 ± 0.0
1.018HisGly: 1.018 ± 0.223
0.0HisHis: 0.0 ± 0.0
1.018HisIle: 1.018 ± 1.271
0.509HisLys: 0.509 ± 0.39
2.546HisLeu: 2.546 ± 0.912
1.527HisMet: 1.527 ± 0.533
0.509HisAsn: 0.509 ± 1.342
0.509HisPro: 0.509 ± 1.342
1.018HisGln: 1.018 ± 0.691
0.509HisArg: 0.509 ± 0.346
0.509HisSer: 0.509 ± 1.342
1.018HisThr: 1.018 ± 0.691
0.509HisVal: 0.509 ± 0.39
0.0HisTrp: 0.0 ± 0.0
1.018HisTyr: 1.018 ± 1.271
0.0HisXaa: 0.0 ± 0.0
Ile
4.582IleAla: 4.582 ± 0.992
0.509IleCys: 0.509 ± 0.346
2.037IleAsp: 2.037 ± 0.446
0.509IleGlu: 0.509 ± 0.39
1.527IlePhe: 1.527 ± 0.432
2.546IleGly: 2.546 ± 1.085
0.509IleHis: 0.509 ± 1.342
1.018IleIle: 1.018 ± 0.779
2.546IleLys: 2.546 ± 1.085
4.073IleLeu: 4.073 ± 0.603
1.018IleMet: 1.018 ± 0.223
2.037IleAsn: 2.037 ± 1.559
5.092IlePro: 5.092 ± 1.603
3.564IleGln: 3.564 ± 5.045
5.601IleArg: 5.601 ± 4.641
1.018IleSer: 1.018 ± 0.691
7.128IleThr: 7.128 ± 1.872
2.546IleVal: 2.546 ± 0.718
0.509IleTrp: 0.509 ± 0.39
2.546IleTyr: 2.546 ± 1.217
0.0IleXaa: 0.0 ± 0.0
Lys
7.128LysAla: 7.128 ± 0.083
0.0LysCys: 0.0 ± 0.0
2.546LysAsp: 2.546 ± 1.289
3.564LysGlu: 3.564 ± 1.424
1.018LysPhe: 1.018 ± 0.691
5.092LysGly: 5.092 ± 1.435
2.546LysHis: 2.546 ± 0.912
2.546LysIle: 2.546 ± 0.718
2.546LysLys: 2.546 ± 1.289
3.564LysLeu: 3.564 ± 0.92
1.018LysMet: 1.018 ± 0.691
3.055LysAsn: 3.055 ± 0.669
5.092LysPro: 5.092 ± 0.381
1.527LysGln: 1.527 ± 0.533
3.055LysArg: 3.055 ± 1.065
3.564LysSer: 3.564 ± 2.365
6.11LysThr: 6.11 ± 2.032
1.018LysVal: 1.018 ± 0.691
0.509LysTrp: 0.509 ± 1.342
3.564LysTyr: 3.564 ± 1.174
0.0LysXaa: 0.0 ± 0.0
Leu
6.11LeuAla: 6.11 ± 1.634
0.0LeuCys: 0.0 ± 0.0
6.11LeuAsp: 6.11 ± 2.13
9.165LeuGlu: 9.165 ± 2.225
1.018LeuPhe: 1.018 ± 0.223
2.546LeuGly: 2.546 ± 0.594
0.0LeuHis: 0.0 ± 0.0
4.582LeuIle: 4.582 ± 0.685
6.619LeuLys: 6.619 ± 1.56
11.202LeuLeu: 11.202 ± 3.344
3.564LeuMet: 3.564 ± 0.92
5.092LeuAsn: 5.092 ± 0.902
8.656LeuPro: 8.656 ± 1.766
3.564LeuGln: 3.564 ± 0.705
6.619LeuArg: 6.619 ± 3.387
5.601LeuSer: 5.601 ± 0.518
6.11LeuThr: 6.11 ± 0.639
6.619LeuVal: 6.619 ± 1.11
0.0LeuTrp: 0.0 ± 0.0
1.018LeuTyr: 1.018 ± 0.223
0.0LeuXaa: 0.0 ± 0.0
Met
1.527MetAla: 1.527 ± 1.292
0.509MetCys: 0.509 ± 0.346
2.546MetAsp: 2.546 ± 0.594
0.509MetGlu: 0.509 ± 0.39
0.509MetPhe: 0.509 ± 0.39
0.509MetGly: 0.509 ± 0.346
0.0MetHis: 0.0 ± 0.0
2.037MetIle: 2.037 ± 0.75
2.037MetLys: 2.037 ± 0.906
0.509MetLeu: 0.509 ± 0.346
0.509MetMet: 0.509 ± 0.39
2.546MetAsn: 2.546 ± 1.217
0.0MetPro: 0.0 ± 0.0
1.018MetGln: 1.018 ± 0.223
1.018MetArg: 1.018 ± 0.691
3.564MetSer: 3.564 ± 0.92
1.527MetThr: 1.527 ± 0.533
3.564MetVal: 3.564 ± 0.92
0.0MetTrp: 0.0 ± 0.0
1.018MetTyr: 1.018 ± 0.223
0.0MetXaa: 0.0 ± 0.0
Asn
2.546AsnAla: 2.546 ± 0.594
0.509AsnCys: 0.509 ± 0.39
3.055AsnAsp: 3.055 ± 0.864
2.546AsnGlu: 2.546 ± 1.289
0.509AsnPhe: 0.509 ± 0.39
3.055AsnGly: 3.055 ± 2.074
1.018AsnHis: 1.018 ± 1.294
2.546AsnIle: 2.546 ± 1.217
2.546AsnLys: 2.546 ± 1.765
3.564AsnLeu: 3.564 ± 0.705
1.018AsnMet: 1.018 ± 0.223
3.564AsnAsn: 3.564 ± 0.92
5.601AsnPro: 5.601 ± 1.672
3.055AsnGln: 3.055 ± 0.864
1.527AsnArg: 1.527 ± 2.607
2.546AsnSer: 2.546 ± 1.085
2.546AsnThr: 2.546 ± 1.289
1.018AsnVal: 1.018 ± 0.223
0.509AsnTrp: 0.509 ± 0.39
3.564AsnTyr: 3.564 ± 1.767
0.0AsnXaa: 0.0 ± 0.0
Pro
4.073ProAla: 4.073 ± 1.5
0.0ProCys: 0.0 ± 0.0
5.092ProAsp: 5.092 ± 0.902
5.601ProGlu: 5.601 ± 1.705
1.527ProPhe: 1.527 ± 0.533
5.092ProGly: 5.092 ± 1.603
1.018ProHis: 1.018 ± 0.223
3.055ProIle: 3.055 ± 0.864
6.619ProLys: 6.619 ± 2.401
4.582ProLeu: 4.582 ± 1.019
1.018ProMet: 1.018 ± 0.223
2.546ProAsn: 2.546 ± 0.594
1.527ProPro: 1.527 ± 0.432
4.582ProGln: 4.582 ± 1.129
4.582ProArg: 4.582 ± 0.511
4.073ProSer: 4.073 ± 0.603
6.619ProThr: 6.619 ± 0.408
4.582ProVal: 4.582 ± 0.685
0.0ProTrp: 0.0 ± 0.0
1.527ProTyr: 1.527 ± 0.533
0.0ProXaa: 0.0 ± 0.0
Gln
2.546GlnAla: 2.546 ± 0.718
0.0GlnCys: 0.0 ± 0.0
3.055GlnAsp: 3.055 ± 0.864
2.037GlnGlu: 2.037 ± 1.383
2.037GlnPhe: 2.037 ± 0.75
3.564GlnGly: 3.564 ± 2.32
1.018GlnHis: 1.018 ± 0.223
0.509GlnIle: 0.509 ± 0.39
1.018GlnLys: 1.018 ± 0.223
6.11GlnLeu: 6.11 ± 1.615
2.546GlnMet: 2.546 ± 0.64
0.509GlnAsn: 0.509 ± 0.39
0.509GlnPro: 0.509 ± 0.346
2.546GlnGln: 2.546 ± 0.718
1.018GlnArg: 1.018 ± 0.779
2.546GlnSer: 2.546 ± 1.085
3.564GlnThr: 3.564 ± 0.92
2.037GlnVal: 2.037 ± 1.105
1.018GlnTrp: 1.018 ± 1.271
0.509GlnTyr: 0.509 ± 0.39
0.0GlnXaa: 0.0 ± 0.0
Arg
2.546ArgAla: 2.546 ± 0.594
1.527ArgCys: 1.527 ± 2.591
3.564ArgAsp: 3.564 ± 2.197
4.582ArgGlu: 4.582 ± 3.731
1.527ArgPhe: 1.527 ± 0.432
3.055ArgGly: 3.055 ± 1.216
1.527ArgHis: 1.527 ± 2.591
3.055ArgIle: 3.055 ± 2.251
4.073ArgLys: 4.073 ± 2.096
6.11ArgLeu: 6.11 ± 1.338
1.018ArgMet: 1.018 ± 0.691
2.546ArgAsn: 2.546 ± 1.217
4.582ArgPro: 4.582 ± 0.511
2.546ArgGln: 2.546 ± 0.718
4.073ArgArg: 4.073 ± 1.047
1.527ArgSer: 1.527 ± 1.126
2.546ArgThr: 2.546 ± 1.085
2.037ArgVal: 2.037 ± 0.75
1.527ArgTrp: 1.527 ± 1.126
3.055ArgTyr: 3.055 ± 0.669
0.0ArgXaa: 0.0 ± 0.0
Ser
4.582SerAla: 4.582 ± 1.598
1.527SerCys: 1.527 ± 1.36
2.037SerAsp: 2.037 ± 0.906
4.073SerGlu: 4.073 ± 0.603
1.527SerPhe: 1.527 ± 0.533
6.11SerGly: 6.11 ± 0.794
1.527SerHis: 1.527 ± 1.037
3.055SerIle: 3.055 ± 0.864
3.055SerLys: 3.055 ± 1.216
6.11SerLeu: 6.11 ± 1.728
1.527SerMet: 1.527 ± 0.533
2.037SerAsn: 2.037 ± 1.048
3.564SerPro: 3.564 ± 1.434
1.018SerGln: 1.018 ± 0.691
3.055SerArg: 3.055 ± 2.319
6.11SerSer: 6.11 ± 0.162
5.601SerThr: 5.601 ± 1.204
3.055SerVal: 3.055 ± 2.251
2.037SerTrp: 2.037 ± 1.048
2.546SerTyr: 2.546 ± 0.594
0.0SerXaa: 0.0 ± 0.0
Thr
8.147ThrAla: 8.147 ± 1.777
0.0ThrCys: 0.0 ± 0.0
3.564ThrAsp: 3.564 ± 0.705
3.055ThrGlu: 3.055 ± 0.826
3.055ThrPhe: 3.055 ± 1.065
6.619ThrGly: 6.619 ± 0.297
0.509ThrHis: 0.509 ± 0.39
3.564ThrIle: 3.564 ± 2.063
5.092ThrLys: 5.092 ± 0.902
7.128ThrLeu: 7.128 ± 1.839
1.018ThrMet: 1.018 ± 0.223
3.564ThrAsn: 3.564 ± 1.1
7.128ThrPro: 7.128 ± 1.839
2.546ThrGln: 2.546 ± 1.217
6.619ThrArg: 6.619 ± 2.085
5.092ThrSer: 5.092 ± 2.17
3.564ThrThr: 3.564 ± 0.92
4.582ThrVal: 4.582 ± 1.833
1.018ThrTrp: 1.018 ± 0.779
2.037ThrTyr: 2.037 ± 0.446
0.0ThrXaa: 0.0 ± 0.0
Val
4.582ValAla: 4.582 ± 0.685
1.018ValCys: 1.018 ± 1.271
3.055ValAsp: 3.055 ± 1.065
6.11ValGlu: 6.11 ± 0.639
1.527ValPhe: 1.527 ± 1.037
2.546ValGly: 2.546 ± 1.289
0.509ValHis: 0.509 ± 0.346
2.037ValIle: 2.037 ± 1.4
3.055ValLys: 3.055 ± 0.669
3.564ValLeu: 3.564 ± 0.787
2.037ValMet: 2.037 ± 0.542
2.037ValAsn: 2.037 ± 1.383
2.546ValPro: 2.546 ± 0.594
0.509ValGln: 0.509 ± 0.346
2.546ValArg: 2.546 ± 0.594
3.564ValSer: 3.564 ± 0.787
5.092ValThr: 5.092 ± 0.902
4.582ValVal: 4.582 ± 1.296
1.527ValTrp: 1.527 ± 0.432
0.509ValTyr: 0.509 ± 0.346
0.0ValXaa: 0.0 ± 0.0
Trp
3.055TrpAla: 3.055 ± 2.396
0.0TrpCys: 0.0 ± 0.0
1.018TrpAsp: 1.018 ± 1.294
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.018TrpGly: 1.018 ± 0.691
0.0TrpHis: 0.0 ± 0.0
1.018TrpIle: 1.018 ± 1.271
0.0TrpLys: 0.0 ± 0.0
1.018TrpLeu: 1.018 ± 0.223
0.0TrpMet: 0.0 ± 0.0
1.527TrpAsn: 1.527 ± 0.533
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.527TrpArg: 1.527 ± 0.432
2.037TrpSer: 2.037 ± 1.105
1.018TrpThr: 1.018 ± 0.223
0.509TrpVal: 0.509 ± 0.39
0.0TrpTrp: 0.0 ± 0.0
1.018TrpTyr: 1.018 ± 0.779
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.037TyrAla: 2.037 ± 1.048
0.0TyrCys: 0.0 ± 0.0
1.018TyrAsp: 1.018 ± 0.691
4.073TyrGlu: 4.073 ± 2.111
0.509TyrPhe: 0.509 ± 0.346
3.564TyrGly: 3.564 ± 0.92
0.509TyrHis: 0.509 ± 0.346
1.018TyrIle: 1.018 ± 0.779
1.018TyrLys: 1.018 ± 0.223
3.564TyrLeu: 3.564 ± 0.92
1.018TyrMet: 1.018 ± 1.271
1.527TyrAsn: 1.527 ± 0.432
2.546TyrPro: 2.546 ± 1.289
1.527TyrGln: 1.527 ± 0.432
2.037TyrArg: 2.037 ± 0.75
2.037TyrSer: 2.037 ± 0.906
2.546TyrThr: 2.546 ± 1.217
1.018TyrVal: 1.018 ± 0.691
0.0TyrTrp: 0.0 ± 0.0
1.527TyrTyr: 1.527 ± 0.533
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1965 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski