Developing Paradigms for Functional Genomics of Protein Kinases and Phosphoproteins Using the CDPK Superfamily

NSF 2010 Project MCB 0114769

PI: Alice Harmon, University of Florida
Co-PI: John Cushman, University of Nevada, Reno
Co-PI: Jeffery Harper, The Scripps Research Institiute
Co-PI: Estelle Hrabak, University of New Hampshire
Co-PI: Michael Sussman, Universtiy of Wisconsin, Madison


Information Available
  1. Proposal Abstract
  2. Specific Goals and Deliverables
  3. CPK Gene Table - Calcium-dependent Protein Kinases
  4. CRK Gene Table - CPK-Like Protein Kinases
  5. PPCK Gene Table - Phosphoenolpyruvate Carboxylase Protein Kinases
  6. PEPRK Gene Table - PPCK-Like Protein Kinases
  7. SnRK Gene Table - SNF1-Like Protein Kinases
  8. CPK-GFP Fusions (Hrabak lab)
  9. Putative phosphoproteins identified by MS - Harmon Lab
    [[Excel] format
  10. CPK4 Interactors (Excel format) by Yeast 2-hybrid, Cushman Lab

Return to index

Abstract

Protein phosphorylation is a major mode of regulation of metabolism, gene expression, and cellular architecture in eukaryotic cells, and defining phosphorylation-based regulatory networks is essential for understanding the function of the Arabidopsis genome. The goal of this project is to define phosphorylation networks that are related to the function of calcium-dependent protein kinases (CDPKs) and four closely related families; CDPK-related kinases (CRKs), phosphoenolpyruvate carboxylase kinases (PPCKs), PPCK-related kinases (PEPRKs), and SNF-1 related kinases (SnRKs). These 84 kinases, which represent about 9% of all the protein kinases in the Arabidopsis genome, are involved in all aspects of plant development and physiology and participate in the coupling of cellular responses to environmental and developmental signals. A list of these protein kinases including gene identification numbers is available below and at http://www.arabidopsis.org. Further information on the genes including links to database records is available at http://plantsp.genomics.purdue.edu. This research will investigate the function of 64 members of these families through determination of the subcellular location of each kinase and identification of downstream targets and other proteins with which the kinases associate. This information will give insight into the physiological roles of each kinase by identifying signaling networks in which each participates. This research will also determine the target sequences in substrate proteins that are phosphorylated by each kinase, and these results will contribute to understanding the overlap in kinase function and cross-talk between signaling pathways. The results of this work will be made available on a yearly basis at the two URLs given above.

For educational outreach, we will develop lectures, class materials, and virtual laboratory exercises on the subjects of plant protein phosphorylation and genomics, for use in junior colleges. We will conduct a pilot study in which the effectiveness of these materials is evaluated.

Return to index

Specific Goals

1. Identify kinase substrates using substrate traps. Three major approaches will be pursued: a) Use a proven CDPK substrate trap design to identify potential substrates in a yeast two-hybrid system; b) Develop and utilize alternative substrate trap designs of general application to all serine/threonine (Ser/Thr) kinases: c) Evaluate alternative interaction screens..

Deliverables

A. Identification of substrates and other proteins that interact with 64 CDPK superfamily members and with general applicability to the entire Arabidopsis proteome.

B. Development and evaluation of substrate trap designs of general applicability to all Ser/Thr kinases for identification of substrates/interacting proteins in planta.

C. Evaluation of alternative bacterial and yeast two-hybrid systems for high-throughput implementation.

2. Determine the subcellular locations of membrane-associated or compartmentalized kinases. Our approach will be to co-express and visualize by confocal microscopy both kinases tagged with yellow or green fluorescent protein and markers specific for different membrane or cytoskeletal proteins tagged with cyan or red fluorescent protein.

Deliverables

A. A set of targeted reporters for co-visualization controls tagged with DsRFP and CFP.

B. 64 YFP or GFP tagged kinases.

C. Subcellular locations determined by cytology for 64 kinases.

3. Use mass spectrometry to identify substrates phosphorylated by kinases in vitro and map the phosphorylation sites in substrates and kinases.

Deliverables.

  1. Identities of hundreds of proteins phosphorylated by the kinases in vitro.
  2. B. Consensus substrate phosphorylation motifs for CDPKs, CRKs, PPCKs, and some SnRKs.

C. Establishment of mass spectrometry facilities that, in the future, will be available to the plant science community for the analysis of phosphoproteins.

4. Develop lectures, class materials, and web-based "virtual laboratory" exercises related to the subject of plant protein kinases and genomics and targeted for community colleges.

Deliverables.

A. Class materials for a 6 week module to be incorporated into introductory biology or botany courses.

B. Pilot study on the use of the teaching materials in community colleges and evaluation of their effectiveness.



Gene Tables for the Arabidopsis CDPK/SnRK Gene Family

Accession numbers are gi numbers for protein records derived from genomic sequences except for those marked with #, which are protein records derived from cDNA sequences. Sequences that are incorrectly annotated in GenBank are marked with asterisks. The correct annotation for these sequences is available at http://plantsp.genomics.purdue.edu.



Return to index

Table 1. Calcium-Dependent Protein Kinases (CPKs)

Gene Family Name PlantsP ID Genbank accession Standard gene name
CPK1 21951 9758462 At5g04870
CPK2 21670 9837343 At3g10660
CPK3 21867 836944 At4g23650
CPK4 21816 5916441 At4g09570
CPK5 21913 3080419 At4g35310
CPK6 26192 2623752 At2g17290
CPK7 10200 1399277# none
CPK8 22074 836948 At5g19450
CPK9 21708 9294561 At3g20410
CPK10 21443 6730697 At1g18890
CPK11 20998 8778378* At1g35670
CPK12 21981 836938 At5g23580
CPK13 21791 1314711 At3g51850
CPK14 21512 1871195 At2g41860
CPK15 21845 2894572 At4g21940
CPK16 26190 2708745 At2g17890
CPK17 21973 9759385 At5g12180
CPK18 21917 3036811* At4g36070
CPK19 21152 3367525* At1g61950
CPK20 21594 3928078 At2g38910
CPK21 21808 4115943 At4g04720
CPK22 21807 4115942* At4g04710
CPK23 21809 4115945* At4g04740
CPK24 21538 4582467 At2g31500
CPK25 21601 4510380 At2g35890
CPK26 21919 4467129 At4g38230
CPK27 21806 5706728* At4g04700
CPK28 22061 10177136 At5g66210
CPK29 21002 6721111* At1g76040
CPK30 21084 5882721 At1g74740
CPK31 11158 5732059* none
CPK32 21777 6706424 At3g57530
CPK33 21103 11054627 At1g50700
CPK34 21369 none At5g19360


Return to index

Table 2. CPK-Related Kinases (CRKs)

Gene Family Name PlantsP ID Genbank accession Standard gene name
CRK1 21623 3402722 At2g41140
CRK2 21712 9294612 At3g19100
CRK3 21571 3831444 At2g46700
CRK4 21984 10177904 AT5g24430
CRK5 21757 6561993 At3g50530
CRK6 21752 6553918 At3g49370
CRK7 21790 9662995 At3g56760
CRK8 21393 10120419 At1g49580


Return to index

Table 3. PEP Carboxylase Kinases (PPCKs)

Gene Family Name PlantsP ID Genbank accession Standard gene name
PPCK1 21409 6318613 At1g08650
PPCK2 21203 6175181 At3g04530


Return to index

Table 4. PPCK-Related Kinases (PEPRKs)

Gene Family Name PlantsP ID Genbank accession Standard gene name
PEPRK1 21496 8778638 At1g12580
PEPRK2 21141 none At1g12680


Return to index

Table 5. SnF1-Related Kinases (SnRKs)

Naming convention: SnRKx.y, where x is the subgroup and y is the gene number.

Gene Family Name PlantsP ID Genbank accession Standard gene name
SnRK1.1 21659 6714479 At3g01090
SnRK1.2 21696 9294036 At3g29160
SnRK1.3 21323 10177691 At5g39440
SnRK2.1 21293 9759353 At5g08590
SnRK2.2 21756 6561990 At3g50500
SnRK2.3 22065 9758138 At5g66880
SnRK2.4 21446 1931648 At1g10940
SnRK2.5 21362 10177046 At5g63650
SnRK2.6 21907 3297819 At4g33950
SnRK2.7 21924 4490752 At4g40010
SnRK2.8 21445 8052538 At1g78290
SnRK2.9 21604 3169176 At2g23030
SnRK2.10 21102 6751684 At1g60940
SnRK3.1 21940 7329654 At5g01810
SnRK3.2 21956 7546685 At5g07070
SnRK3.3 21928 5302772 At4g14580
SnRK3.4 22037 9758325 At5g57630
SnRK3.5 22004 9758928 At5g45810
SnRK3.6 22005 9758929 At5g45820
SnRK3.7 21627 3337349 At2g34180
SnRK3.8 17265 8777339 At5g58380
SnRK3.9 21834 4539389 At4g18700
SnRK3.10 21706 9294189 At3g23000
SnRK3.11 21986 7453645 At5g35410
SnRK3.12 21033 6715641 At1g01140
SnRK3.13 21869 7446447 At4g24400
SnRK3.14 21894 2980770 At4g30960
SnRK3.15 21941 7329655 At5g01820
SnRK3.16 21205 9294138 At3g17510
SnRK3.17 21568 3885328 At2g26980
SnRK3.18 21575 6623874 At2g25090
SnRK3.19 21556 3786023 At2g38490
SnRK3.20 21389 9502417 At1g29230
SnRK3.21 21026 8778506 At1g48260
SnRK3.22 21529 2347199 At2g30360
SnRK3.23 26216 12320856 At1g30270
SnRK3.24 21967 8979727 At5g10930
SnRK3.25 21306 none At5g25110