[net.sport.baseball] Carter vs Pena

djvh@drutx.UUCP (VanHandelDJ) (08/12/85)

First off, I agree with David Rubin that, at least offensively, Carter
is superior to Pena.

I was somewhat amused by the clever way David took Carter's 1976 stats
out of the argument.  Sure, Barry Foote batted more times than Carter in
1976.  But Gary Carter was to be the Expos starting catcher that year.
The reason Foote played a lot is because Carter wasn't doing the job.
	The main point, though, is this:

	The issue at hand is `Who is better: Pena or Carter?`, not who
was better years ago.  I can see including the last 2 or 3 years in some
kind of weighted average year, but I don't see why Carter's stats from
almost 10 years ago are important to this discussion.

	Also, David Rubin repeatedly states that the surrounding line-up
does not significantly affect a batter's performance.  How is this measured?
I would like to see a statiscal analysis showing that surrounding line-up
has little or no effect on batter performance.  I find this VERY hard to
believe.  Until I see this, or find time to do it myself, I cannot agree
with that part of the argument (though I agree that Mtl and Pgh were not
significantly different offensively anyway).

Well, I guess I'll sit back now and wait for the Vol I part of the response :-)

						Dave Van Handel
	druWI!Whocaresaboutbaseballanyway-it'sGreenBayPackerseasonnow!djvh

djvh@drutx.UUCP (VanHandelDJ) (08/12/85)

I agree with David Rubin.  Why do you subtract HR's from Runs Produced?
Sure the run would be counted twice.  But the same is done on any other
kind of run-scoring hit (A run is given to the scorer, an RBI to the batter,
even though it only produced 1 run).  
	Therefore, subtracting the HRs doesn't make any sense at all.

Dave Van Handel

djvh@drutx.UUCP (VanHandelDJ) (08/14/85)

> 
> Now, why don't we expand this whole discussion so that others
> can take part, too? After all, no one else has made any
> contributions to this, and we might as well conduct this by
> mail, rather than over the net.
>
	No one else?  I for one have posted twice regarding this
discussion.  Did you ignore these just like you ignored David Rubin's
remarks that didn't agree with yours.
 
> You feel that by computing statistics based upon the raw numbers
> which are available in various publications, you can attain an
> understanding of the inner workings of the game, to the point
> that you can describe the strengths and weaknesses of players,
> discuss strategies, etc. I feel that this is not true, that instead,
> statistics lead to a superficial appreciation of correlation, but
> no definite understanding of cause-and-effect. 
>
	Statistics may not be used to PROVE a point in baseball, but they
sure as hell are better than anything else we can use.  Analysis (as there
has been a lot of it) supports the argument that stats can be used to
understand ALL of the items you list above.
 
> This can be compared to the type of knowlegdge which current "Artificial
> Intelligence" systems employ. Such systems, e.g. MYCIN and its
> descendants, use associational knowledge to correlate stimuli and
> desired responses, e.g., "If symptom A and symptom B are present,
> then prescribe remedy X." This can be effective in many cases. But
> there is a great deal of knowledge which cannot be captured this
> way - the structural knowledge, i.e., WHY A and B tend to respond
> to X. Even adding probabilities to this knowledge does not change
> this limitation.
>
 	As long as stats show which type of player will help the team
win, who cares WHY (analysis of stats makes the WHY obvious).

> In the same way, baseball stats can reveal many correlations, and
> can thus lead to a greater appreciation of the game and its
> intricacies. But the structural knowledge, e.g., "Is player A
> better defensively than player B?","Will player A contribute more
> to my team than B?", is subjective, and hence dependent upon the
> knowledge that a person has acquired about the game. 
>
	Who's to say this is subjective.  Any good manager will base
his decision on the results obtained by past performance.
 
> Also, these baseball pros have access to three sources of info that
> you and I do not:
> 
>      1) They keep charts of every pitch, and every hit and play in
>         the field. Thus, they have MUCH better data to rate the
>         players;

		So they have better stats.  They are still stats, which
	shows the value of using the right statistics for the game.

			       (or do you really think you know more
> than they all do?)
>
	It's possible that David Rubin does know more.  The people that
vote for these awards aren't God; do you honestly feel that the best players
get to the all-star game each year?  You seem to accept the "Authority's"
decisions as law.
 
>                Responses to statistical verbiage:
> 
> > Also, we really don't want to consider years
> > before the principles established themselves in their respective teams
> > starting lineups, so we consider Carter in 1975 and 1977-1984 (in 1975,
> > Carter spent most of his playing time in the outfield (as the starting
> > left fielder, generally) while serving as the #2 catcher; in 1976, he
> > was just the back-up catcher) and Pena from 1981-1984.  
> 
> Why consider any years before 1984? The original question was which
> should start the 1985 all-star game, not which was better in 1975.
> I knew Carter was better than Pena before 1984 even without reading
> any stats :-)
>
	I agree with this.  However, we did need some form to measure ability.
 
> > It is widely recognized that the purpose of the offense is run
> > production, and there are two distinct ways in which a hitter may
> > contribute to it.  The first is to score runs, the second to drive
> > them in.  Thus, traditionally, fans have placed great store in the
> > most obvious measures of that production, runs scored and runs batted
> > in.  Unfortunately, those traditional measures are heavily dependent
> > on circumstances beyond the hitter's control: how well his teammates
> > fare in doing THEIR job.  You can't score if no one drives you in, and
> > you can't drive some one in if no one is on base.  If we are to
> > evaluate individual performance, we must look at statistics that are
> > NOT dependent on the action of anyone save the individual in question.
> 
> EXACTLY!!! If you can find a statistic that is truly independent of
> teammate's contributions, I'd love to see it. All the stats you list
> below (Putouts, %thrown out, DP's, BA, OBA, HR, R, RBI, slugging,
> etc.) are dependent on: teammates' seasons, manager's tactics, place
> in the lineup, ballparks, and others. Apparently one of your favorite
> stats for pitchers is Earned Runs Prevented. You love to post a detailed
> list to the net every so often. This stat is no more independent than
> the old ERA. For example, if a pitcher pitches for a team which scores
> fewer runs for him, then he may be lifted earlier, on the average.
> This leads to fewer innings for him, and a lower score on ERP. This
> effect can also be achieved bu playing for a manager who loves to go
> to the bullpen early (e.g. Chuck Tanner.) These stats can be revealing
> at times (Gooden leads by a large margin no matter how you measure
> things) but using them to make finer distinctions is meaningless.
>
	David Rubin repeatedly stated that no statistic is independent
of other players, only that some stats have less correlation to what the
other players do.  After years of studying stats, it is obvious to me that
slugging pct. and on-base pct. are by far the most useful for discussion
of offensive contribution.  These are not perfect, but are the best we have.
 
> > Thus, we look at on On-Base Percentage (a.k.a.
> > Average) to evaluate how well a player performs this function.  
> 
> You really love this stat. Fine. This is a free country. I think
> it is just another meaningless stat. Hits are better than walks
> any day. Since you love to compute, why not analyze how often runs
> are scored with only walks, versus how often runs are scored with
> only hits? Or how often runs are scored with no hits at all, versus
> how often runs are scored without walks? This is baseball, not the 
> on-base derby.
>
	You don't make any sense here.  Are you saying that a player
cannot do both?  Of course a hit is better than a walk.  But, if two
players each get 600 plate appearances, #1 gets 130 hits and 100 walks,
#2 gets 180 hits and 20 walks, who would you say contributed more?
Player #1 has a BA of .260 while Player #2 has a BA of .310.  So 
Player #2 is clearly better, right?  Assuming power figures are equal,
I would say Player #1 was more valuable, as he reached base 30 more
times in the same 600 plate appearances.
	This is reality, you can get on base either way (many times
both ways in the same inning !!!)

> 
> > Both Carter and Pena batted in the middle of their respective orders
> > (generally fifth for Carter, sixth for Pena), and probably have about
> > 45 such opportunities in a season.  Assuming that the opportunities
> > are uniformly distributed among out counts, 30 of these occurred with
> > none or one out (actually, this probably overestimates the number of
> > such opportunities, as outs accumulate as batters bat, thus implying
> > that more runners are on base, on average, with two out than with none
> > out).  Pena has good speed for a catcher, average for all runners, and
> > would probably advance to third about 33% of the time (choosing the
> > median value from Texas regulars); for Carter, my best guess is 20%
> > (he's not as hopeless on the bases as Sundberg).  The difference,
> > then, is probably about 30/3 - 30/5 = 4.  It does not make up for
> > Pena's negative contribution in his stolen base attempts.
> 
> By your argument, speed is a negligent factor in baseball, at least
> offensively. You apparently feel that breaking up double-plays at
> second base, or avoiding them at first, or taking the extra base,
> or causing an errant throw, are small factors. You like HRs and 
> walks. Well, why not take up this argument some time with a professional
> baseball person (which neither of us is) and tell him that speed is
> negligable offensively? From everything I have heard and read, this is
> not so. Again, we are fans. I trust what I hear from pros more than
> your amateur judgement (or my own). 
>
	Whitey Herzog has been the only manager who has consistently won
with speed in his lineup.  Power is far more valuable, and if you want it
proven, look at the top scoring teams each year.  Also look at the top
(Slugging Pct + OB Pct) teams each year.  You will see that the same teams
are at the top of each list.  Baltimore (late 60's and early 70's) won near
100 games annually, but rarely did they have a .300 hitter.  They did it
with power and the ability to draw walks.  They had a few decent pitchers
as well.
 
						Dave Van Handel
						drutx!djvh

djvh@drutx.UUCP (VanHandelDJ) (09/26/85)

> 
> EVERYBODY knows that lineups are interdependent! Try watching a game
> sometime (instead of just reading numbers). You'll see that when a runner
> is on base, it affects (among other things):
> 
>     1) the way the pitcher throws. Using the stretch instead of a full
>        windup definitely hurts most pitchers' performances. Otherwise, 
>        there would be no need for anyone to ever windup.
> 
>     2) the pitch selection;
> 
>     3) the defensive alignment.
> 
> Thus, if the batter ahead of, say Mattingly, gets on base more often,
> is a threat to steal, and gets in scoring position more often, he
> can (and does) affect whether Mattingly gets a hit or not.
>
	I agree with this.  Lineups are interdependent, and I have no
idea why David Rubin thinks otherwise.
 	As you know, I agree with David Rubin that OBA and SA are the
"best" available stats, but I *DO* realize that they aren't the only
thing we should look at.

> 
>     Thus, we see that some excellent managers, such as Whitey Herzog,
>     deliberately put a player like Coleman, who has a lower OBA and
>     slugging average than McGee, in the spot where he will get the most 
>     at-bats, thus effectively reducing the overall OBA and slugging pct of 
>     his team. Do you really think he is deliberately reducing the run-scoring
>     ability of his team? Or do you just think that all these baseball
>     professionals are sadly misguided? The only other alternative is
>     that TEAM RUN-SCORING ABILITY IS NOT DIRECTLY CORRELATED WITH
>     TEAM OBA OR SLUGGING, i.e., these stats aren't all you crack them
>     up to be. There must be other factors, e.g., speed.
>	
	True - speed does have something to do with it.  I believe that
TEAM RUNS *ARE* DIRECTLY CORELATED WITH TEAM (OBA+SA).  Having speed
in the lineup just helps to get more runs.  The interaction of players as
you describe above serves to maximize the hitting of the team, thus increasing
team (OBA + SA).
	We seem to be arguing, in a sense, about different things.  All I say
is that given the stats we have, (OBA +SA) is the best way we can measure a
hitter's offensive contribution.  I'm sure it is highly dependent on things
such as speed, lineup interactions, etc.  We don't have stats for
things like speed, moving runners over, the effect of Henderson on Mattingly,
etc.  Until we do, any *Statistical* argument must be made with the best stats
available.  Read the books David Rubin speaks of, and you should agree that
the best available stats are (OBA+SA).
	But I do agree that other things (that no one has been able to measure
yet) are important.
	I also think that Carter is better than Pena   :-) :-)
		(I just had to throw that in)

>     To rephrase this point, so that you will have less chance of
>     misinterpreting it, if Guerrero's slugging avg and OBA are what
>     are most important to the Dodgers, then he should bat leadoff,
>     so as to maximize the team's slugging avg and OBA. He doesn't,
>     and the very idea seems preposterous. Either Lasorda doesn't
>     understand the game as you do, or your emphasis on OBA and
>     slugging is wrong. Which is it?
>
	True, if David Rubin's statement of no lineup interaction is true,
then Guerrero should bat leadoff.  I just want you to know that I do not
agree with him on this.  The best possible interaction of players will
serve to maximize team SA + OBA, which (maybe some day I'll attempt to
*Prove* this) in turn will maximize team run production. 
	Perhaps if Guerrero did bat leadoff, his OBA + SA would drop
off tremendously.  David Rubin should realize this.

>> And even if we WERE to consider them, why does Paul believe that Carter
>> has his stats inflated by Hernandez, Strawberry, and Foster when NONE
>> of those three show any substantial increase in production over last
>> year?  
> 
> WRONG. Strawberry is having a much better year than last year. Note that
> Strawberry bats directly behind Carter, just as Mattingly bats directly
> behind Henderson. See below.
> 
	As Carter is not exactly a demon on the basepaths, I fail to see
why his batting AHEAD of Strawberry would benefit Strawberry.  It seems that
the opposite could be true; Carter benefits from having a strong hitter behind
him in the lineup.

>> I suppose Paul believes Carter has a special dispensation: in
>> moving from the Expos to the Mets, he gains by being surrounded by
>> Keith, Darryl, and George, while those three do NOT gain from Gary's
>> presence.  The fact is, the production of all four has remained about
>> the same over the past two years, an argument AGAINST lineup effects.
> 
	From the figures Paul included, I would have to agree that there
is no significant increase in production among the 4 Mets players.  The
only one that seems to be having a better year than he did in 1984 is
Strawberry.  It could be argued that this is due to his youth and continued
development, not to Carter's presence in the lineup.